🌏 中文版
Starting in late 2025, one term has been showing up more and more in engineering conversations: Agentic AI.
It is not a more accurate version of autocomplete for code suggestions — it is an entirely different way of working. The AI receives a goal, autonomously plans steps, executes tools, handles errors, and delivers a result. Stripe merges 1,300 PRs per week with it. Spotify’s top engineers have not written a single line of code by hand since last December.
This article aims to answer a practical question: If you want to integrate Agentic AI into your daily development workflow, where do you start?
First, Let’s Clarify: How Agentic AI Differs from Conventional AI Tools
Conventional AI tools (Copilot autocomplete, ChatGPT Q&A) are reactive — you ask, they answer, done.
Agentic AI is proactive — you give it a goal, and it will:
- Plan the steps needed to achieve the goal
- Call tools (read/write files, run tests, query APIs)
- Adjust the next step based on results
- Self-recover from errors instead of handing them back to you
This difference determines how it can be embedded into the SDLC.
The Five SDLC Phases x Agentic AI
1. Planning & Design
What agents can do:
- Read a PRD or a description and automatically break it down into user stories and technical tasks
- Generate acceptance test drafts based on the existing codebase
- Record Architecture Decision Records (ADRs)
Entry point: Create a prompt in Linear or GitHub Issues and let Claude turn vague requirements into concrete technical tasks. The key is to write a spec first — the strongest consensus best practice in the industry right now is spec-driven development: define what to build, then let the agent build it.
2. Coding
What agents can do:
- Receive a task -> understand the codebase -> write code -> run tests -> fix errors -> commit
- Security vulnerability scanning, code review, adding type annotations
- Cross-file refactoring and migrations
Entry point: Claude Code CLI is the starting point for this phase. Give it a task description and it will autonomously execute within your repo. No infrastructure setup needed — install the CLI and start.
3. Testing
What agents can do:
- Analyze the diff scope -> automatically add corresponding unit tests and integration tests
- Generate E2E test scripts
- Generate test data
Entry point: Turn “add tests for this PR” into a CI pipeline step. The agent reads the diff, finds untested paths, writes tests, and commits.
4. Deploy & Security
What agents can do:
- Detect CI failures -> analyze errors -> attempt fixes -> re-push
- IaC code generation and validation
- PR review: read diffs and leave meaningful comments
Entry point: GitHub Agentic Workflows (currently in technical preview) let you describe a workflow in Markdown, running Claude Code or Copilot as the coding agent engine underneath, triggered by GitHub Actions.
5. Ops & Monitoring
What agents can do:
- Subscribe to alerts -> determine root cause -> open an issue or attempt a hotfix directly
- Bottleneck analysis and performance recommendations
- Generate onboarding documentation for new team members
Entry point: This phase has the highest barrier to entry. You typically need to master the first four phases before considering an on-call agent.
Getting Started: Practical Recommendations
From lowest to highest complexity:
| Level | Approach |
|---|---|
| Beginner | Claude Code CLI for single tasks (build features, fix bugs, add tests) |
| Intermediate | Set up Hooks so the agent automatically runs checks on commit/push |
| Automated | GitHub Agentic Workflows: defined in Markdown, triggered by events |
| Full pipeline | Claude API + Agent SDK wired to GitHub Webhooks, multi-agent division of labor |
Most teams can feel the difference starting from the Beginner level. You do not need to wait until you are “ready” to begin.
Three Key Design Principles
From the real-world deployments of leading Silicon Valley companies (Stripe, Ramp, Coinbase, Spotify), three common principles emerge:
1. Spec first Write the spec, then let the agent execute. Giving the agent a clear description of the end state is far more effective than giving it step-by-step instructions.
2. Sandbox isolation Each agent task runs in an isolated environment — no access to production, no access to the internet. The blast radius must be kept within an acceptable range.
3. Where is the human-in-the-loop? High-risk operations (force push, production deploy) retain manual confirmation. Which operations can be auto-merged and which require review must be defined in advance — do not let the agent decide on its own.
References
- GitHub Agentic Workflows Official Introduction
- GitHub Next: Agentic Workflows Project
- How to build reliable AI workflows with agentic primitives (GitHub Blog)
- The New SDLC: A Practical Guide to Agentic Engineering
- Agentic SDLC: The AI-Powered Blueprint Transforming Software Development
- Modernizing the SDLC process with Agentic AI (Microsoft / Medium)
- Agentic Coding Best Practices (Blink)
- AI-Driven SDLC: Build Secure, Scalable Software with AI
- Securing the Agentic Development Lifecycle (Cycode)
- An AI-led SDLC with Azure and GitHub (Microsoft Community Hub)
- Top 10 Agentic AI Repos in 2025 (ODSC)
Loading...