Building a coding agent

Table of contents

This tutorial teaches you how to build a coding agent that can help with software development tasks. You'll start with a basic agent and progressively add capabilities until you have a production-ready assistant that can read code, make changes, run tests, and even look up documentation.

By the end, you'll understand how to structure agent instructions, configure tools, and compose multiple agents for complex workflows.

What you'll build

A coding agent that can:

Read and modify files in your project
Run commands like tests and linters
Follow a structured development workflow
Look up documentation when needed
Track progress through multi-step tasks

What you'll learn

How to configure cagent agents in YAML
How to give agents access to tools (filesystem, shell, etc.)
How to write effective agent instructions
How to compose multiple agents for specialized tasks
How to adapt agents for your own projects

Prerequisites

Before starting, you need:

cagent installed - See the installation guide
API key configured - Set ANTHROPIC_API_KEY or OPENAI_API_KEY in your environment. Get keys from Anthropic or OpenAI
A project to work with - Any codebase where you want agent assistance

Creating your first agent

A cagent agent is defined in a YAML configuration file. The minimal agent needs just a model and instructions that define its purpose.

Create a file named agents.yml:

agents:
  root:
    model: openai/gpt-5
    description: A basic coding assistant
    instruction: |
      You are a helpful coding assistant.
      Help me write and understand code.

Run your agent:

$ cagent run agents.yml

Try asking it: "How do I read a file in Python?"

The agent can answer coding questions, but it can't see your files or run commands yet. To make it useful for real development work, it needs access to tools.

Adding tools

A coding agent needs to interact with your project files and run commands. You enable these capabilities by adding toolsets.

Update agents.yml to add filesystem and shell access:

agents:
  root:
    model: openai/gpt-5
    description: A coding assistant with filesystem access
    instruction: |
      You are a helpful coding assistant.
      You can read and write files to help me develop software.
      Always check if code works before finishing a task.
    toolsets:
      - type: filesystem
      - type: shell

Run the updated agent and try: "Read the README.md file and summarize it."

Your agent can now:

Read and write files in the current directory
Execute shell commands
Explore your project structure

Note
directory. The agent will request permission if it needs to access other directories.

The agent can now interact with your code, but its behavior is still generic. Next, you'll teach it how to work effectively.

Structuring agent instructions

Generic instructions produce generic results. For production use, you want your agent to follow a specific workflow and understand your project's conventions.

Update your agent with structured instructions. This example shows a Go development agent, but you can adapt the pattern for any language:

agents:
  root:
    model: anthropic/claude-sonnet-4-5
    description: Expert Go developer
    instruction: |
      Your goal is to help with code-related tasks by examining, modifying,
      and validating code changes.

      <TASK>
          # Workflow:
          # 1. Analyze: Understand requirements and identify relevant code.
          # 2. Examine: Search for files, analyze structure and dependencies.
          # 3. Modify: Make changes following best practices.
          # 4. Validate: Run linters/tests. If issues found, return to Modify.
      </TASK>

      Constraints:
      - Be thorough in examination before making changes
      - Always validate changes before considering the task complete
      - Write code to files, don't show it in chat

      ## Development Workflow
      - `go build ./...` - Build the application
      - `go test ./...` - Run tests
      - `golangci-lint run` - Check code quality

    add_date: true
    add_environment_info: true
    toolsets:
      - type: filesystem
      - type: shell
      - type: todo

Try asking: "Add error handling to the parseConfig function in main.go"

The structured instructions give your agent:

A clear workflow to follow (analyze, examine, modify, validate)
Project-specific commands to run
Constraints that prevent common mistakes
Context about the environment (add_date and add_environment_info)

The todo toolset helps the agent track progress through multi-step tasks. When you ask for complex changes, the agent will break down the work and update its progress as it goes.

Composing multiple agents

Complex tasks often benefit from specialized agents. You can add sub-agents that handle specific responsibilities, like researching documentation while your main agent stays focused on coding.

Add a librarian agent that can search for documentation:

agents:
  root:
    model: anthropic/claude-sonnet-4-5
    description: Expert Go developer
    instruction: |
      Your goal is to help with code-related tasks by examining, modifying,
      and validating code changes.

      When you need to look up documentation or research how something works,
      ask the librarian agent.

      (rest of instructions from previous section...)
    toolsets:
      - type: filesystem
      - type: shell
      - type: todo
    sub_agents:
      - librarian

  librarian:
    model: anthropic/claude-haiku-4-5
    description: Documentation researcher
    instruction: |
      You are the librarian. Your job is to find relevant documentation,
      articles, or resources to help the developer agent.

      Search the internet and fetch web pages as needed.
    toolsets:
      - type: mcp
        ref: docker:duckduckgo
      - type: fetch

Try asking: "How do I use context.Context in Go? Then add it to my server code."

Your main agent will delegate the research to the librarian, then use that information to modify your code. This keeps the main agent's context focused on the coding task while still having access to up-to-date documentation.

Using a smaller, faster model (Haiku) for the librarian saves costs since documentation lookup doesn't need the same reasoning depth as code changes.

Adapting for your project

Now that you understand the core concepts, adapt the agent for your specific project:

Update the development commands

Replace the Go commands with your project's workflow:

## Development Workflow
- `npm test` - Run tests
- `npm run lint` - Check code quality
- `npm run build` - Build the application

Add project-specific constraints

If your agent keeps making the same mistakes, add explicit constraints:

Constraints:
  - Always run tests before considering a task complete
  - Follow the existing code style in src/ directories
  - Never modify files in the generated/ directory
  - Use TypeScript strict mode for new files

Choose the right models

For coding tasks, use reasoning-focused models:

anthropic/claude-sonnet-4-5 - Strong reasoning, good for complex code
openai/gpt-5 - Fast, good general coding ability

For auxiliary tasks like documentation lookup, smaller models work well:

anthropic/claude-haiku-4-5 - Fast and cost-effective
openai/gpt-5-mini - Good for simple tasks

Iterate based on usage

The best way to improve your agent is to use it. When you notice issues:

Add specific instructions to prevent the problem
Update constraints to guide behavior
Add relevant commands to the development workflow
Consider adding specialized sub-agents for complex areas

What you learned

You now know how to:

Create a basic cagent configuration
Add tools to enable agent capabilities
Write structured instructions for consistent behavior
Compose multiple agents for specialized tasks
Adapt agents for different programming languages and workflows

Next steps

Learn best practices for handling large outputs, structuring agent teams, and optimizing performance
Integrate cagent with your editor or use agents as tools in MCP clients
Review the Configuration reference for all available options
Explore the Tools reference to see what capabilities you can enable
Check out example configurations for different use cases
See the full golang_developer.yaml that the Docker team uses to develop cagent

Ask me about Docker

Building a coding agent