DevOps SRE Agent

Autonomous debugging and troubleshooting agent for production-grade systems from DevOps, SRE, and Operations perspective.

Architecture

There are two separate architecture options and execution this application supports:

OPTION A - Claude Code Mode for System Admins: Project-specific Claude Subagents and Agent SKILLS along with Default Claude Tools for SKILLS. These are organized under .claude folder (as per default project-specific agents).
OPTION B - LangGraph-based Application: Structured Custom Developed LangGraph-framework based Agents with below Multi-agent architecture pattern with an Orchestrator Agent coordinating specialized sub-agents:
- Observability Agent: Queries Grafana for metrics, dashboards, and alerts
- Infrastructure Agent: Queries Kubernetes for cluster state, pods, services, and logs
- Knowledge Search Agent: Queries Stackoverflow for troubleshooting knowledge
- Incident Management: ServiceNow integration for incident tracking
- Code Management: GitHub integration for issues and PRs

Technology Stack

LangGraph DeepAgent: Multi-agent orchestration, Deep Agent is n Agent harness built using LangChain as framework (for tools, model access) and LangGraph as Runtime (for checkpoints, memory)
LiteLLM: Unified LLM provider access
LangSmith: Observability and tracing
MCP Servers (via Docker Hub):
- Kubernetes MCP Server
- Grafana MCP Server
- Stackoverflow MCP Server
- ServiceNow MCP Server
- GitHub MCP Server

Quick Start

Install dependencies: uv sync
Configure environment variables (see .env.example)
Deploy MCP servers separately
Run agent: python main.py

Detailed Implementation

See IMPLEMENTATION_PLAN.md for complete implementation details.

References

LangGraph DeepAgent Quickstarts

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.claude		.claude
.langgraph_api		.langgraph_api
devops_sre_agent		devops_sre_agent
docs		docs
.gitignore		.gitignore
.python-version		.python-version
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
IMPLEMENTATION_PLAN.md		IMPLEMENTATION_PLAN.md
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
langgraph.json		langgraph.json
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DevOps SRE Agent

Architecture

Technology Stack

Quick Start

Detailed Implementation

References

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ankurkumarz/devops-sre-deepagent

Folders and files

Latest commit

History

Repository files navigation

DevOps SRE Agent

Architecture

Technology Stack

Quick Start

Detailed Implementation

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages