Multi-Agent Graduate-Level Research Platform

A sophisticated AI-powered research platform that orchestrates multiple specialized agents to conduct comprehensive, graduate-level research on any given topic.

Features

Multi-Agent System with specialized agents for different research tasks
LangGraph Orchestration for sophisticated workflow coordination
Google Gemini Integration for advanced AI capabilities
Temporal Workflows for robust distributed task execution
Docker & Kubernetes Ready for scalable deployment
Comprehensive CLI for complete API interaction
Real-time Progress Tracking with WebSocket support
MCP Protocol Support for tool integration

Quick Start

Prerequisites

Python 3.11+
Docker & Docker Compose
uv package manager

Installation

Clone the repository:

git clone https://github.com/jsogarro/cerebro.git
cd cerebro

Install dependencies:

uv pip install -e ".[dev]"

Set up environment:

cp .env.example .env
cp .env.cli.example .env.cli
# Edit .env files with your configuration

Start services:

# Using Docker Compose
docker-compose up -d

# Or start API server directly
uvicorn src.api.main:app --host 0.0.0.0 --port 8000

Verify installation:

research-cli health

CLI Documentation

The Research Platform CLI (research-cli) provides a comprehensive command-line interface for interacting with the Research Platform API. It supports multiple output formats, interactive modes, and batch operations.

For full documentation on configuration, commands, and scriptable use cases, please see the CLI Documentation Guide.

API Documentation

Base URL

http://localhost:8000

Endpoints

Health & Status

Endpoint	Method	Description
`/health`	GET	Basic health check
`/ready`	GET	Readiness check with service status
`/live`	GET	Liveness check
`/metrics`	GET	Prometheus metrics

Research Projects

Endpoint	Method	Description
`/api/v1/research/projects`	POST	Create new research project
`/api/v1/research/projects/{id}`	GET	Get project details
`/api/v1/research/projects`	GET	List all projects
`/api/v1/research/projects/{id}/progress`	GET	Get project progress
`/api/v1/research/projects/{id}/cancel`	POST	Cancel project
`/api/v1/research/projects/{id}/refine`	POST	Refine project scope
`/api/v1/research/projects/{id}/results`	GET	Get project results

Core Services

Endpoint	Method	Description
`/api/routes/memory`	*	Memory & Context Management
`/api/routes/qa`	*	Quality Assurance & Evaluation Suite
`/api/routes/improvement`	*	Self-Improving Agent System
`/api/routes/benchmarks`	*	Research Replication & Benchmarking
`/api/routes/costs`	*	Cost Management & Budgeting

Request/Response Examples

Create Project

POST /api/v1/research/projects
Content-Type: application/json

{
  "title": "AI Impact Research",
  "query": {
    "text": "What are the impacts of AI on society?",
    "domains": ["AI", "Ethics", "Sociology"],
    "depth_level": "comprehensive"
  },
  "user_id": "researcher-001",
  "scope": {
    "max_sources": 100,
    "languages": ["en", "es"]
  }
}

Response

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "title": "AI Impact Research",
  "status": "pending",
  "created_at": "2024-01-15T10:30:00Z",
  "query": {...},
  "scope": {...}
}

Development

Project Structure

research-platform/
├── src/
│   ├── agents/           # Agent implementations
│   ├── api/              # FastAPI application
│   ├── benchmarks/       # Replication & Benchmark classes
│   ├── cli/              # CLI implementation
│   ├── core/             # Core business logic
│   ├── costs/            # Cost tracking & optimization
│   ├── improvement/      # RLHF & Auto-optimization
│   ├── mcp/              # MCP protocol servers
│   ├── memory/           # Context & Memory management
│   ├── models/           # Data models
│   ├── orchestration/    # LangGraph workflows
│   ├── qa/               # Quality Assurance & Evaluation
│   ├── services/         # Service layer
│   └── temporal/         # Temporal workflows
├── tests/                # Test files
├── docker/               # Docker configurations
├── k8s/                  # Kubernetes manifests
├── helm/                 # Helm charts
├── examples/             # Example files
└── docs/                 # Documentation

Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=src --cov-report=html

# Run specific test file
pytest tests/test_cli.py -v

# Run tests in watch mode
pytest-watch

Code Quality

# Format code
black src tests

# Lint code
ruff check src tests

# Type checking
mypy src

# All quality checks
make quality

Local Development

Set up pre-commit hooks:

pre-commit install

Run API locally:

uvicorn src.api.main:app --reload --port 8000

Run with Docker:

docker-compose up

Access services:

API: http://localhost:8000
API Docs: http://localhost:8000/docs
Temporal UI: http://localhost:8080
pgAdmin: http://localhost:5050 (with --profile dev-tools)

Deployment

Docker Deployment

Build and run with Docker:

# Build images
docker build -t research-platform-api .
docker build -f docker/Dockerfile.worker -t research-platform-worker .

# Run with Docker Compose
docker-compose up -d

# View logs
docker-compose logs -f api worker

Kubernetes (GKE) Deployment

Build and push images:

export PROJECT_ID=your-gcp-project
docker build -t gcr.io/$PROJECT_ID/research-platform-api:latest .
docker push gcr.io/$PROJECT_ID/research-platform-api:latest

Deploy to GKE:

# Create cluster
gcloud container clusters create research-platform \
  --num-nodes=3 \
  --zone=us-central1-a

# Apply manifests
kubectl apply -k k8s/

# Or use Helm
helm install research-platform helm/research-platform/

Monitor deployment:

kubectl get pods -n research-platform
kubectl logs -f deployment/research-api -n research-platform

Environment Variables

Key configuration variables:

Variable	Description	Default
`GEMINI_API_KEY`	Google Gemini API key	Required
`DATABASE_URL`	PostgreSQL connection string	Required
`REDIS_URL`	Redis connection string	Required
`TEMPORAL_HOST`	Temporal server address	localhost:7233
`ENVIRONMENT`	Deployment environment	development
`LOG_LEVEL`	Logging level	INFO

Architecture

System Overview

graph TB
    subgraph Clients
        CLI[research-cli]
        Web[Web Dashboard]
        WS[WebSocket Clients]
    end

    subgraph API["API Layer (FastAPI)"]
        REST[REST Endpoints]
        WSS[WebSocket Server]
        QueryAPI[Query API]
        AgentAPI[Agent API]
        MASRAPI[MASR API]
        TalkHierAPI[TalkHier API]
    end

    subgraph Routing["Intelligence Routing"]
        MASR[MASR Router]
        CostOpt[Cost Optimization Engine]
    end

    subgraph Orchestration["Orchestration (LangGraph)"]
        Graph[Graph Builder]
        State[State Management]
        Supervisors[Hierarchical Supervisors]
        TalkHier[TalkHier Protocol]
    end

    subgraph Agents["Specialized Agents"]
        LitReview[Literature Review]
        CompAnalysis[Comparative Analysis]
        Methodology[Methodology]
        Synthesis[Synthesis]
        Citation[Citation & Verification]
    end

    subgraph Services["Support Services"]
        Memory[Memory System]
        QA[QA & Evaluation]
        Benchmarks[Benchmarks]
        Improvement[Self-Improvement]
        Costs[Cost Management]
    end

    subgraph Data["Data Layer"]
        PG[(PostgreSQL)]
        Redis[(Redis)]
        VectorDB[(Vector DB)]
    end

    subgraph External["External Integrations"]
        Gemini[Google Gemini]
        MCP[MCP Tool Servers]
        AcademicDB[Academic Databases]
    end

    CLI --> REST
    Web --> REST
    WS --> WSS

    REST --> QueryAPI
    REST --> AgentAPI
    REST --> MASRAPI
    REST --> TalkHierAPI

    QueryAPI --> MASR
    MASR --> CostOpt
    MASR --> Supervisors

    Supervisors --> TalkHier
    Supervisors --> Graph
    Graph --> State

    Supervisors --> LitReview
    Supervisors --> CompAnalysis
    Supervisors --> Methodology
    Supervisors --> Synthesis
    Supervisors --> Citation

    LitReview --> Gemini
    CompAnalysis --> Gemini
    Methodology --> Gemini
    Synthesis --> Gemini
    Citation --> Gemini

    LitReview --> MCP
    Citation --> AcademicDB

    Agents --> Memory
    Agents --> QA
    Agents --> Costs

    Memory --> PG
    Memory --> Redis
    QA --> PG
    Costs --> PG
    LitReview --> VectorDB

Agent Framework

graph LR
    subgraph Input
        Query[User Query]
    end

    subgraph MASR["MASR Router"]
        Classify[Classify Query]
        Strategy[Select Strategy]
        CostEst[Estimate Cost]
    end

    subgraph Strategies
        QF[Quality Focused]
        CE[Cost Efficient]
        BAL[Balanced]
    end

    subgraph Supervisor["Hierarchical Supervisor"]
        Plan[Plan Execution]
        Assign[Assign Workers]
        Refine[TalkHier Refinement]
        Consensus[Build Consensus]
    end

    subgraph Workers["Agent Workers"]
        direction TB
        W1[Literature Review]
        W2[Comparative Analysis]
        W3[Methodology]
        W4[Synthesis]
        W5[Citation Verification]
    end

    subgraph QA["Quality Gate"]
        FactCheck[Fact Extraction]
        CitVerify[Citation Verification]
        Plagiarism[Plagiarism Detection]
        Score[Quality Score]
    end

    subgraph Output
        Result[Research Result]
        Feedback[Feedback Loop]
    end

    Query --> Classify
    Classify --> Strategy
    Strategy --> CostEst
    CostEst --> QF & CE & BAL

    QF & CE & BAL --> Plan
    Plan --> Assign
    Assign --> W1 & W2 & W3 & W4 & W5
    W1 & W2 & W3 & W4 & W5 --> Refine
    Refine --> Consensus

    Consensus --> FactCheck
    FactCheck --> CitVerify
    CitVerify --> Plagiarism
    Plagiarism --> Score

    Score -->|Pass| Result
    Score -->|Fail| Refine
    Result --> Feedback
    Feedback --> MASR

Technology Stack

Language: Python 3.11+
API Framework: FastAPI
CLI Framework: Click + Rich
LLM: Google Gemini
Orchestration: LangGraph
Database: PostgreSQL + Redis
Container: Docker
Deployment: Kubernetes (GKE)
Package Management: uv

Contributing

Development Workflow

Fork the repository
Create a feature branch
Follow TDD principles - write tests first
Ensure all tests pass
Update documentation
Submit a pull request

Code Standards

Follow PEP 8 style guide
Use type hints
Write docstrings for all public functions
Maintain >80% test coverage
Use semantic commit messages

Commit Message Format

type(scope): description

[optional body]

[optional footer]

Types: feat, fix, docs, style, refactor, test, chore

Example:

feat(cli): add interactive mode for project creation

- Add prompts for all required fields
- Support scope configuration
- Add validation for user inputs

Closes #123

License

[Your License Here]

Acknowledgments

Built with FastAPI, LangGraph, and Temporal
Uses Google Gemini for AI capabilities
Implements Anthropic's MCP protocol for tool integration
CLI powered by Click and Rich

Support

GitHub Issues: Report bugs or request features
Documentation: Full documentation
Email: support@research-platform.ai

Roadmap

Phase 1 (Complete)

[x]Core platform & basic API setup
[x]CLI tool & documentation extraction
[x]Docker containerization & K8s manifests
[x]Advanced Memory & Context Management
[x]Quality Assurance & Evaluation Suite
[x]Self-Improving Agent System infrastructure
[x]Research Replication & Benchmarking
[x]Cost Management & Budgeting

Phase 2 (In Progress)

[ ]Temporal workflow implementation
[ ]Gemini integration
[ ]Agent implementations
[ ]LangGraph orchestration
[ ]Cross-domain research

Phase 3 (Planned)

[ ]MCP tool servers
[ ]WebSocket real-time updates
[ ]Advanced report generation
[ ]Authentication & Large-scale deployment

Phase 4 (Deferred)

[ ]Collaborative research features
[ ]Agent Marketplace & Plugin System
[ ]Visual Workflow Builder

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github		.github
alembic		alembic
cerebro/web		cerebro/web
config		config
configs/models		configs/models
docker		docker
docs		docs
examples		examples
k8s		k8s
landing_page		landing_page
migrations/versions		migrations/versions
scripts		scripts
src		src
tests		tests
.env.cli.example		.env.cli.example
.env.example		.env.example
.env.production.example		.env.production.example
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
alembic.ini		alembic.ini
docker-compose.production.yml		docker-compose.production.yml
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
start.sh		start.sh
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent Graduate-Level Research Platform

Features

Table of Contents

Quick Start

Prerequisites

Installation

CLI Documentation

API Documentation

Base URL

Endpoints

Health & Status

Research Projects

Core Services

Request/Response Examples

Create Project

Response

Development

Project Structure

Running Tests

Code Quality

Local Development

Deployment

Docker Deployment

Kubernetes (GKE) Deployment

Environment Variables

Architecture

System Overview

Agent Framework

Technology Stack

Contributing

Development Workflow

Code Standards

Commit Message Format

License

Acknowledgments

Support

Roadmap

Phase 1 (Complete)

Phase 2 (In Progress)

Phase 3 (Planned)

Phase 4 (Deferred)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages