A complete RAG ecosystem with interactive Chainlit UI, multi-provider LLM support, and specialized context engines.
This repository demonstrates a production-ready Retrieval-Augmented Generation (RAG) system featuring:
- Three specialized context engines for different domains
- Interactive Chainlit UI for a seamless user experience
- Multi-provider LLM support (OpenAI, Anthropic, DeepSeek, Ollama)
- Smart provider fallback and automatic configuration
- Prometheus metrics and comprehensive testing
- Microservices architecture with FastAPI
| Engine | Domain | Use Cases |
|---|---|---|
| Enterprise | WICS framework, company policies | HR questions, policy lookup, process guidance |
| Financial Compliance | Basel III, BACEN, CVM regulations | Regulatory compliance, risk assessment |
| DevOps | SRE practices, troubleshooting | Infrastructure issues, operational guidance |
- `context_engine_core/`: Shared RAG pipeline with LangChain + LangGraph
- `app.py`: Main Chainlit application with UI
- `config.json`: Multi-provider configuration
- `context_quality_monitor/`: Prometheus metrics service
- `context_engineering_testing_suite/`: Comprehensive test suite
- Python 3.8+
- API key for at least one LLM provider
```bash
# Clone and navigate to the project
cd rag-context-engineering-examples

# Install dependencies
pip install -r requirements.txt

# Set your API key (choose one)
export OPENAI_API_KEY=sk-your-openai-key
export ANTHROPIC_API_KEY=sk-ant-your-anthropic-key
export DEEPSEEK_API_KEY=sk-your-deepseek-key
```

**Option A: Easy Launch (Recommended)**

```bash
python launch.py
```

**Option B: Direct Chainlit**

```bash
chainlit run app.py --host 0.0.0.0 --port 8000
```

**Option C: Python execution**

```bash
python app.py
```

Open your browser to: http://localhost:8000
The Chainlit interface will provide:
- Engine selection via settings panel
- Interactive chat with all context engines
- Mobile-friendly responsive design
- Rich markdown formatting
The system automatically selects the best available provider. Configure per-engine preferences in `config.json`:
```json
{
  "engines": {
    "enterprise_context_engine": {
      "llm": {
        "provider": "anthropic",
        "anthropic": {
          "model_name": "claude-3-sonnet-20240229",
          "temperature": 0.1
        }
      }
    },
    "financial_compliance_context_engine": {
      "llm": {
        "provider": "openai",
        "openai": {
          "model_name": "gpt-4",
          "temperature": 0
        }
      }
    }
  }
}
```

| Provider | Setup | Models | Best For |
|---|---|---|---|
| OpenAI | `export OPENAI_API_KEY=sk-...` | GPT-3.5, GPT-4 | Balanced performance |
| Anthropic | `export ANTHROPIC_API_KEY=sk-ant-...` | Claude-3 Sonnet/Opus | Complex reasoning |
| DeepSeek | `export DEEPSEEK_API_KEY=sk-...` | DeepSeek Chat/Coder | Cost-effective |
| Ollama | Local installation | Llama2, Mistral | Privacy/offline |
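The fallback behaviour can be sketched roughly as follows. This is an illustrative stand-in, not the project's actual code: the environment-variable names come from the table above, but `pick_provider` and the preference order are assumptions.

```python
import os

# Illustrative sketch of provider fallback: try the engine's preferred
# provider first, then any other provider whose API key is present,
# and finally fall back to a local Ollama instance (no key required).
# `pick_provider` is a hypothetical helper, not part of the project.
PROVIDER_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "deepseek": "DEEPSEEK_API_KEY",
}

def pick_provider(preferred=None):
    candidates = [preferred] if preferred in PROVIDER_KEYS else []
    candidates += [p for p in PROVIDER_KEYS if p != preferred]
    for name in candidates:
        if os.environ.get(PROVIDER_KEYS[name]):
            return name
    return "ollama"  # local fallback needs no API key
```

With no keys set this returns `"ollama"`; setting exactly one key makes that provider win regardless of preference.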
```bash
# Full test suite
cd context_engineering_testing_suite
pytest tests/ -v

# Single test
pytest tests/test_rag_flow.py::test_query_returns_string -v

# Test with coverage
pytest tests/ --cov=context_engine_core --cov-report=html
```

```bash
# Run individual examples
cd examples/
python quick_demo.py              # Basic RAG demonstration
python multi_provider_testing.py  # Provider comparison
python provider_demo.py           # Provider switching demo
```

```bash
# Start individual engines as APIs
uvicorn enterprise_context_engine.src.enterprise_context_engine.api:app --reload --port 8001
uvicorn financial_compliance_context_engine.src.financial_compliance_context_engine.api:app --reload --port 8002
uvicorn devops_context_engine.src.devops_context_engine.api:app --reload --port 8003

# Start metrics service
uvicorn context_quality_monitor.src.context_quality_monitor.api:app --reload --port 8004
```

**Q:** "What is WICS?"
**A:** "WICS (Work Integration and Coordination System) is a framework for..."

**Q:** "What is the remote work policy?"
**A:** "The remote work policy allows for flexible arrangements..."

**Q:** "What are Basel III requirements?"
**A:** "Basel III introduces enhanced capital requirements including..."

**Q:** "Tell me about BACEN regulations"
**A:** "BACEN (Central Bank of Brazil) regulations cover..."

**Q:** "How do I troubleshoot high CPU usage?"
**A:** "For high CPU usage troubleshooting, follow these steps..."

**Q:** "What are SRE best practices?"
**A:** "Site Reliability Engineering best practices include..."
```mermaid
graph TD
    A[User Query] --> B[Context Engine]
    B --> C[Document Retrieval]
    C --> D[FAISS Vector Search]
    D --> E[Relevant Documents]
    E --> F[LLM Provider]
    F --> G[Generated Response]
    G --> H[Chainlit UI]
```
- Retrieve: Query vector store for relevant documents
- Generate: Use LLM to synthesize answer from context
- Return: Format response for UI display
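The three steps above can be sketched end to end. In this minimal, illustrative version, naive keyword overlap stands in for FAISS vector search and `generate` is a stub for the provider call; neither is the project's implementation.

```python
def retrieve(query, docs, k=2):
    """Retrieve: rank documents by shared words (stand-in for vector search)."""
    q = set(query.lower().split())
    ranked = sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)
    return ranked[:k]

def generate(query, context):
    """Generate: stub for the LLM call that synthesizes an answer from context."""
    return f"Answer to {query!r} based on: {context[0]}"

def answer(query, docs):
    context = retrieve(query, docs)      # Retrieve
    response = generate(query, context)  # Generate
    return response                      # Return (formatted for the UI)

docs = [
    "The remote work policy allows flexible arrangements.",
    "Basel III introduces enhanced capital requirements.",
]
print(answer("What is the remote work policy?", docs))
```

A real engine swaps `retrieve` for a FAISS similarity search and `generate` for a prompt to the configured provider; the control flow stays the same.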
- `BaseContextEngine`: Core RAG logic shared across all engines
- `LLMProviders`: Abstraction layer for multiple AI providers
- Chainlit App: User interface with engine switching
- FastAPI Services: Alternative REST API access
- Prometheus: Metrics collection and monitoring
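For intuition, a provider abstraction like `LLMProviders` can be thought of as a registry mapping provider names to completion callables. This toy `ProviderRegistry` is an assumption about the shape of that layer, not its actual code:

```python
from typing import Callable, Dict

class ProviderRegistry:
    """Toy provider abstraction: register callables, dispatch by name."""

    def __init__(self):
        self._providers: Dict[str, Callable[[str], str]] = {}

    def register(self, name: str, complete: Callable[[str], str]) -> None:
        self._providers[name] = complete

    def complete(self, name: str, prompt: str) -> str:
        if name not in self._providers:
            raise KeyError(f"unknown provider: {name}")
        return self._providers[name](prompt)

registry = ProviderRegistry()
registry.register("echo", lambda prompt: f"echo: {prompt}")
print(registry.complete("echo", "hello"))
```

Because callers only pass a provider name, switching between OpenAI, Anthropic, DeepSeek, or Ollama changes configuration rather than code.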
Access metrics at: http://localhost:8004/metrics
Available metrics:
- `rag_queries_total`: Total queries per engine
- `rag_query_duration_seconds`: Query processing time
- `rag_errors_total`: Error count by type
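As a rough sketch, these metrics could be declared with the `prometheus_client` library like so; the metric names match the list above, but the label sets are assumptions:

```python
from prometheus_client import Counter, Histogram, generate_latest

# Metric names mirror the list above; label names are illustrative guesses.
RAG_QUERIES = Counter("rag_queries_total", "Total queries per engine", ["engine"])
RAG_LATENCY = Histogram("rag_query_duration_seconds", "Query processing time", ["engine"])
RAG_ERRORS = Counter("rag_errors_total", "Error count by type", ["error_type"])

RAG_QUERIES.labels(engine="enterprise").inc()
with RAG_LATENCY.labels(engine="enterprise").time():
    pass  # the engine's query would run here

# generate_latest() renders the text exposition format served at /metrics
print(generate_latest().decode()[:60])
```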
- UI Health: http://localhost:8000
- API Health: http://localhost:8001/health (per service)
- Metrics Health: http://localhost:8004/health
1. **Create Engine Directory**

   ```bash
   mkdir -p new_context_engine/src/new_context_engine/
   ```

2. **Implement Engine Class**

   ```python
   from context_engine_core.base_engine import BaseContextEngine

   class NewContextEngine(BaseContextEngine):
       def _default_docs(self):
           return ["Your domain-specific documents here"]
   ```

3. **Add FastAPI Wrapper**

   ```python
   from fastapi import FastAPI

   app = FastAPI()

   @app.post("/query")
   async def query_endpoint(query: str):
       engine = NewContextEngine()
       return {"response": engine.query(query)}
   ```

4. **Update Configuration**

   ```json
   {
     "engines": {
       "new_context_engine": {
         "llm": {"provider": "openai"}
       }
     }
   }
   ```

5. **Integrate with UI**

   ```python
   # Add to the engines dictionary in app.py
   engines["new_engine"] = NewContextEngine()
   ```
```bash
# Linting
flake8 context_engine_core/ --max-line-length=100

# Type checking
mypy context_engine_core/

# Security scan
bandit -r context_engine_core/
```

**"No module named 'context_engine_core'"**

```bash
# Ensure you're in the project root
cd rag-context-engineering-examples
python app.py
```

**"API key not found"**

```bash
# Set your API key
export OPENAI_API_KEY=your-key-here
python launch.py  # Will check API keys
```

**"Chainlit not found"**

```bash
# Install Chainlit (quote so the shell doesn't treat >= as a redirect)
pip install "chainlit>=1.0.0"
```

**"Port already in use"**

```bash
# Use a different port
chainlit run app.py --port 8001
```

**Enable verbose logging**

```bash
export LANGCHAIN_VERBOSE=true
export LANGCHAIN_TRACING=true
python app.py
```

This system demonstrates key RAG concepts:
- Context Engineering: Beyond prompt engineering to systematic context management
- Multi-Domain RAG: Different engines for different knowledge domains
- Provider Abstraction: Flexible LLM provider switching
- Production Patterns: Monitoring, testing, configuration management
- User Experience: Interactive UI with real-time engine switching
- Start with Examples: Run `python examples/quick_demo.py`
- Explore UI: Use the Chainlit interface to understand the user experience
- Study Architecture: Review `context_engine_core/base_engine.py`
- Extend System: Add your own context engine
- Production Deploy: Scale with Docker/Kubernetes
- Fork the repository
- Create a feature branch: `git checkout -b feature/amazing-feature`
- Run tests: `pytest tests/`
- Submit a pull request
This project is licensed under the MIT License - see the LICENSE file for details.
- Documentation: Check the inline code documentation
- Issues: Open GitHub issues for bugs
- Features: Request enhancements via GitHub
- Contact: For educational/commercial inquiries
Built with ❤️ for the RAG community. Designed for learning, built for production.