AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories
-
Updated
Mar 25, 2026 - Python
AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories
Open-source self-hosted web tool for evaluating Agent Skills with rubric scores, Deep Review, and improvement suggestions.
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
Reward model engineering harness for evolutionary rubric search, deployable RM artifacts, online scoring, and RL experiment lineage.
A Claude Code skill that adds a rubric-based eval layer to any agent project. Framework-agnostic — generates rubric, test cases, judge prompt, and harness. Returns a weighted score plus a judge-leniency signal.
Export grades from assignment using advanced grading methods in excel format
Rubric-driven AI homework grading system built as a Claude Code Skill. Score student submissions with CoT reasoning, bias mitigation, and PDCA quality cycle.
Bilingual Codex skills for source-grounded academic writing, conservative audit, format-preserving revision, paper-to-PPT delivery, and final submission cleanup.
Context-compensation scaffold for LLM evaluation prompts — disclose, gate on evidence, hedge on thin
AskBench: LLM question-asking/clarification benchmark & dataset with evaluation and training code (paper: arXiv 2602.11199).
Customize, manage templates of rubrics and fast grade HTML/PDF files
Universal quality evaluation plugin for Claude Code — 7-dimension scoring (correctness, completeness, adherence, efficiency, safety), configurable rubrics, threshold blocking, auto-hooks & /judge command.
An Appscript to generate a Google Sheet that will allow you to import certain learning targets into a Google Classroom Assignment.
Add a description, image, and links to the rubric topic page so that developers can more easily learn about it.
To associate your repository with the rubric topic, visit your repo's landing page and select "manage topics."