v1.2.0 · 51.1k PyPI downloads · canonical-source workflow

Keep one canonical source
for every AI coding agent

If you landed here from HN, the pitch is simple: keep one repo-owned source of truth at .agentkit/source.md, project it into AGENTS.md, CLAUDE.md, GEMINI.md, COPILOT.md, AGENT.md, and llms.txt, then score whether the repo is actually ready for agent work.

$ pip install agentkit-cli
See what changes in a repo ↓ View on GitHub
Source Author one file, not six drifting ones.

Keep the canonical instructions in .agentkit/source.md, then project the filenames your tools already read.

Repo diff The first output is a git diff you can review.

Run source --init and project --write to add real files to the repo instead of hiding setup in another hosted UI.

Guardrails Get a fast score, then add gates when the repo is ready.

Use quickstart for the first read, score for the repo-owned setup, and gate once you want CI enforcement.

Proof 51.1k PyPI downloads, 102 shipped versions, 20 public scored repos.

Real installs, real scored examples, and a workflow that extends past the install command.

4824
Tests
102
Versions
6
Packages
20
Repos Scored
5.1k
PyPI 30d Downloads

PyPI stats source: pypistats.org, updated 2026-04-21.

What changes in a real repo

The first visible output is a repo diff. agentkit does not ask you to trust a hosted dashboard before it changes anything useful in the repo itself.

Before
repo/
  README.md
  pyproject.toml
  src/
  tests/

Normal codebase, but no shared agent instructions. Each tool starts from a different file or from no file at all.

After source --init + project --write
repo/
  .agentkit/source.md
  AGENTS.md
  CLAUDE.md
  GEMINI.md
  COPILOT.md
  AGENT.md
  llms.txt
  README.md
  pyproject.toml
  src/
  tests/

One repo-owned source fans out into the filenames Claude Code, Codex, Gemini, Copilot, and other agent surfaces already look for.

Reviewable in git

The setup becomes a normal repo change that maintainers can diff, review, and revert.

Shared across agents

You edit the source once instead of hand-syncing separate agent files every time the repo rules change.

Measurable afterward

Run agentkit score or agentkit gate after the files exist so the workflow stays inspectable.

Install to first value

There are two honest entry points: use quickstart if you want a score before editing files, or use the canonical-source flow if you already want repo-owned agent instructions in git.

Path 1

Get a score first

Best for skeptical first runs. You learn where the repo stands before adopting any workflow changes.

pip install agentkit-cli
agentkit quickstart .
  • quickstart runs doctor plus a fast composite score on the current repo.
  • No file writes required. You get findings and a score before deciding whether to standardize anything.
Path 2

Write the repo-owned setup

Best when you already know the repo needs one shared source of instructions across multiple agent surfaces.

agentkit source --init
agentkit project --write
agentkit score
  • source --init creates .agentkit/source.md as the file you actually maintain.
  • project --write writes the projected agent files next to the code.
  • score tells you if the repo is ready for gates and repeatable agent work.

How agentkit works

The front door is a repo workflow, not just a leaderboard. Start with a canonical source, project it into agent-specific files, and put measurable guardrails around the whole loop.

01

Write the canonical source

Initialize .agentkit/source.md so your team edits one file instead of manually keeping AGENTS.md, CLAUDE.md, and friends in sync.

02

Project into real agent files

Generate the filenames existing tools expect, so Claude Code, Codex, Gemini, Copilot, and MCP flows inherit the same context.

03

Score, gate, and learn

Run score, fail CI with gate, benchmark execution quality, and inspect transcripts so the workflow gets better over time.

Who should use this now

This is most useful once agent work is a repo problem, not just a personal prompt problem. The value starts when the instructions need to live in git and stay consistent across tools or teammates.

More than one agent touches the repo

You already have Claude Code, Codex, Gemini, Copilot, or MCP tools touching the same codebase and you do not want their instructions drifting apart.

You want repo-owned instructions

You want the setup to be a normal change in git, with files maintainers can review, diff, and enforce in CI.

You want proof before policy

You want to score real repos, inspect public examples, and measure readiness before telling a team to standardize on a workflow.

Why trust this

The value here is not just the thesis. There is a shipped toolchain behind it, public scored examples, and enough surface area to show the workflow has been exercised.

Shipped and installed

102 released versions and about 51.1k PyPI downloads show this is a real shipped toolchain, not a one-weekend demo.

Public examples, not mockups

20 public repositories are already scored on this site across 4 ecosystems, so visitors can inspect what the workflow measures.

Workflow surface, not a toy

The toolkit already spans quickstart, source management, projection, scoring, linting, benchmarking, transcript review, and MCP access across 6 packages, with 5.1k downloads in the last 30 days alone.

The workflow stack

agentkit-cli is the entry point, but the product story is the full workflow around source management, projection, scoring, linting, benchmarking, and transcript learning.

agentkit-cli

The orchestration entry point for quickstart, source, project, score, gate, and guard flows.

agentmd

Generate or tighten agent-facing context files from the canonical source before they drift.

agentlint

Catch stale paths, year-rot, missing references, and bloated instructions before they mislead agents.

coderace

Benchmark AI coding agents on your own tasks instead of trusting generic benchmark marketing.

agentreflect

Read transcript history to see where agent runs burned budget, stalled, or repeated avoidable mistakes.

agentkit-mcp

Expose the toolkit through MCP when you want the same workflow available inside other agent surfaces.

Public proof

Start here if you want evidence before install. These scorecards show what the toolkit measures against real repositories, not toy demos.

Loading...

Core commands

If you only try one path first, use the setup flow. The rest of the commands help once you want analysis, sharing, transcript review, or CI enforcement.

CommandDescription
agentkit quickstart .Run the fastest honest first pass: doctor plus a fast composite score on the current repo
agentkit source --initCreate the dedicated canonical source file at .agentkit/source.md
agentkit project --writeWrite AGENTS.md, CLAUDE.md, GEMINI.md, COPILOT.md, AGENT.md, and llms.txt from that source
agentkit scoreCompute the repo's composite agent-readiness score
agentkit analyze github:owner/repo --shareAnalyze a public repo and publish a shareable scorecard
agentkit burn --path ./transcriptsInspect local coding-agent transcript spend and waste patterns
agentkit gate --min-score 80Fail CI when the setup drops below your quality bar

Not ready to install yet?

Browse ecosystem scoreboards and repo pages first. It is the fastest way to understand what agent-readiness means in practice.

Org leaderboard

Rank all public repos in a GitHub org to spot where agent workflows are already strong and where they need cleanup.

agentkit org github:ORG

Share a developer scorecard

Generate a public profile card when you want a simple external artifact for a person or repository set.

agentkit user-scorecard github:USERNAME

Recent rankings

A sample of currently scored repositories. The rankings are still useful, but they now support the broader workflow story instead of pretending to be the whole product.

20 repos scored
RepositoryScoreGrade
langchain-ai/langchain
50
C
fastapi/fastapi
50
C
django/django
50
C
psf/requests
50
C
pallets/flask
50
C
vercel/next.js
50
C
microsoft/vscode
50
C
nestjs/nest
50
C
supabase/supabase
50
C
trpc/trpc
50
C
rust-lang/rust
50
C
tokio-rs/tokio
50
C
actix/actix-web
50
C
hyperium/hyper
50
C
serde-rs/serde
45
D
golang/go
50
C
gin-gonic/gin
50
C
gofiber/fiber
50
C
spf13/cobra
50
C
kubernetes/kubernetes
50
C