Changelog¶
All notable changes to this guide are documented here. Format loosely follows Keep a Changelog. The repo-root mirror lives at /CHANGELOG.md.
[0.6.1] — 2026-05-06 — Standalone Claude site¶
Changed¶
- Claude guidance now lives at the separate Claude site instead of inside the Field Guide navigation.
- Homepage, Mastery hub, examples, source audit, and Claude redirect page now point to the standalone site.
- Removed Field Guide loading of the Claude-only CSS and JavaScript assets.
Verified¶
- Claude source links were refreshed against official Anthropic docs, including the current canonical plan chooser URL.
[0.6.0] — 2026-05-06 — Claude Mastery microsite¶
Added¶
- Claude Mastery — a beginner-first, step-by-step Claude setup and mastery microsite with a path picker, local progress checklist, copyable prompt kit, Projects walkthrough, feature map, 30-day practice plan, and safety baseline.
- Scoped
claude-mastery.cssandclaude-mastery.jsassets. The checklist persists only in the visitor's browser vialocalStorage; copy buttons use the browser Clipboard API and make no network calls.
Changed¶
- Homepage now surfaces Claude Mastery as a first-class starting point for readers learning Claude specifically.
- The older
mastery/claude.mdtrack now points readers to the dedicated Claude Mastery microsite to avoid duplicated, drifting Claude guidance.
Verified¶
- Claude feature claims rechecked against current official Anthropic sources for plans, usage limits, personalization, Projects, memory, incognito chats, Artifacts, Skills, connectors, Claude Code, and prompting best practices.
[0.5.2] — 2026-05-06 — Audit remediation and source refresh¶
Changed¶
- Homepage now has a "Pick the right starting point" routing table for absolute beginners, subscription maximizers, product choosers, agent builders, and team/sensitive-data owners.
- GitHub Copilot references now use the current Copilot cloud agent name, with "formerly Copilot coding agent" retained only where it clarifies vendor renaming.
- Copilot cloud agent availability and billing guidance now reflects the current GitHub Docs: Pro, Pro+, Business, and Enterprise availability; Business/Enterprise policy enablement; premium-request usage per cloud-agent session and steering comment.
- xAI model links now use the current canonical
docs.x.ai/developers/modelsURL, and Grok API examples default to the currently documentedgrok-4.3model identifier while still reading fromXAI_MODEL. - The MkDocs Material header repo widget was removed to avoid unauthenticated GitHub API calls and possible 403 console errors in visitor browsers.
- OpenAI recipe snippets now use
OPENAI_MODELinstead of stale hard-coded model IDs, and beginner setup steps include the matching env var. - Exact live-cost claims were removed from recipes; cost sections now describe token drivers and tell readers to recalculate against current vendor pricing.
- Structured-output language now tells readers to validate returned objects locally instead of treating provider-side schema constraints as a substitute for production validation.
- Browser/computer-use comparison now marks
computer-use-previewas legacy and reflects current OpenAIcomputertool guidance.
Verified¶
- Source refresh checked current official GitHub Copilot cloud-agent docs, GitHub Copilot requests/billing docs, xAI structured-output docs, xAI model docs, and OpenAI computer-use docs.
- Site QA now guards against stale xAI model URLs, stale Copilot cloud-agent billing language, hard-coded older OpenAI model IDs, and exact live-cost claims.
[0.5.0] — 2026-05-06 — Subscription-first expansion (deepening pass)¶
Added¶
- Capability map — cross-product matrix covering 18 capability rows across Claude / ChatGPT / Gemini / Grok / Perplexity / Copilot / "any AI tool" with plan-availability tags and per-cell fallbacks.
- No-code automation guide — ChatGPT Tasks, Gemini scheduled actions, Claude Cowork, calendar-driven manual fallback, plus the safety drill (3 manual runs, off-switch, cost cap).
- Memory & preferences guide — portable AI profile, global vs project memory, migration playbook, 6 copy-paste templates.
- Examples library — 13 beginner-to-expert guided examples, each with a 6-layer ladder (chat → project → memory → native task → custom assistant → dev/API).
- Universal 7-mission learning path — the no-API on-ramp from "I have a subscription" to "I'm getting full value."
- Plain-English glossary with schematic SVG diagrams for ChatGPT Project & Task, Claude Project / Skill / Cowork, Gemini Gem / Scheduled action, Grok chat.
- Task Builder v0.5: 4-tier mode selector (Beginner / Power User / Builder / Developer), 7 new output panels (Use this in…, Learning ladder, What good output looks like, Practice exercise, Free vs paid, Maximize my subscription, Expert expansion). Provider packages render for all 7 surfaces with the user's primary app first.
- Repo-root
SUBSCRIPTION_FIRST_EXPANSION_STATUS.mdandUSER_REQUIREMENTS_TRACEABILITY.mdextended;docs/product-requirements/user-requirements-traceability.mdmirrors them.
Changed¶
- Homepage card grid expanded with Learning path, Capability map, No-code automations, Memory, Examples.
- mkdocs.yml nav re-ordered: Task Builder → Learning path → Mastery → Capability map → Examples → No-code automations → Memory → Surface router → (rest unchanged).
- Roadmap rewritten to reflect the subscription-first strategy and what landed in v0.5.
- Source map gains a v0.5 "Subscription-first vendor surfaces" section; source audit gains a v0.5 audit entry.
Verified¶
mkdocs build --strictexits 0.- Beginner / Power User / Builder modes never recommend API / CLI / MCP / local scripts.
- Task Builder JS contains no
fetch,XMLHttpRequest,WebSocket,sendBeacon, dynamic script loading, or analytics calls (only commented references to those names exist as guardrails). - All runnable scripts in starter kits remain gated behind
RIZWAN_APPROVED_TO_RUN=1. - All 16 user requirements and all 24 acceptance criteria are mapped to implementations in the traceability files.
[0.4.0] — 2026-05-06 — Subscription-first expansion¶
Added¶
- Mastery section: index plus 7 beginner-to-expert tracks for Claude, ChatGPT, Gemini, Grok, Perplexity, Coding agents, and "Any AI tool / I don't know."
- Task Builder Beginner Mode (the new default) plus a primary-app picker, a comfort-level picker, a 16-option job picker, and two new output panels: Level up this workflow and If your plan doesn't have this.
- A What to click output panel with exact step-by-step instructions per AI app.
- User requirements traceability page mapping every user requirement to its implementation.
Changed¶
- Homepage opens with "No API key required. Start with the AI app you already use."
- Task Builder page rewritten with primary-app + comfort + job at the top.
- Advanced surface picker is hidden in Beginner Mode.
Verified¶
mkdocs build --strictexits 0.- Beginner Mode never recommends API/CLI/MCP/local scripts.
- Task Builder JS contains no network calls.
[0.3.0] — 2026-05-06 — Task Builder transformation¶
Added¶
- Task Builder — a static, browser-local tool that turns a task description into a recommended surface plus a complete package (prompt, system instructions, memory block, setup, tool allowlist, HITL gates, evals, red-team probes, repeat-run playbook, troubleshooting, exportable JSON).
- Surface router — long-form decision matrix for chat vs. project vs. coding agent vs. skill vs. automation vs. agent vs. browser-use vs. MCP.
- Grok / xAI coverage — platform page, quickstart, two recipes (structured outputs and tool calling). Distinguishes consumer Grok, Grok on X, and the xAI API.
- Model freshness page — env-var pattern (
OPENAI_MODEL,ANTHROPIC_MODEL,GEMINI_MODEL,XAI_MODEL) and a quarterly drift checklist. - ADR 0004 documenting the recipe nav pattern.
IMPROVEMENT_PASS_STATUS.mdat the repo root.
Changed¶
- Homepage rewritten as a task-to-outcome on-ramp.
- Platforms index gains an xAI / Grok row.
- Source map and source audit include xAI / Grok.
Fixed¶
- MCP starter-kit approval bypass removed from both Python and TypeScript MCP servers;
RIZWAN_APPROVED_TO_RUN=1is now unconditionally required. - Deployment claims corrected in
DEPLOYMENT_STATUS.md, the rootCHANGELOG.md, and this page. - Link check gating split into a gating internal-links job and an advisory external-links job.
- Model currentness fixed in
starter-kits/local-script-agent/script.py(now readsOPENAI_MODEL).
Verified¶
mkdocs build --strictexits 0.- The Task Builder JavaScript contains no network calls.
- All v0.2 work preserved.
[0.2.0] — 2026-05-06 — Productionization pass¶
Added¶
- Quickstarts — five end-to-end recipes (read-only, file-editing, coding, browser, MCP).
- Recipe library — 40+ complete agent recipes covering common workflows.
- Starter kits — nine copyable kits with specs, prompts, allowlists, eval/red-team JSONL, and launch checklists. Example scripts are inert by default.
- Eval banks — 100+ golden cases and 100+ red-team cases with rubrics and per-platform usage notes.
- Agent Factory module — a 25-candidate → 5-shipped worked example.
- Source audit — a record of which pages were rechecked against which official URLs.
- Repo hygiene — issue/PR templates, Dependabot, PR build workflow, link-check workflow.
- Deployment docs —
DEPLOYMENT_STATUS.mdandPRODUCTION_DEPLOYMENT.mdat repo root.
Changed¶
- Removed all "(coming soon)" labels from the Platforms index — every page exists and is linked.
- Nav label
Safety/safety-checklistscorrected toSafety checklists. - Source map expanded with additional Anthropic, OpenAI, Google, GitHub, and MCP references.
Verified¶
mkdocs build --strictexits 0. The build emits an upstream Material for MkDocs informational banner about the future MkDocs 2.0 release (theme-level, not a project warning) and INFO-level notices for the 41 recipes accessed through the Recipes index — see ADR 0004.- GitHub Pages production deployment succeeds.
[0.1.0] — 2026-05-06¶
Added¶
- Initial public release of the AI Agent Builder Field Guide.
- Start Here section: what an agent is, good/bad tasks, decision tree, four setup paths, safety baseline.
- Platform setup guides for Claude Desktop, Claude Code, Claude Projects, Gemini app, Gemini CLI, Antigravity, Google AI Studio, ChatGPT, Custom GPTs, OpenAI API/Agents SDK, Codex CLI, GitHub Copilot cloud agent, and local scripts/schedulers.
- MCP section: concepts, installation, writing servers, remote/custom connectors, security.
- Browser & computer use section: Anthropic, OpenAI,
browser-use, operating boundaries. - Orchestration section: single-agent loops, multi-agent patterns, local-first, state/memory.
- Evaluation & safety section: eval sets, red-team workflows, safety checklists, HITL, incident response.
- Templates: agent spec, prompt, eval rubric, safety checklist, agent PRD.
- Reference: glossary, source map, roadmap, decision records.
- MkDocs Material build, GitHub Actions deploy workflow, MIT license, CONTRIBUTING, CODE_OF_CONDUCT, SECURITY.
Source verification¶
- All product pages verified against the official sources listed in
source-map.mdon 2026-05-06.