Independent project. Not affiliated with, sponsored by, or endorsed by the Watch Tower Bible and Tract Society or Jehovah's Witnesses.
jw-agent-toolkit
Master roadmap · 63 phases

From technical library to adoptable platform

Nine consecutive releases covering: complete discipleship; a technical core at SOTA; runtime trust + community + JS frontier; multilingual SOTA with 1672 languages via Omnilingual ASR; 4 new packages (live-meeting, biblical KG, operational scheduler, legal plugin); verifiable agentic + multimodal + predictive + consented family voice; doctrinal alignment via CAI/RLAIF/DPO; tri-model mechanistic interpretability; and congregational operations plus multi-country TJ vs. State legal cases. Every phase follows the same flow: brainstorming → approved spec → bite-sized TDD plan → red-green-commit implementation → audit.

Shipped

63

Total

63

▌ 9 releases

22-32

Release 22-32 · Discipleship ecosystem

11 orthogonal phases that close the active-discipleship loop plus an initial trust baseline.

View details →

33-38

Release 33-38 · Core at SOTA

6 phases that raise the technical ceiling: embeddings + audio + grammars + VLM + visual + generation. Eighth package (jw-gen) introduced.

View details →

39-48

Release 39-48 · Runtime trust + community + JS

10 shipped phases: live NLI, provenance, plugin SDK, scaffolding, tracing, judge, chunking, versification, TS port and browser extension.

View details →

49-56

Release 49-56 · jw-core complete + multilingual SOTA

8 phases that complete jw-core: persistent second-brain, JWPUB/.jwlibrary writers, organized-app schemas, Omnilingual ASR (1672 languages), NLLB-200, multilingual wire-up and JS bridge close-out with anti-drift CI.

View details →

57-66

Release 57-66 · 2 new packages + live-meeting + biblical KG

13 phases in a single continuous session: 6 core phases and 7 MVP+1 follow-ups. The core phases are clean-room jw-meeting-media (WOL parser from DevTools + Tauri presenter), JW-pure bible-knowledge-graph (607 B.C.E.), opt-in letta-memory with Fernet, marker+markitdown loaders, diarized whisperX + speaker mapping, and the brain exposed via MCP.

View details →

65-76

Release 65-76 · Agentic + multimodal + predictive + voice

9 phases that open four new layers: verifiable agentic orchestration with NLI critic (F65-F67), end-to-end multimodal with a physical book camera and a visual quote verifier (F68-F71), diachronic analysis with temporal embeddings (F72) and consented family voice with a 3-layer license gate + Fernet encryption (F76). +480 tests, +15 new MCP tools.

View details →

77-79

Release 77-79 · Alignment to the published canon

3 phases applying Constitutional AI, RLAIF, and DPO/ORPO to the local fine-tune's concrete problem: not drifting from the published canon. F77 codifies 5 builtin principles in versioned YAML and exposes them to Judge + fidelity_wrap (lazy import, zero cycle). F78 promotes the Judge to a pairwise annotator and introduces SL-CAI (self-critique). F79 adds DPO and ORPO trainers with Unsloth plus three Qwen3.5-0.8B recipes (Apache-2.0) wired into the CLI with a new prepare-preference command. +41 tests, 1,326 passing.

View details →
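
To make the DPO step in F79 concrete, here is a minimal sketch of the standard per-pair DPO loss in plain Python. It is not the toolkit's Unsloth-based trainer; the function name, argument names and the default beta are illustrative assumptions.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (hypothetical helper).

    loss = -log(sigmoid(beta * ((pi_c - ref_c) - (pi_r - ref_r))))
    where every argument is a summed sequence log-probability.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch for numerical stability.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy equals the reference model the margin is zero and the loss is log 2. The loss falls as the policy raises the chosen answer's likelihood relative to the rejected one.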

80

Release 80 · Mechanistic interpretability, tri-model

Closes the F77-F79 loop: did the model internalise the principles or learn a stylistic shortcut? New jw-interp package with per-principle linear probing (F80.1), steering vectors + activation patching (F80.2), Qwen-Scope adapter (F80.3, TopK residual), Gemma Scope wrapper (F80.4, SOTA JumpReLU with MLP + attention) and a runtime probe store + observational fidelity_wrap Tier 4 (F80.5). Tri-model architecture: production Qwen 0.8B left untouched, lab Qwen 2B and lab Gemma 2B with public SAEs. Cross-family validation. +85 tests, 1,411 passing.

View details →
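
One common way to build a steering vector like those mentioned for F80.2 is a difference of mean activations. The sketch below assumes that recipe; the page does not say how jw-interp computes its vectors, and every name here is hypothetical.

```python
from statistics import fmean


def steering_vector(positive_acts, negative_acts):
    """Difference-of-means steering vector (assumed recipe, not confirmed by the source).

    positive_acts / negative_acts: lists of equal-length activation vectors taken
    at one layer, for examples that follow / violate a principle.
    """
    dim = len(positive_acts[0])
    return [
        fmean([a[i] for a in positive_acts]) - fmean([a[i] for a in negative_acts])
        for i in range(dim)
    ]


def apply_steering(activation, vector, alpha=1.0):
    """Shift one activation along the steering direction, scaled by alpha."""
    return [a + alpha * v for a, v in zip(activation, vector)]
```

In a real model the shifted activation would be written back through a forward hook at the chosen layer; that part is omitted here.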

81-82

Release 81-82 · Congregational operations + TJ legal cases

Two new application-layer modules. F81 closes the operational loop of the meeting-scheduler with organized-app importer + CRDT diff (F81.0), a roster-editing CLI that honours last_updated so re-imports do not clobber manual edits (F81.1) and a strict Pydantic constraints YAML per congregation with init/lint/show CLI (F81.2). F82 opens the legal module with a Territory catalogue of 30 countries (F82.0 — RU, KP, ER, SG, TJ banned; CN, AZ, BY, VN, MM, TR, CU, KZ restricted; 17 free) that composes the existing LocaleContext without duplicating cultural fields, and the jw-legal plugin registering a legal-cases-tj BrainDomain via entry-point jw_agent_toolkit.brain_domains (F82.1) — 6 NodeTypeSpec + 8 EdgeTypeSpec, CONTRADICTS non-directional sensitive. 2 new packages. +305 tests, 3,318 passing.

View details →
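
The F81.1 rule that re-imports must not clobber manual edits can be pictured as a newest-wins merge keyed on last_updated. This is only a sketch of that idea: RosterEntry, its fields other than last_updated, and merge_roster are hypothetical, not the scheduler's real schema.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class RosterEntry:
    person_id: str          # hypothetical key
    role: str               # hypothetical field
    last_updated: datetime  # timestamp the merge honours


def merge_roster(local, imported):
    """Merge imported entries into local ones without losing newer manual edits.

    An imported entry replaces a local one only when its last_updated is
    strictly newer; ties keep the local (manually edited) entry.
    """
    merged = {entry.person_id: entry for entry in local}
    for entry in imported:
        current = merged.get(entry.person_id)
        if current is None or entry.last_updated > current.last_updated:
            merged[entry.person_id] = entry
    return merged
```

Keeping the local entry on ties is a deliberate choice in this sketch: an import that carries no newer information should never overwrite what a person typed by hand.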

▌ Master table · 63 phases (22 → 82 + follow-ups)

| Phase | Slug | Tier | Status | Release |
|---|---|---|---|---|
| 22 | eval-doctrinal | T1 Trust | ✓ shipped | 22-32 |
| 23 | citation-validator | T1 Trust | ✓ shipped | 22-32 |
| 24 | study-conductor | T2 High value | ✓ shipped | 22-32 |
| 25 | news-monitor | T2 High value | ✓ shipped | 22-32 |
| 26 | student-parts | T3 Specialised | ✓ shipped | 22-32 |
| 27 | pioneer-report | T3 Specialised | ✓ shipped | 22-32 |
| 28 | concordance | T3 Specialised | ✓ shipped | 22-32 |
| 29 | letter-composer | T4 UX | ✓ shipped | 22-32 |
| 30 | kingdom-songs | T4 UX | ✓ shipped | 22-32 |
| 31 | exporter | T4 UX | ✓ shipped | 22-32 |
| 32 | life-topics | T4 UX | ✓ shipped | 22-32 |
| 33 | embed-rerank | T1 Core | ✓ shipped | 33-38 |
| 34 | audio-premium | T1 Core | ✓ shipped | 33-38 |
| 35 | constrained-decoding | T2 Enabler | ✓ shipped | 33-38 |
| 36 | vlm-ocr | T2 Enabler | ✓ shipped | 33-38 |
| 37 | colpali-visual | T3 Specialised | ✓ shipped | 33-38 |
| 38 | jw-gen (8th package) | T4 UX | ✓ shipped | 33-38 |
| 39 | nli-runtime | T1 Runtime trust | ✓ shipped | 39-48 |
| 40 | content-provenance | T1 Runtime trust | ✓ shipped | 39-48 |
| 41 | plugin-sdk | T2 Community | ✓ shipped | 39-48 |
| 42 | scaffolding | T2 Community | ✓ shipped | 39-48 |
| 43 | agent-tracing | T2 Community | ✓ shipped | 39-48 |
| 44 | synth-judge | T2 Community | ✓ shipped | 39-48 |
| 45 | semantic-chunking | T3 Frontier | ✓ shipped | 39-48 |
| 46 | canonical-versification | T3 Frontier | ✓ shipped | 39-48 |
| 47 | jw-core-js-minimal | T4 JS surface | ✓ shipped | 39-48 |
| 48 | wol-browser-ext | T4 JS surface | ✓ shipped | 39-48 |
| 49 | second-brain | T1 Persistent memory | ✓ shipped | 49-56 |
| 50 | jwpub-writer | T2 Generation | ✓ shipped | 49-56 |
| 51 | organized-app schemas | T2 Interop | ✓ shipped | 49-56 |
| 52 | jwlibrary-writer | T2 Generation | ✓ shipped | 49-56 |
| 53 | omnilingual-asr | T1 Multilingual SOTA | ✓ shipped | 49-56 |
| 54 | nllb-translation | T1 Multilingual SOTA | ✓ shipped | 49-56 |
| 55 | multilingual wire-up | T1 Integration | ✓ shipped | 49-56 |
| 56 | jw-core-js close-out | T4 JS surface | ✓ shipped | 49-56 |
| 57 | jw-meeting-media (clean-room) | T1 New layer | ✓ shipped | 57-66 |
| 58 | bible-knowledge-graph | T1 JW-pure biblical KG | ✓ shipped | 57-66 |
| 61 | letta-memory (opt-in) | T2 Persistent memory | ✓ shipped | 57-66 |
| 62 | marker-markitdown (loaders) | T2 External loaders | ✓ shipped | 57-66 |
| 64 | whisperX-asr (diarization) | T2 Diarized ASR | ✓ shipped | 57-66 |
| 66 | mcp-jw-brain | T3 Brain exposed via MCP | ✓ shipped | 57-66 |
| 57.14 | drag-and-drop UI | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 57.15 | automatic external monitor | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 57.16 | multi-congregation | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 58.13 | place geocoords | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 58.14 | headwords expansion | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 61.8 | auto-recap | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 64.7 | speaker names mapping | T4 MVP+1 follow-up | ✓ shipped | 57-66 |
| 65 | meta-orchestrator | T1 Agentic | ✓ shipped | 65-76 |
| 66 | conversation-sparring | T1 Agentic | ✓ shipped | 65-76 |
| 67 | doctrinal-reasoner | T1 Agentic | ✓ shipped | 65-76 |
| 68 | talk-lab | T2 Multimodal | ✓ shipped | 65-76 |
| 69 | broadcasting-visual-index | T2 Multimodal | ✓ shipped | 65-76 |
| 70 | image-quote-verifier | T2 Multimodal | ✓ shipped | 65-76 |
| 71 | book-camera | T2 Multimodal | ✓ shipped | 65-76 |
| 72 | doctrinal-drift | T3 Predictive ML | ✓ shipped | 65-76 |
| 76 | family-voice-clone | T4 Voice / accessibility | ✓ shipped | 65-76 |
| 77 | fidelity-principles | T1 Alignment | ✓ shipped | 77-79 |
| 78 | rlaif-pipeline | T1 Alignment | ✓ shipped | 77-79 |
| 79 | dpo-orpo-trainers | T2 Training | ✓ shipped | 77-79 |
| 80 | interp tri-model (5 sub-phases) | T1 Interpretability | ✓ shipped | 80 |
| 81 | meeting-scheduler (organized-app importer + roster CLI + constraints YAML) | T2 Operational | ✓ shipped | 81-82 |
| 82 | legal-cases-tj (Territory catalog + BrainDomain plugin) | T3 Research | ✓ shipped | 81-82 |

Each planned phase has its approved spec and bite-sized TDD plan in docs/superpowers/specs/ and docs/superpowers/plans/ respectively. Planning total: ~92k lines, 251 documented TDD tasks, parallelisable by tier.

▌ Hard rules that all 63 phases respect

  • No LLM on the critical path: deterministic parsers/agents/stores.
  • Citations are always verifiable with a canonical URL to wol.jw.org.
  • Local-first; all personal persistence lives in ~/.jw-agent-toolkit/.
  • No network in tests: fixtures + cassettes + injectable fetchers.
  • Multilingual from day 1 (en/es/pt minimum, up to 17 locales).
  • Triple-target providers: API + NVIDIA GPU + MLX/Apple Silicon.
  • Does not replace the word of the elders.
  • No distribution of song lyrics or Bible text (copyright).
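
As an illustration of the "no network in tests" rule, an injectable fetcher might look like the sketch below. The names fetch_title and make_fixture_fetcher and the example URL are hypothetical; they only show the pattern of passing the fetcher in as a parameter so tests can swap in recorded fixtures.

```python
from typing import Callable

# A fetcher is any callable that maps a URL to the response body.
Fetcher = Callable[[str], str]


def fetch_title(url: str, fetcher: Fetcher) -> str:
    """Return the <title> text of a page, using whatever fetcher is injected."""
    html = fetcher(url)
    start = html.find("<title>")
    end = html.find("</title>")
    if start == -1 or end == -1:
        return ""
    return html[start + len("<title>"):end].strip()


def make_fixture_fetcher(fixtures: dict[str, str]) -> Fetcher:
    """Build a fetcher that serves recorded bodies and never touches the network."""
    def _fetch(url: str) -> str:
        return fixtures[url]  # a KeyError flags a request the fixtures do not cover
    return _fetch


# Usage in a test: the parser runs against a fixture, with no HTTP call.
fake = make_fixture_fetcher(
    {"https://wol.jw.org/example": "<html><title> Sample </title></html>"}
)
title = fetch_title("https://wol.jw.org/example", fake)
```

Production code would pass a real HTTP-backed fetcher with the same signature, so the deterministic parser itself stays identical in both settings.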