Viewport 04 · The Crux
How a billion-node graph runs locally
The injection is two-layer, because one billion nodes cannot all be parameterized. The parametric experts carry the structure and the rubric; the retrieval layer carries the facts.
0
KiB per LoRA adapter (one knowledge unit)
0
to train one expert (single GPU)
0
add / update / delete, no backbone retrain
0
node target the retrieval layer carries
Why hybrid: pure parametric injection holds faithfulness under conflict but loses fine-grained facts; retrieval carries detail; together they beat either alone (arXiv 2510.12668). DMoE (arXiv 2606.14243) makes each knowledge unit a hot-swappable adapter, exactly what a nightly-growing graph needs. Agents-K1 (arXiv 2606.13669) is the document-to-graph factory feeding it.