https://chatgpt.com/share/69e1640c-c9e0-838a-802a-79ccf416c87e
https://osf.io/hj8kd/files/osfstorage/69e15fd85c323f0901affdd1
From Wiki Pages to Knowledge Objects -
An Illustrated Mini Textbook on Governed LLM Wiki Maturation
Raw Objects, Universe-Bound Assimilation, Coverage Ledgers, Perspective Assimilation Engines, Runtime Skill Cells, and Selectable Governance Packs
Part I — Why Persistent Wiki Is Not Yet Governed Knowledge
Chapter 1. From Wiki Pages to Knowledge Objects
1.1 The shift that already happened
Most early LLM knowledge workflows were built around a simple loop:
- retrieve raw documents
- synthesize an answer
- discard most of the intermediate structure
- repeat from scratch next time
That pattern works, but it traps the system in permanent rediscovery. A later query may overlap heavily with an earlier one, yet the runtime still redoes the same extraction, filtering, comparison, and summarization work.
The wiki pattern changed that. It replaced one-shot answering with cumulative compilation. Instead of treating every query as a new research job, the system begins to maintain a persistent knowledge surface: summaries, entity pages, concept pages, comparison pages, and cross-links.
That was already a major architectural upgrade.
The important point is this: the LLM stopped being only an answer generator and started becoming a maintainer of compiled structure.
In compact form:
K_T := (R, W, Σ ; I, Q, L ; N) (1.1)
where:
- R = raw sources
- W = wiki
- Σ = schema
- I = ingest
- Q = query
- L = lint
- N = navigation infrastructure
This kernel is small, but real. It gives the system a persistent memory surface and a maintenance loop.
1.2 Why that is not the end of the story
A persistent wiki is already much better than plain RAG, but it still leaves a deeper architectural question unanswered:
What exactly is the knowledge unit?
At first glance, the answer seems obvious: the page.
But a page is only a human-facing view. Architecturally, it is too coarse. One page may contain:
- source-grounded claims
- partial interpretations
- stable concept fragments
- weak generalizations
- unresolved contradictions
- speculative bridges
- multi-domain hints
All of that may read smoothly as prose, yet be internally heterogeneous.
So the real shift of this book is not from retrieval to wiki. That shift has already happened.
The next shift is from page to object.
This can be written very simply:
page ≠ final knowledge unit (1.2)
page = staged object in a maturation pipeline (1.3)
That sentence is the doorway into the whole framework.
1.3 Why the page is not enough
A page is useful because it is readable. A human can browse it, revise it, and link it. That is valuable. But readability is not the same as governability.
A page may look coherent while hiding:
- which parts are strongly source-grounded
- which parts are local synthesis only
- which parts are tentative
- which parts are unresolved residual
- which perspective shaped the consolidation
- which later objects should absorb which portions
A page can therefore succeed as prose while failing as runtime state.
That is the hidden weakness of page-only architectures. They optimize for surface legibility before they optimize for long-horizon replayability.
1.4 The idea of a knowledge object
A knowledge object is a page plus runtime discipline.
It is not just text. It is text bound to structure, provenance, traceability, and later transformation rules.
A real knowledge object must answer questions such as:
- Where did this come from?
- What phase is it in?
- What perspective is active?
- What remains unresolved?
- What later assimilations should it support?
- What evidence shows how it reached its current form?
That is why the book uses the word object rather than page. The object language forces us to think in terms of state, transition, and governance rather than only in terms of prose polish.
1.5 The maturation idea
Once we move from page-thinking to object-thinking, a new picture appears.
Knowledge artifacts do not all play the same role at the same time. A newly compiled artifact should not be forced to behave like a mature doctrine object immediately. Its first job is usually much more modest:
- preserve source-grounded structure
- expose stable internal segments
- retain fragility and uncertainty honestly
- remain reusable by later assimilation passes
That means the first durable wiki artifact above the raw source layer should often be treated as a Raw Object, not as final concept doctrine.
Later, after broader consolidation under a declared perspective, parts of that Raw Object may contribute to Mature Objects.
And some material may never deserve immediate maturity. It may instead become residual, or enter an Inspirational layer for guided exploration.
So the core ladder is:
wiki artifact → Raw Object layer → Mature Object layer → governed knowledge runtime (1.4)
This is the basic architectural staircase of the book.
1.6 Three object roles
The framework introduces three roles.
First:
Raw Object = source-grounded, immature concept object. (1.5)
Second:
Mature Object = perspective-bound, consolidated concept object. (1.6)
Third:
Inspirational Object = exploratory or weak-signal candidate not yet admitted into the mature core. (1.7)
These are not mere labels. They are phase distinctions.
The Raw Object preserves extracted structure without
pretending to be final doctrine.
The Mature Object supports broader reuse under stronger closure rules.
The Inspirational Object protects emerging patterns without contaminating the
trusted layer too early.
1.7 Why this matters for engineering
This book is not trying to sound philosophically sophisticated. It is trying to improve runtime control.
The engineering benefits of object-thinking are immediate:
- better replayability
- cleaner maintenance responsibilities
- stronger perspective discipline
- more honest handling of uncertainty
- easier later consolidation
- better human arbitration surfaces
A page-centric system asks:
“How should this page read?”
An object-centric system asks:
“What phase is this artifact in?” (1.8)
“What closure does it currently deserve?” (1.9)
“What later transformations should it support?” (1.10)
Those are much stronger questions.
1.8 The deeper design claim
The core claim of this chapter can now be stated clearly:
A serious LLM wiki should not treat its pages as flat prose files only. It should treat them as phase-specific knowledge objects designed for later governed maturation. (1.11)
That is the conceptual move that makes everything else in the book possible.
1.9 Chapter summary
The first architectural leap was:
retrieval → persistent compilation
The second architectural leap is:
persistent pages → governed knowledge objects
This textbook is about that second leap.
.png)
.png)
.png)
.png)