Dashboards Were the Last Central Planner. Context Windows Are the Next.

An essay on harnesses, knowledge, and the institutional layer that makes distributed action accountable.

May 23, 2026

In the 2010s, governments around the world began building “real-time governance” dashboards. Entire states and cities were reduced to live indicators on giant screens: procurement, sanitation, welfare delivery, grievances, district by district, in real time, for senior officials to monitor.

The dashboards were often genuinely impressive. They were also built on a claim that does not hold: that a complex society can be governed from the indicators that fit on a screen.

The dashboards were not wrong on their own terms. Most metrics were measured correctly enough. The failure was structural. A sanitation target appeared green because the asset existed on paper but not in the village. Procurement moved on schedule while the people it was meant to reach still waited. A grievance count dropped because the channel for filing complaints had quietly broken.

The dashboard’s apparent completeness made the missing reality disappear. What mattered was happening in the gap between the indicator and the ground, and the screen had no pixel for it.

This essay is about the version of that story we are now living through at much larger scale, with an instrument far more sophisticated than the dashboard. The instrument is the context window of a large language model. The promise is similar. The failure mode will be the same. And the response—the thing I have been calling the harness—is not new either. It is the latest name for an idea earlier generations already understood and the current wave is busy forgetting.

That is the argument. The problem the context window creates is not a discovery. It is the rediscovery of a structural fact that a cybernetician named in the 1940s, a science fiction writer dramatised in the 1950s, an economist proved in 1945, a philosopher grounded in 1966, and an information theorist has now restated in the 2020s.

Five vocabularies, one insight: a system that veers cannot be governed by a better description of itself. It can only be governed by a correction structure built around it. Remove the correction structure and you have noise pretending to be infrastructure.

The genealogy of one idea

Start with the cybernetician, because he stated it first and most cleanly.

Norbert Wiener, 1948. Cybernetics: Or Control and Communication in the Animal and the Machine named a field around a single principle: stable systems are closed-loop. A system that acts on the world, observes the result, and feeds the error back into its next action is governable. A system that acts without that feedback is open-loop, and open-loop systems drift, oscillate, and fail. Wiener took the word from the Greek kybernetes, the steersman, the one whose whole function is continuous correction against a current that never stops pushing. Two years later, in The Human Use of Human Beings, he made the social version of the argument: the human and the machine form one feedback structure, and the danger is not the machine but the human who removes himself from the loop and lets the machine run open.

Translate that to the present and the whole agentic governance debate falls into place. The model is the plant, the thing that acts and veers. The harness is the controller, the structure that observes what the agent did and feeds the correction back before the next action. An agent without a harness is not autonomous. It is open-loop. It is a steersman who has let go of the tiller and is calling the drift a destination.

Susan Calvin, 1950. Asimov’s robopsychologist is a fictional character, but the discipline he invented her to embody is the sharpest statement of the second half of the idea. In I, Robot, robopsychology exists because the engineers who built the robots could not specify their behaviour in advance.

The important shift in Asimov’s framing was not that robots required debugging, but that their behaviour became easier to interpret externally than to fully specify internally. A new discipline emerged in the gap between execution and predictability.

Calvin does not read the code. She interprets the system from the outside, by observing behaviour.

That is close to the ceiling we are now approaching with large language models. We can often interpret them. We cannot fully specify them.

The lesson worth carrying forward is simple: a system’s safety cannot rely on the system’s own account of itself. You do not ask the agent whether it stayed inside its mandate. You build the structures capable of answering the question independently.

Friedrich Hayek, 1945. Three years before Wiener, the economist had already proved why the enclosure cannot be a central plan. The Use of Knowledge in Society opens by granting the planner everything: if we possessed all the relevant information, the problem of the best use of resources would be, in Hayek’s words, purely one of logic. The whole argument is that this condition never arrives. The knowledge an economy runs on, he wrote, never exists in concentrated or integrated form, but only as dispersed bits of incomplete and often contradictory knowledge held by separate individuals. It is a problem of using knowledge that is not given to anyone in its totality. The knowledge of how much steel is needed in a factory next Tuesday is held by the foreman. It cannot travel upward without being abstracted, and the abstraction destroys most of what made it actionable.

The context window is the central planner’s tool, and it makes the planner’s exact promise: assemble enough of the situation onto one surface and the decision becomes a matter of logic. Retrieved from documents, chunked into the embedding space, the distributed knowledge is gathered toward a single point of decision. And it fails for exactly the reason Hayek named. The completeness never arrives, because the knowledge does not survive the trip. The collapse of the Soviet planning ministries was supposed to have settled this debate. What the context window reveals is that Hayek was describing an architectural pattern, not a regime. Any system that tries to decide by aggregating distributed knowledge into central representation hits the same wall. The state-on-a-wall dashboard hit it. The agent-driven enterprise is about to.

Hayek’s own answer was not a better plan. It was the price system, which coordinates distributed actors without ever collecting their knowledge into one place. No one node knows why steel got more expensive; the price carries just enough for each actor to adjust, and the coordination happens without the centralization. The harness is the price system for agents. It does not try to know what the agent knows or to assemble the full situation onto a screen. It bounds, prices, and corrects the agent’s action from outside, coordinating without centralizing. That is the move the whole genealogy points to, and it is the move the context-window-as-planner cannot make.

Michael Polanyi, 1966. Hayek’s argument depended on a deeper philosophical claim that Polanyi later formalised. The Tacit Dimension opens with the sentence the whole genealogy turns on: we can know more than we can tell. The doctor who diagnoses on intuition, the engineer who hears a failing bearing, all operate on knowledge that is real, reliable, and structurally inarticulate. As economist David Autor noted in his work on Polanyi’s Paradox, the tasks hardest to automate are exactly the ones whose rules we cannot state. The context window can hold text, but it cannot hold the tacit dimension. This is why context windows are not merely incomplete. They are structurally incomplete. The relevant knowledge is, in large part, not encodeable.

Vishal Misra, 2026. The information theorist closes the loop. Misra’s argument, grounded in his work on Bayesian Attention, isolates a fatal limit in how foundation models learn. They are trained to minimise cross-entropy, making them the ultimate statistical guessers of the next token within distributions they have seen.

What they do not necessarily do is Kolmogorov construction: discover the underlying generative rule itself and reliably apply it outside the original distribution.

They fit the statistical shape of the curve. They do not consistently derive the governing law beneath it.

The consequence is that these systems often do not fail the way humans fail. Humans usually experience uncertainty when entering unfamiliar terrain because they recognize the absence of an underlying model. Predictive systems frequently possess no equivalent internal boundary signal. They continue generating fluent, highly probable outputs beyond the range where their internal representations remain reliable, with little indication that they have crossed from inference into invention.

Five thinkers, one idea, eighty years apart.

Wiener said the system veers and needs a controller.
Calvin said you cannot get safety from the system’s own report.
Hayek said the knowledge to plan it centrally does not exist.
Polanyi said the missing knowledge is structurally unencodeable.
Misra said the mechanism that makes the model useful is the same mechanism that makes it veer.

The current wave of enterprise AI is not discovering a new problem. It is discarding a prior generation’s answer to it.

The .md dream, and why it fails

One of the clearest contemporary forms of this forgetting is the belief that agentic systems can be governed simply by writing everything down. Put institutional memory into Markdown files. Feed the documents into retrieval systems. Give the model enough context and the organisation becomes machine-readable.

The current enthusiasm around “LLM-native operating systems” and markdown-based agent memory revives an older enterprise dream: that institutional knowledge can be fully externalized into documentation and centrally retrieved on demand.

The interface has changed. Embeddings, retrieval, and conversational context windows have replaced search bars, but the underlying assumption remains familiar: if enough context is written down, institutional coordination becomes reproducible.

It does not.

Wikis never completely failed because documentation was useless. They failed because institutions do not primarily run on explicit knowledge. They run on tacit coordination, local judgment, informal escalation paths, and context that changes faster than documentation can stabilize.

Production knowledge is procedural, contingent, and distributed. A markdown repository can document procedures, but it cannot fully capture the lived coordination patterns of an institution. The `.md` file becomes the dashboard again: a compressed representation of institutional reality mistaken for the institution itself.

The two engineers about to resign are not in the `.md` file. The regulatory shift the procurement team noticed informally last week is not in the repository. The gradual erosion of trust between teams is not embedded in the vector database.

This is Hayek’s distributed knowledge problem and Polanyi’s tacit dimension arriving together inside a developer workflow. It is the deepest kind of collective tacit knowledge, the operational reality that institutions rely on precisely because it cannot be fully formalized. Pull institutional memory into documentation, and much of what made it useful stays behind.

The same wall, in a new room.

The difference is that stale documentation in a wiki misled humans. Stale institutional memory in an agentic system becomes executable. The representation no longer merely informs action; it begins autonomously propagating it.

The Hayekian trap: distributed intelligence is not distributed accountability

A prevailing narrative around AI claims that distributed intelligence has finally arrived as infrastructure. Analytical capacity once concentrated inside large institutions is now accessible to individuals, small firms, and autonomous systems at planetary scale.

The claim is partially correct, which is what makes it dangerous.

The democratization of analytical capability is real. But the argument quietly inherits an assumption from Hayek’s theory of distributed actors: that decisions remain locally bounded, failures remain partially independent, and actors absorb the consequences of their actions close to where those actions occur.

Autonomous systems weaken those assumptions.

Agents act faster than humans can reliably review. More importantly, their failures are often correlated. Systems built on shared foundation models inherit similar representational assumptions, training distributions, and blind spots. When one agent drifts outside its reliable operating conditions, millions of others may drift in structurally similar ways at roughly the same time.

The surface appears distributed while the failure mode remains centralized inside the underlying representational substrate.

This is the hidden trap in the current wave of “distributed intelligence.” Analytical capability disperses outward while epistemic dependency recentralizes underneath it. Distributed intelligence without a distributed accountability structure is just the central planner reborn.

The evidence is already arriving

The pressure is already visible in three places

The first is technical.

Agent failure is not random. It clusters around the boundary of the model’s training distribution. Close to familiar territory, performance can appear highly capable. Outside it, behaviour degrades invisibly because the system often lacks a reliable internal signal that it has moved beyond the conditions where its representations remain dependable.

The result is not merely error, but confident propagation beyond reliable grounding.

That is Misra’s wall and Polanyi’s tacit dimension showing up in production at the same time. The knowledge that would tell a human to slow down because this case is unusual is not in the context window, and the agent cannot acquire it from the prompt.

The second pressure is organisational.

Enterprises see agents producing outputs, dashboards turning green, workflows accelerating, and governance structures appearing to function. Compliance reports are generated. Audit trails are stored. Observability improves.

But visibility does not guarantee control.

Most institutional governance systems were built on a hidden temporal assumption: review arrives before propagation. Continuous computational coordination reverses this ordering. Execution now spreads faster than institutions can reliably reconstruct or modulate authority.

The result is a growing coordination latency between what systems can do and what institutions can meaningfully supervise.

This is why observability increasingly becomes high-resolution panic. Institutions can often see recursive failures propagating through workflows while remaining structurally incapable of interrupting them safely.

The third is historical.

Erik Brynjolfsson and his co-authors’ work on the Productivity J-Curve supplies the pattern. Transformative technologies, electrification, computerisation, now generative AI, produce declining productivity at first, because the organisational forms inherited from the previous technology are wrong for the new one and the redesign takes a generation. The firms that win are not the earliest adopters of the technology. They are the earliest adopters of the organisational redesign around it.

The technology is the easy part. The institutional architecture is what decides the outcome.

The model fails at the distribution boundary. The organisation fails to notice because the dashboard is green. And the firms that get it right are the ones that build the institutional layer first and the technology second.

The firms that survive major technological transitions are rarely the ones that adopt the technology first. They are the ones that redesign institutional coordination around it first.

What the harness actually is

The word harness is doing triple duty in the current conversation, and the three meanings must be separated if we are to build infrastructure rather than theatre.

There is the technical harness: the Action Boundary. This is the runtime proxy that intercepts tool calls and mathematically enforces the institution’s rules before any external action commits.

There is the institutional harness: the Mandate Specification. This is the cryptographically signed structure that determines exactly what counts as authorised, setting the jurisdictional limits of the agent.

And there is the semantic harness: the LWD-R layer (Logic, Weights, Data, Representation). If the underlying model’s representational geometry, how it categorizes the world, is closed or inherited from a proprietary frontier model, the system cannot be contested.

A technical harness without an institutional mandate is just rate limiting. An institutional mandate without an Action Boundary is just documentation. And an Action Boundary wrapped around a closed Representation layer still leaves institutions dependent on external epistemic assumptions they cannot meaningfully contest. Together they separate three things the current debate runs together: control of execution, authority over the mandate, and ownership of representation.

This is also why the value is migrating into harnesses and services rather than into the models themselves. A service compounds for the same reason a market does: it coordinates distributed knowledge that no single corpus can hold. The model is the corpus, the attempt to compress the world’s knowledge into one set of weights, and like every central representation it is structurally incomplete. The service that wraps it, that watches what it does in a particular context and corrects it against a particular institution’s rules, is doing the price system’s work, supplying the local knowledge the weights never captured. Capital follows that work because the work is where the coordination actually happens. The market is simply noticing the structure ahead of the discourse.

The word harness is currently being used to describe several different things simultaneously, and separating them matters.

The architecture of structural accountability

What connects Wiener, Calvin, Hayek, Polanyi, and Misra is the same structural demand: systems require boundaries capable of correction under incomplete knowledge.

Wiener built feedback loops. Calvin showed that behavioural interpretation replaces full specification. Hayek showed that distributed coordination cannot be centrally represented without loss. Polanyi showed that much of the relevant knowledge cannot be fully articulated. Misra showed that predictive compression itself introduces invisible drift.

The institutional layer emerging around agents is ultimately an architecture of structural corrigibility. Not because institutions can fully specify machine behaviour in advance, but because they cannot.

The institution accepts that execution now happens at machine speed, and it responds by constructing boundaries that preserve accountability anyway: constraining execution, localizing authority, modulating delegation, and preserving the ability to reconstruct failures after propagation begins.

This is the opposite of the central planner’s architecture. The planner attempts to gather everything into one representation. The harness begins from the assumption that complete representation is structurally impossible.

Most institutions were built for a world where coordination remained partially fragmented. Departments separated responsibility, geography slowed propagation, manual review inserted delay, and local failures stayed locally bounded.

Continuous computational coordination erodes those buffers. Execution propagates across APIs, workflows, identity systems, and organizational boundaries faster than institutions can reliably localize authority or interrupt cascading decisions.

In the state governance dashboards of the 2010s, the gap between the indicator and the ground was a district, a season, a broken grievance channel. The next gap, the one that opens when an enterprise runs its agents confidently in territory they do not know is out of sample, is the exact same gap, stripped of all its natural friction. The dashboard will still be green when the propagation has already begun.

The institutions that survive this transition will not necessarily be the ones with the largest models or the widest context windows. They will be the ones with the better Action Boundary. The steersman does not get to know the whole sea. He only has to keep his hand on the tiller and force the correction. Let go, call the drift a destination, and the green dashboard will be the last thing the system shows you before it fails.

Anivar Aravind is an Engineering Executive and Systems Thinker. The Layer 8 is a professional newsletter on the power, incentive, and governance layer of digital infrastructure. His structural framework on corrigibility is at anivar.net/corrigibility, with preprints on SSRN. Async. Cross-posted to LinkedIn. You can subscribe on Substack or LinkedIn.

Discussion about this post

Ready for more?