agentnetwork.ai · Groundwork · The Intelligence Operating System

Groundwork

The blueprint for building an organisation around knowledge, context, and agents. From first principles to working infrastructure.

April 2026 · By Craig Hepburn · Ground Truth ↗

Section I

The premise

The way companies are built is changing at the foundation level. Not the tools on top. The logic underneath. Most organisations are adding AI to an existing model. A small number are building a different model entirely. This is the blueprint for the second approach.

The shift in one sentence

The traditional company was built around humans as the execution unit. The new company is built around knowledge and context as the core asset, with agents as the execution unit.

Two fundamentally different bets

Bet one · most organisations

Augmentation

Add intelligence to the existing stack. Give people better tools. Make the translation faster. The human remains the bridge between the business need and the machine. They just cross the bridge more quickly.

Produces: faster execution of existing work. Measured by: adoption rates, time saved, productivity scores.

Bet two · the structural shift

Substitution

Build systems that handle the execution so the human no longer needs to be the bridge at all. The human defines the outcome, provides the context, sets the direction. The agent finds the path and runs it.

Produces: a different kind of working day. Measured by: what your people spend their time thinking about.

60%

of the average knowledge worker's day is work about work

Navigation, coordination, information retrieval, status updates, approval chains. Not the skilled work. The overhead around it. Asana found this across 10,000+ knowledge workers globally. The average worker switches between applications over 1,200 times a day. If AI makes that 60% faster, the hamster wheel runs faster. The hamster is still on the wheel. The opportunity is escaping it.

Source: Asana Anatomy of Work Index · Harvard Business Review · Cornell Workgeist Report 2024

The diagnostic question. Has your AI adoption changed what your people spend their time thinking about? If the answer is no, you are making the first bet regardless of what the procurement deck says. The test is not which tools you have deployed. The test is whether the execution layer is still consuming the attention of your most valuable people.

Section II

The intelligence operating stack

Five layers. Built from the bottom up. The models will change. The frameworks will evolve. But the logic of the stack is durable because it reflects what agents actually need to function well: knowledge, context, infrastructure, execution, and human direction. The diagram below shows the architecture at a glance. Click each layer below to explore in detail.

The architecture — overview

The Groundwork intelligence operating stack Five-layer architecture from knowledge architecture at the foundation to human intelligence at the top. Compounds upward The intelligence operating stack The models change. The frameworks evolve. What compounds permanently is what you build here. Build order: Foundation first Agents built on top Humans elevated above 01 · Foundation Knowledge architecture SYSTEM.md · MEMORY.md · STANDARDS.md · CLIENT.md Plain markdown Agent-readable Irreplicable Compounds forever Cost: time 02 · Living state Context layer CLIENT.md · decision logs · project state · relationship notes · pattern library Per client Per project Per domain Auto-maintained Cost: time 03 · Operating environment Agent infrastructure Harness · heartbeat · skills · tools · shell · orchestration · logs Soul SYSTEM.md Memory MEMORY.md Context CLIENT.md Tools APIs · shell Heartbeat Cron · loop Outcomes Standards Cost: build once 04 · Agent workforce Execution layer Agents doing the work · continuously · without being asked · at token cost Email triage Research sync Client briefings CRM updates Reporting Cost: tokens 05 · Human intelligence Direction & judgement Defining what matters · relationships · knowing when the brief is wrong · building what is new Strategy Relationships Judgement Creation Cost: attention groundwork · agentnetwork.ai · Craig Hepburn · 2026

The stack — explore each layer

05

Human intelligence layer

Direction, judgement, relationships, and the things that require accountability

Humans only

What lives here

  • Defining what the outcome should be
  • Relationships that make difficult conversations possible
  • Knowing when the brief is wrong before starting
  • Decisions where being wrong has real consequences
  • Reading context no dataset fully captures
  • Building things that do not yet exist
  • Holding accountability for what agents produce
  • Setting the direction when the path is genuinely unclear

What this replaces

  • Management layers that relayed information up and down
  • Status meetings that monitored execution progress
  • Project coordination that tracked tasks
  • Reporting that aggregated context already held in systems
  • Middle management as the primary coordination mechanism

Cost unit

Human attention · irreplaceable · the scarcest resource in the new model

Elevated work — what humans do when execution is handled

Defining what matters this quarter Client relationships that require trust Strategic decisions with real stakes Creative direction and vision Sensing what the market actually needs Knowing when to stop a project Building what has never been built Reading the room in a negotiation
Block (Dorsey + Botha, March 2026): "Three roles going forward. Individual contributors doing actual work, augmented by agents. Directly responsible individuals owning specific cross-cutting problems. Player-coaches who raise the capability of the people around them. No permanent middle management layer. Everything else the old hierarchy did, the intelligence layer now coordinates."
04

Execution layer

Agents doing the work — continuously, without being asked, at token cost

Agents

What agents handle today

  • Email triage, extraction, and drafted responses
  • Research synthesis across multiple sources
  • Client briefing documents from context files
  • Meeting intelligence logged to memory
  • Reporting generated from live system state
  • Scheduling and coordination routing
  • Code scaffolding, tests, and documentation
  • CRM updates from conversation context
  • First-draft proposals from context and standards files
  • Data extraction and transformation

Still developing (honest assessment)

  • Complex multi-step reasoning across ambiguous inputs
  • Regulated decision-making in banking and healthcare
  • Novel problem types with no training precedent
  • High-stakes actions without human review
  • Tasks requiring real-time physical world interaction

Cost unit

Tokens per task · Gartner forecasts 90% cost reduction by 2030 · agentic workflows consume 5–30x more tokens than simple queries

Example tasks moving from human to agent

Weekly client status report Inbound email triage and routing Market research synthesis Meeting notes to action items Proposal first draft CRM contact updates Competitor monitoring Invoice processing Code review and documentation Social media scheduling Internal FAQ responses Data formatting and transformation
Block's Goose agent framework: developer productivity up 40% per engineer since September 2025. Gross profit per employee reached $1M in 2025, targeting $2M in 2026. A risk underwriting model that previously took a full quarter to build was completed in days.
03

Agent infrastructure layer

Harness, heartbeat, tools, memory, orchestration — the operating environment

Built once

Core components

  • SYSTEM.md — identity, purpose, rules of engagement
  • MEMORY.md — accumulated knowledge across all sessions
  • Heartbeat — scheduled loop running without being asked
  • SKILL files — capabilities described in plain text
  • Shell access — direct execution against the machine
  • Tool connections — APIs, email, calendar, CRM, data sources
  • Logs — auditable record of decisions and actions

Current frameworks (April 2026)

  • OpenClaw — 345K stars, gateway architecture, 5,700+ skills
  • Hermes — self-improving, persistent memory, agent-loop
  • Pi / Claude Code — minimal harness, 4 core tools
  • Goose (Block) — open source, model-agnostic
  • LangGraph, CrewAI — multi-agent orchestration
  • Claude Managed Agents — production-grade hosted infrastructure
  • Perplexity Computer — 19-model routing, persistent sessions

The minimal viable harness — what you actually need

A model (reasoning engine) A filesystem (storage) Markdown files (knowledge) A shell (execution) A cron job (heartbeat)
Marc Andreessen (Latent Space, April 2026): "It's basically LLM plus shell, plus filesystem, plus markdown, plus cron." Five components. Four of them have existed since the 1970s. The infrastructure was always there. What was missing was a model that could understand human intent and act on it directly. See the full Agent Landscape at agentnetwork.ai for current framework comparison.
02

Context layer

The living picture of every relationship, project, commitment, and decision

Maintained continuously

What context contains

  • CLIENT.md — who they are, what they need, what is open
  • Decision log — what was decided, when, and why
  • Project state — current status, blockers, next actions
  • Relationship notes — preferences, communication style
  • Pattern library — what works in this sector or situation
  • Commitment tracker — what has been promised and to whom

How it is maintained

  • Agent updates MEMORY.md after every significant session
  • Structured folder per client, project, and domain
  • Plain markdown — readable by humans and models alike
  • No login required — no SaaS wall between agent and knowledge
  • Version controlled — full history of what changed and when
  • Human review at decision points, agent maintenance in between
Andrej Karpathy (April 2026): 100 articles, 400,000 words, maintained entirely by an agent. His instruction: write documentation for agents, not for people. If the agent understands it, it can explain it to any human who needs it. The discipline of making information legible to agents supersedes making it legible to humans.
01

Knowledge architecture layer

The foundation — what the company knows, encoded in a form agents can act on

Built first

What goes here

  • Every process described precisely enough to evaluate
  • Definition of good for every deliverable type
  • Domain expertise encoded in plain language
  • Lessons from every engagement, accessible to every agent
  • Brand voice, standards, and non-negotiables
  • Structured data sources agents can query directly
  • Failure patterns — what has gone wrong and why

Why this layer must come first

  • Agents without context are expensive and unreliable
  • McKinsey: 80% of enterprises cite data limitations as the primary blocker to scaling agentic AI
  • Context cannot be bought — only built over time
  • Competitors can buy the same models; not your knowledge
  • Every week this layer compounds — the moat deepens automatically

Cost unit

Time to encode · the only irreplicable asset · compounds indefinitely

McKinsey (2026): fewer than 10% of enterprises that experiment with agents scale them to deliver tangible value. Eight in ten cite data limitations as the primary blocker. The stack fails without this foundation. You are not building an agent. You are building the knowledge infrastructure the agent runs on.

Section III

Where are you now?

Before deciding where to focus, it helps to be honest about which layers you currently have and which are missing or broken. Answer the five questions below. No right answers. The output is a starting point, not a verdict.

Five questions — honest answers only

1. Where does your organisation's knowledge currently live?

2. When a new person joins your team, how do they learn what good looks like?

3. How does your team currently use AI?

4. What percentage of your senior people's day is execution versus direction?

5. If your two most valuable people left tomorrow, what would you lose?

Your starting point

Section IV

Building the knowledge architecture layer

This is the layer most organisations skip. It is also the reason most agent deployments underperform. Before you think about which framework to use or which model to run, you need to answer one question: what does this agent need to know about how we work in order to do this work without us?

The three files every domain needs

SYSTEM.md Identity · purpose · rules
# Agent identity and operating constitution # This file is read at the start of every session # Treat it as the onboarding document for a senior hire ## Who you are You are [name], the [role] for [organisation]. You serve [who you serve] by [what you do]. Your primary objective is [specific outcome, not vague purpose]. ## How you operate You run on a [frequency] heartbeat. You check for [what to check]. You act on [what you can act on without asking]. You flag and wait on [what requires human decision]. You never [hard constraints — things you will not do]. ## Standards you hold Every output you produce must meet these standards: - [Standard one — specific and testable] - [Standard two — specific and testable] - [Standard three — specific and testable] When in doubt, flag rather than proceed. ## Voice and style When communicating externally: [tone, register, what to avoid] When logging internally: [format, level of detail required]
MEMORY.md Accumulated knowledge · patterns · lessons
# Accumulated knowledge across all sessions # Updated by the agent after significant interactions # This is where institutional knowledge lives ## Patterns observed - [Pattern]: [Context in which it appears] → [What it usually means] - [Client name] responds better to [approach] than [approach] - [Process] tends to stall at [stage] because [reason] - When [trigger], the right action is almost always [action] ## Decisions made - [Date] [Decision]: [Reasoning] [Outcome] - [Date] [Decision]: [Reasoning] [Outcome] ## Things that have not worked - [Approach] failed when [context] because [reason] - [Do not try X] — tried on [date], result was [outcome] ## Active context Current priorities: [list] Things to watch: [list] Open questions: [list]
CLIENT.md Context · history · commitments
# Living context file — updated after every significant interaction # Template: one file per client, per project, per engagement ## Who they are Organisation: [name, sector, size] Primary contact: [name, role, how they prefer to communicate] Relationship since: [date] · Relationship health: [honest assessment] ## What they have asked for Original brief: [plain language summary] What they actually need: [your interpretation, may differ from brief] What success looks like to them: [specific, not vague] ## What has been decided - [Date] [Decision] [Who made it] [Any conditions] ## What is still open - [Question or decision pending] [Who owns it] [By when] ## What we know that is not in the brief - [Observation about how they work] - [Preference or sensitivity worth knowing] - [Political or organisational context that affects the work]

The folder structure

# Your organisation's knowledge architecture
knowledge/
  agents/
    SYSTEM.md     # Agent identity and constitution
    MEMORY.md    # Accumulated learning
    skills/      # Capability files
  clients/
    [client-name]/
      CLIENT.md   # Living context
      decisions.md # Decision log
      briefs/    # Project briefs
  standards/
    delivery.md  # Definition of good
    voice.md    # Brand and communication standards
    process.md  # How we do things here
  domain/
    market.md   # Sector knowledge and patterns
    tools.md    # What tools exist and how to use them

The first encoding session

Block out two hours with the person who knows the most about the domain you have chosen. Ask three questions.

Question one

What does good look like?

How would you know if something was excellent versus merely acceptable? Be specific enough that someone new could evaluate their own work without asking you.

Question two

What consistently goes wrong?

What are the failure modes? Where do things typically stall? What mistakes keep happening? What do you wish people understood before starting?

Question three

What takes six months to learn?

What context does someone need to do this well that they would only absorb after significant time? What is never in the documentation but should be?

Record or transcribe the conversation. Have an agent turn it into the first draft of your STANDARDS.md and MEMORY.md. Review and correct. That two-hour session, properly encoded, is worth months of agent training. The act of encoding the knowledge is itself clarifying — most organisations discover they have never made explicit what they have always assumed was obvious.

Section V

Building the agent infrastructure layer

Once the knowledge architecture exists, you can build the agent. Not before. The infrastructure layer is where most organisations start, which is why most agent deployments underperform. The order matters more than the tooling.

Choosing the right framework — honest comparison

Framework Best for Technical requirement What it gives you Honest limitation
OpenClaw / Hermes Personal agent, always-on across messaging channels Moderate — comfortable with command line WhatsApp/Telegram integration, 5,700+ skills, self-improving loop Security CVEs in OpenClaw; complex for regulated environments
Claude Code / Pi Developer workflows, coding agents, minimal harness Developer comfort required Deep codebase context, git integration, extensible via tools Claude-only; not a full autonomous agent without additional harness
Goose (Block) Teams wanting open-source, model-agnostic foundation Technical team required for setup Privacy-first, local execution, proven at scale Less ecosystem than OpenClaw; requires more self-built skills
LangGraph / CrewAI Multi-agent orchestration, complex workflow automation Engineering resource required Full orchestration control, role-based agents, custom pipelines Higher complexity; over-engineered for simple use cases
Claude Managed Agents Production deployments requiring security and reliability Low — managed infrastructure Durable sessions, crash recovery, credential vaults, SOC2 Claude-only; ongoing infrastructure cost
n8n / Zapier (no-code) Teams without technical resource who need automation fast None — visual builder Fast to deploy, wide integrations, no code required Not truly agentic — triggered workflows, not autonomous agents

The full landscape of every current agent framework, platform, and tool is mapped at agentnetwork.ai. Use the Agent Mapper to understand where you sit on the spectrum from tool-use to full autonomous agents. The Groundwork resource focuses on the architecture decisions; the landscape covers the tooling in full.

What separates a tool from an agent

A tool

Waits to be asked

Responds when prompted. Has no memory between sessions. Exists only in the moment of being asked. Does not monitor, initiate, or accumulate. Most people who believe they are using AI agents are using tools.

An agent

Has a heartbeat

Runs on a scheduled loop whether or not a human asks it to. Has persistent memory that accumulates across sessions. Has a defined role, tools to act with, and the authority to act within boundaries without waiting to be asked.

Section VI

The build sequence

The order matters more than the speed. Most failed deployments started with the agent. The ones that work started with the knowledge. Here is the sequence that compounds correctly.

Week by week — the first ninety days

1
Week 1–2 · Foundation

Choose one domain. Map the knowledge that currently exists in people's heads.

Pick the single area where the most valuable knowledge lives informally. Run the first encoding session. Create SYSTEM.md, MEMORY.md, and the first CLIENT.md or STANDARDS.md. Do not deploy an agent yet. The knowledge layer must exist first. Most teams find this harder than expected — writing down what seems obvious is itself clarifying work.

2
Week 3–4 · First agent

Deploy a minimal agent against the knowledge you have just built. Give it one task.

Choose the single highest-value task in your chosen domain that is currently consuming human time. Brief the agent with the knowledge files you built. Run it. Observe what it gets right and what it gets wrong. Everything it gets wrong is a gap in your knowledge architecture, not a failure of the agent. Add what is missing to the files. Run it again.

3
Week 5–8 · Heartbeat

Add the heartbeat. Let the agent run without being asked.

Set up the scheduled loop. Define what the agent checks for and what it does when it finds something. This is the moment the agent shifts from a tool to a system. The first time it handles something you did not ask it to handle, and handles it correctly, is the moment you understand what this infrastructure actually is. Log everything. Review the first two weeks of autonomous outputs carefully.

4
Week 9–12 · Extend

Add a second domain. Let the first agent's memory inform the second.

By now the first agent has a MEMORY.md that is accumulating real knowledge. The patterns it has observed, the decisions it has logged, the failures it has recorded — all of this is now part of your knowledge architecture. When you extend to a second domain, the second agent starts with a richer foundation than the first. This is the compounding logic at work. Each deployment improves the substrate for the next.

5
Month 3 onwards · Compound

Stop measuring adoption. Start measuring what your people are spending their time thinking about.

By month three you should be able to answer a question you probably could not answer on day one: what would your most valuable people build if the execution layer was not consuming their attention? That question, and its answer, is the signal that the architecture is working. The organisations that reach this point are not talking about AI productivity anymore. They are talking about what they are building now that they could not previously reach.

The most common failure mode. Deploying the agent before the knowledge layer exists. The agent is technically functional but produces generic, context-free outputs that require constant correction. The team concludes agents are not ready. The reality is the foundation was not ready. The agent was willing; the knowledge was not there to guide it.

Section VII

The agent's constitution

The models will change. Frameworks will be superseded. Costs will fall. What will not change is what an agent needs to function well: a clear sense of what it is, what it knows, what it remembers, what it can do, and what it is for. This is the permanent structure underneath all of it.

The six elements that persist across every model change

Soul

Identity and purpose

Who the agent is, who it serves, what it is for, and what it will not do. This is the SYSTEM.md. Every other element depends on this being clear. An agent without a soul is just an expensive autocomplete.

Memory

Accumulated knowledge

What the agent has learned across every session. Patterns, decisions, failures, preferences. The MEMORY.md is the thing that turns a capable model into an entity that knows your organisation. Without memory, the agent starts from zero every time.

Context

Situational awareness

The living picture of the current state. Who are we working with. What has been decided. What is open. What has changed since last time. Context is what makes the agent's output relevant rather than generically correct.

Tools

Capability to act

The connections to external systems, data sources, and services. Email, calendar, CRM, APIs, files. Without tools, the agent can think but not do. The tools are the hands. The quality of the connections determines the scope of what is possible.

Heartbeat

Autonomous operation

The scheduled loop that wakes the agent and asks it to check, decide, and act without being prompted. This is the architectural difference between a tool and an agent. A chatbot waits. An agent with a heartbeat is already working.

Outcomes

Definition of good

What success looks like, precisely enough for the agent to evaluate its own output. This is the STANDARDS.md. An agent without a clear definition of good will produce technically correct outputs that miss the point. This file is what turns capability into quality.

The permanent truth

"The models will be replaced. The frameworks will evolve. The costs will fall. What compounds permanently is the knowledge, the context, and the memory you build today. That is the moat. And unlike every other advantage in technology, it is one your competitors cannot buy."

Craig Hepburn · Ground Truth · April 2026