Garry Tan System Map

Product Anatomy

A layer-by-layer map of the product parts, surfaces, loops, and hidden infrastructure.

gstack = methodgbrain = continuitygbrain-evals = proofYC = network

Product Anatomy

Core Read

Garry's system is best understood as one product with multiple visible surfaces, not three unrelated repos. The product is an AI-native operating system for high-leverage builders:

  1. gstack gives the agent a work process.
  2. gbrain gives the agent durable memory.
  3. gbrain-evals proves the memory and workflow claims.
  4. YC turns the proof into distribution, recruiting, founder affinity, and cultural authority.

The product is not "skills." The product is a way for one high-agency operator to convert intent into shipped software, preserved context, measured learning, and public proof.

Product Stack

LayerProduct PartUser SeesSystem Actually DoesValue Created
1Narrative / belief layerREADME story, contribution graphs, Karpathy/OpenClaw references, YC credibilityReframes AI coding from autocomplete into company-building leverageMakes the user willing to try a new operating model
2Workflow layergstack slash commandsEncodes product, design, engineering, QA, security, release, and retro rolesTurns AI labor into a managed sprint
3Browser/action layerbrowse, QA, screenshots, sidebar, pair-agentGives agents eyes, clicks, shared tabs, cookies, and verifiable UI evidenceMoves from text-only claims to real-world inspection
4Safety/gate layerreview, cso, careful, freeze, guard, ship checksAdds stop points, approvals, confidence thresholds, and scoped editsMakes faster work survivable
5Memory layergbrain pages, search, MCP tools, source citationsPersists pages/chunks/links/sources/embeddings/timelines across sessionsMakes work compound instead of reset
6Skillpack behavior layergbrain skillsTells agents when to read, write, cite, enrich, verify, publish, and askTurns a database into operational memory
7Eval/proof layerbenchmark reports and scorecardsRuns public and in-house memory benchmarks with artifacts and caveatsMakes claims falsifiable
8Distribution/funnel layerMIT repo, install commands, OpenClaw/host adapters, YC hiring/apply CTAsConverts OSS attention into users, contributors, candidates, and foundersBuilds movement and talent flow

Product Parts By User Job

User JobProduct Part That Handles ItWhy That Part Exists
"Help me think through what to build."gstack office-hours, plan-ceo-review, autoplanThe first failure in AI coding is often building the wrong thing faster.
"Turn this into a plan I can trust."plan-eng-review, plan-design-review, plan-devex-reviewThe system forces taste, architecture, DX, failure modes, and tests into the open.
"Build without wrecking my repo."careful, freeze, guard, investigate, reviewAI's speed increases the blast radius of bad assumptions.
"Show me what actually happens in the app."browse, open-gstack-browser, qa, qa-only, design-reviewScreenshots, clicks, console errors, and repro steps beat model assertions.
"Ship this safely."ship, setup-deploy, land-and-deploy, canary, benchmarkShipping is not done until tests, PR, deploy, and production verification are handled.
"Remember what we learned."learn, context-save, context-restore, setup-gbrain, sync-gbrainThe productivity gain collapses if every session starts cold.
"Make my knowledge usable by agents."gbrain pages/chunks/sources/MCP/queryKnowledge must become queryable, citable, source-scoped, and writable.
"Make research and notes operational."gbrain ingest, enrich, meeting-ingestion, briefing, book-mirror, data-researchThe system turns raw material into decisions, timelines, and daily prep.
"Prove the memory works."gbrain-evals LongMemEval, BrainBench, reportsWithout evals, memory quality is vibes.
"Join or contribute to the movement."MIT license, host adapters, ClawHub skills, YC hiring/apply CTAsThe artifact becomes recruiting and distribution.

The User-Facing Product Surfaces

1. README As Product

The README is not just documentation. It is the landing page, credibility asset, demo, onboarding script, competitive argument, and recruiting funnel.

Its jobs:

2. Slash Commands As Product UI

The command names are the interface:

This is strong because the commands are memorable role metaphors. The user does not need to learn an abstract API; they hire a specialist.

3. Skill Files As Product Logic

The skills are product logic written in markdown. They encode:

This is why the system feels like process, not prompting.

4. CLI/MCP As Product Infrastructure

gbrain's CLI and MCP surfaces turn the brain into typed tools. This matters because durable memory cannot rely on natural-language shell-outs alone.

Important product choices:

5. Browser As Product Surface

The browser layer is a major product surface because it makes agent output inspectable.

It supports:

The product insight: a coding agent without a browser is blind to a large part of software quality.

6. Benchmarks As Product Surface

gbrain-evals makes proof public. The benchmark docs are not internal QA artifacts; they are market-facing credibility.

They communicate:

Hidden Infrastructure

Hidden PartWhy It Matters
Source scopingPrevents personal/team/client/source contamination.
Chunking and embeddingsDetermines what the agent can recall and cite.
Code graph edgesMoves coding memory beyond grep into symbol relationships.
MCP operation contractsPrevents tool schema drift and lets clients call memory safely.
OAuth/scopes/loggingMakes remote memory access possible without total trust.
Skill routingKeeps the agent from loading everything all the time.
Context save/restoreProtects long-running work from session resets.
Eval candidates/artifactsConverts real use and benchmarks into reviewable proof.
Host adaptersLets the method travel beyond Claude Code.

Feedback Loops

Loop 1: Build Loop

Idea -> office-hours -> CEO/design/eng/DX review -> implementation -> review -> QA -> ship -> canary -> retro.

This loop makes AI coding feel like an engineering org.

Loop 2: Memory Loop

Work happens -> pages/chunks/links/timelines update -> future sessions search/cite/reuse context -> new decisions are written back.

This loop makes the system compound.

Loop 3: Proof Loop

System changes -> evals run -> benchmark reports update -> public claims become sharper -> contributors know where to improve.

This loop makes the system credible.

Loop 4: Distribution Loop

Public proof -> OSS adoption -> contributors and host adapters -> more use cases -> stronger story -> YC hiring/founder funnel.

This loop makes the system a movement, not just a repo.

Product Requirements By Part

PartMust DoFailure Mode If Missing
NarrativeMake the user believe the new workstyle is realTool looks like another prompt pack
InstallCreate a fast first winUser churns before experiencing leverage
WorkflowGive agents roles, gates, and stop conditionsParallel work becomes chaos
BrowserProduce real UI evidenceAgent claims remain unverified
SafetyConstrain destructive or speculative behaviorFaster work creates faster damage
MemoryPersist source-scoped, cited contextEvery session starts cold
SkillsEncode behavior around memory and workDatabase stays passive
EvalsMeasure retrieval and workflow qualityClaims become marketing
FunnelConvert attention into users/contributors/candidatesOSS interest dissipates

What Is Actually Being Sold

The public artifact is free and MIT licensed, so the sale is not a subscription. What is being sold is belief in an operating model:

That is why the YC CTA belongs inside the repo. The repo is proof of the workstyle YC wants builders to adopt or help build.

Open Product Questions

  1. What is the true first magical moment: /office-hours, /qa, /ship, or /setup-gbrain?
  2. Does the full product need a unified dashboard, or does the command-line/README surface remain enough?
  3. Where does gstack end and gbrain begin from a user's mental model?
  4. Which memory writes should happen automatically versus require explicit approval?
  5. Can gbrain-evals become a general benchmark standard, or is it mainly proof for gbrain?
  6. How much of the YC funnel should be explicit before it starts to feel like recruiting instead of OSS?
  7. What is the smallest productized version of this system that a normal technical founder can adopt in under 30 minutes?

Ren/OpenClaw Implications

The lesson for Ren is not to copy the exact skill list. The lesson is the product architecture:

For OpenClaw specifically, the opportunity is to become the orchestration layer where these parts are easier to run together: agent spawning, session continuity, channel delivery, skill routing, browser access, memory writes, and eval evidence.