2026-06-18By Léandre Desmaretzai-operating-systems

Codex as a GTM cockpit

How I used Codex as a GTM cockpit to turn content, CRM, website, enrichment, Ads diagnostics, and reporting work into inspectable, reviewable operating infrastructure.

Themes: AI Workflows / Operational Tools / Product Systems

Tags: AI / Codex / Gtm / Revops

I did not use Codex to automate GTM.

I used it to make GTM inspectable.

Inside a B2B game research platform, the visible tasks looked familiar enough: articles, landing pages, CRM records, enrichment workflows, Ads diagnostics, internal reports, contact paths, review queues.

The real problem was invisible drift. A tag means one thing in a spreadsheet, another thing in the database, and a third thing in someone's head. A qualified lead can be a signal, a superstition, or a side effect of a script nobody looked at for three weeks.

Most of that work is not glamorous. It is a long sequence of small, reversible decisions: which lead deserves attention, which email should not be sent, which campaign is leaking intent, which dashboard is lying by omission.

Codex did not make those decisions disappear.

It changed where I could make them from.

The leverage came from moving between layers without changing rooms.

Reading Map

This note moves through five claims:

GTM work breaks down when every layer translates the same signal differently
Codex became useful as an operator cockpit, not as a task robot
the cockpit works because acquisition, backend, workers, and review sit in one field of view
the durable asset is operating memory, not the first campaign or report
the limit is still operator judgment, privacy, and disciplined gates

The Work Was Never One Job

The tempting summary would be to say that Codex let one person replace a small team.

I do not think that claim is precise enough.

A team is not only a bundle of tasks. It is judgment, context, disagreement, taste, memory, and responsibility. You do not replace that by opening an AI tool and asking for outputs.

What changed is narrower and more useful: Codex compressed the handoffs that normally separate the work.

Content manager. RevOps. CRM operator. Marketing ops. Web developer. Data plumber. Ads diagnostician. QA reviewer. In a normal company, those motions sit in different tools, heads, backlogs, and levels of urgency.

The friction is not only execution. It is translation.

A landing-page idea becomes a copy doc, then a ticket, then a branch, then a QA pass, then a tracking question. A CRM rule becomes a Slack thread, then a spreadsheet, then a half-remembered exception.

Codex gave me a way to keep the question close to the machinery.

When a page needed a variant, I could move from positioning to code to rendered output. When a CRM rule looked suspicious, I could inspect the path before turning it into a ticket. When enrichment produced questionable rows, I could turn the issue into a review queue, a dry-run, and an operating note.

The gain was not that the work became automatic.

The gain was that the work became operable.

The Four-Layer Cockpit

I think of the setup in four layers.

The first layer is the public acquisition surface: the website, articles, landing paths, forms, tracking, consent, campaign parameters, and the visible promises people encounter before they know anything about the company.

This layer needs to stay legible. It is where positioning becomes pages, where intent becomes traffic, and where every vague claim eventually has to turn into something observable.

Behind it sits the operational backend: CRM, analytics, content workflows, reporting, consent state, segmentation, campaign state, enrichment results, internal documentation, and the ledgers that remember what happened.

The backend is where a page becomes accountable. A page is not just a page. It has routing, attribution, eligibility, reporting, and consequences downstream.

The third layer is the worker layer: scripts, sync jobs, checks, enrichment steps, reporting jobs, and bounded background processes. Their job is not to do growth magically. Their job is to collect state, normalize it, compare it against rules, run dry-runs, and prepare actions that can be inspected before anything sensitive changes.

Codex is the fourth layer: the operator cockpit.

It does not replace the stack. It gives me a way to sit above it, read across it, and coordinate changes without pretending the stack is simpler than it is.

A normal loop looks like this: read the current state, classify what matters, run a dry-run, prepare reviewable actions, apply only through controlled paths, verify the readback, then update the operating memory so the next loop starts from a better default.

For sensitive work, the valuable artifact is often not an automated action. It is a draft, a queue, a diff, a report, a proposed CRM decision, or a short list of blocked records.

I am not trying to build blind outreach, unchecked CRM writes, or a black-box growth machine. I am trying to keep public acquisition, backend truth, worker behavior, and operator judgment in the same field of view.

From Outputs To Operating Memory

The first campaigns, landing pages, reports, and workflows were not the asset. They mattered, of course, but they were also test shots.

They showed where the data was messy, where the copy was too generic, where tracking was too optimistic, where an enrichment step created more confidence than it deserved, where a workflow needed a review gate.

Value started compounding when those discoveries became tools.

A one-off landing page became a pattern. A manual CRM check became a command. A repeated source-quality question became a gate. A fragile reporting habit became a readback. A vague content workflow became a pipeline with sources, drafts, review, and public validation.

The durable value was not the first result.

It was the ability to turn the result into an operating memory.

Sometimes that memory was a script. Sometimes it was a Codex skill. Sometimes it was a checklist, a dry-run command, a database readback, a GitLab merge request, or a worker that prepared the next batch for review.

If all I had built was a pile of prompts, the work would have stayed fragile. A prompt remembers nothing unless someone knows when to run it, what context to give it, what output to distrust, and where the approved action should go.

The stronger version is different.

Judgment gets externalized into defaults, commands, checks, schemas, docs, and review paths. The system does not contain my brain, but it contains many of the decisions my brain would otherwise have to repeat every week.

This is what makes the work transmissible.

If I leave tomorrow, another capable operator should not inherit a mysterious Slack trail and a folder of half-explained experiments. They should inherit a cockpit: commands, sources of truth, write boundaries, review queues, failure modes.

They still need judgment. They still need context. They still need taste.

But they do not start from fog.

Why This Matters For Developers

One of the most interesting consequences is organizational.

A lot of GTM work touches code, but not all of it deserves to interrupt product engineering.

A developer should not have to be pulled into every landing-page iteration, CRM hygiene pass, tracking audit, content pipeline adjustment, internal report, enrichment review, or small acquisition experiment.

Developers do not disappear.

Their attention gets protected.

The best use of engineering time is still the work that truly requires engineering depth: product architecture, reliability, security, core platform decisions, performance.

The cockpit moves repeated GTM development out of the product team's critical path when it can be made safe, observable, and reviewable.

The economics follow from that.

The tool cost is real, but the important question is not the monthly invoice. It is how much latency and context loss the company accepts when every content, CRM, reporting, and website adjustment becomes a separate handoff.

One visible proof, for me, was cadence: 400+ merge requests across CRM, research, reporting, and acquisition infrastructure. The number is not the story. The operating rhythm is.

What I Refused To Automate

The cockpit works because it refuses certain fantasies.

I do not want an agent preparing external outreach as if a spreadsheet were truth. I do not want a CRM state changing because a model sounded confident. I do not want Ads interpreted as success when the signal is too thin. I do not want public copy that exposes private proof, or internal logic that depends on a summary nobody verified.

Private data stayed private, sensitive actions stayed gated, and anything external-facing had to remain reviewable before activation.

The more powerful the cockpit becomes, the more important the gates become.

Some outputs are allowed to become drafts. Some become review queues. Some become reports. Some become merge requests. Some stay read-only because the risk is higher than the gain.

AI becomes useful precisely when it is not secretly in charge.

The operator still owns strategy, public claims, budget logic, audience judgment, brand taste, CRM sensitivity, and final activation. Codex can compress the path from question to proposed action. It should not erase the difference between a proposal and a decision.

Trust is part of the constraint.

A cockpit like this depends on Codex, on OpenAI, on the surrounding tools, and on the company's willingness to let operational knowledge move through that layer. None of that trust should be blind.

The Real Asset

This kind of cockpit is not for everyone.

The hard part is not opening Codex, writing a few prompts, or connecting a CRM to an enrichment tool. The hard part is wanting to build the operating layer in the first place.

You need to enjoy turning repeated work into procedures, vague decisions into review gates, messy signals into something inspectable tomorrow.

There is a personal reason this works for me.

I have spent a large part of my life playing management and automation games. I like loops, bottlenecks, queues, dashboards, constraints, upgrades, routing problems, and systems that become more elegant when each part finally feeds the next one.

Running a GTM cockpit through Codex can feel strangely close to living inside one of those games, except the resources are landing pages, CRM records, market signals, articles, leads, review queues, and engineering time.

The objective is not to make everything automatic. It is to build a machine where the next good action becomes easier to see.

Some people would hate this. They would experience it as tool sprawl, operational noise, too many moving parts. I experience it as a readable board.

That is the limit, and also the point.

A weak operator can use the same tools to automate noise faster. A strong operator uses them to make judgment travel further. They know which task can be delegated, which one needs a dry-run, which signal is too thin, and which technical problem still belongs to a developer.

Codex did not turn GTM into a push-button pipeline.

It let me operate closer to a small, careful team: architectural enough to see the whole system, practical enough to move through it, and disciplined enough not to let the machine pretend it knows more than it does.

A company should not replace its content manager, RevOps person, CRM operator, web developer, or growth lead with an AI subscription.

The better claim is that a company can build an operating layer where many of those motions become inspectable, reviewable, and reusable. A strong operator can then move across them with far less latency.

The asset is not Codex plus prompts.

The asset is operational taste made executable: a cockpit where judgment survives the task, becomes inspectable, and makes the next good action easier to see.

This kind of cockpit is not for everyone.

The hard part is not opening Codex, writing a few prompts, or connecting a CRM to an enrichment tool. The hard part is wanting to build the operating layer in the first place.

You need to enjoy turning repeated work into procedures, vague decisions into review gates, messy signals into something inspectable tomorrow.

There is a personal reason this works for me.

The objective is not to make everything automatic. It is to build a machine where the next good action becomes easier to see.

Some people would hate this. They would experience it as tool sprawl, operational noise, too many moving parts. I experience it as a readable board.

That is the limit, and also the point.

Codex did not turn GTM into a push-button pipeline.

A company should not replace its content manager, RevOps person, CRM operator, web developer, or growth lead with an AI subscription.

The asset is not Codex plus prompts.

The asset is operational taste made executable: a cockpit where judgment survives the task, becomes inspectable, and makes the next good action easier to see.

Related notes

More along the same thread.

Posts are connected by real editorial proximity: a shared series, overlapping tags, or both.

2026-05-06

cutepr

Building cutePR from a real PR constraint

cutePR started when a real PR workflow hit the gap between messy spreadsheets and expensive enterprise suites. The project became a prototype for a calmer PR operating system.

cutepr / product / pr

2026-04-02

me-chatgpt-and-the-others

Me, ChatGPT, and the others

A personal essay about what happened when AI moved from science fiction into daily life, and why it became the ultimate tool for a generalist who learns by doing.

ai / chatgpt / codex

2026-05-05

clouds-war

Building a game with hidden depth

Clouds-War is being built as a readable strategy game first: the map, the team, and the moment matter immediately, while the deeper world appears through seasons, doctrines, and traces.

clouds-war / game-design / lore

Related projects

The work behind the note.

Projects stay publicly discoverable through the notes that make the work legible.

2026

active

Léandre Site Manifest

A public proof of how I build: direction, editorial structure, and technical execution held together in one readable system.

Built as a site-manifest rather than a portfolio, it centralizes ongoing work in a docs-first static system shaped by MDX, targeted Pretext surfaces, and fast iteration with Codex.

Next.js / TypeScript / MDX

2026

active

cutePR

A PR operating system for contacts, campaigns, product seeding, coverage, EMV, reports, and review-first AI assistance.

Built from a real PR manager workflow, cutePR centralizes the operational memory that usually lives across Excel, Outlook, coverage trackers, and enterprise tools too expensive for small teams.

Bun / Next.js / TypeScript

Repository private for now