Launchpad Agent

An AI teammate that actually learns your business.

Most AI agents are a clever demo that dies on day 30. Launchpad Agent is built differently — every agent ships with a written job description, a paired reliability gate, and an orchestrator that supervises specialist sub-agents. It learns from the work it does and the corrections you make, so it stops being something you have to re-explain and starts being someone you can delegate to.

Book a discovery call See the three tiers
From £1,495 build
From £299 / month
Reliability gate at day 90
UK-hosted UK GDPR-aligned
Time recovered

10–20 hrs

Per week, per agent. Drawn from the tasks you currently spend most time on — inbox triage, quote prep, scheduling, reporting, lead enrichment.

Survives past

Day 30

90% of AI workflows die in 30 days because they were built without a job description or a reliability gate. Launchpad Agent ships with both.

Learning loop

Every run

The agent reads its memory at session start, watches itself work, and updates that memory when you correct it twice. You don't re-teach it.

What it actually is

A small fleet of AI agents that share one brain and one set of rules.

Underneath, Launchpad Agent is a Hermes-based deployment running on a dedicated UK VPS or in your own Microsoft 365 / Google Workspace environment. On top, you get three things: a written job description for each agent (what it watches, reads, produces, won't do, and how it knows it worked), an orchestrator that delegates narrowly scoped read-only tasks to specialist sub-agents, and a shared memory layer the agent updates as it learns your business.

This is the same architecture pattern Shopify uses internally for "River" — a fleet of public agents that authored one in eight pull requests across the company in 2026. We've productised it for SMEs. You don't run an engineering team to use it; we run it for you under a managed-service retainer.

Who it's for

Three businesses where this pays back fastest.

We don't sell Launchpad Agent into businesses where it can't earn its keep inside 90 days. These three shapes are where it consistently does.

10–50 staff

Ops-heavy SME

You're losing 15+ hours a week across the business to repetitive admin — inbox triage, quote prep, follow-ups, scheduling — and you're past the point where adding another person fixes it.

  • Inbox triage and auto-draft replies
  • Quote prep from a shared template library
  • Follow-up sequence on stalled leads
  • Weekly performance digest into Slack or email
Likely fit: Single Agent or Agent Team tier.
Accountants, solicitors, consultants

Professional services firm

Your fee-earners are doing too much non-billable work — file prep, document review, client onboarding, internal research. The agent handles the prep so the partner stays on the case.

  • Document review and clause extraction
  • Client onboarding pack assembly
  • Internal research with cited sources
  • Engagement letter and proposal drafting
Likely fit: Agent Team tier with a context-engineering retainer.
Trades, hospitality, fleet

Multi-site operator

You run multiple sites or vehicles and you can't be everywhere. The agent watches the operational data — bookings, jobs, MOT due, stock — and only escalates what actually needs a human decision.

  • Job booking and dispatch optimisation
  • MOT, service and certificate reminders
  • Customer reactivation on the back of completed jobs
  • Cross-site reporting in plain language
Likely fit: Agent Fleet tier, often with Spatial Capture add-on.
How we build it

Every agent ships with a five-part job description.

The most common failure mode for SME AI is "we vibed it." An agent without a written job description quietly drifts off-task by week three, produces output that's technically a response but substantively useless, and dies by day 30. Ours don't, because every one of them is shipped with this:

What it watches

The exact trigger or schedule. "Every new email tagged @sales" — not "incoming work."

What it reads

The exact sources. The shared drive folder, the CRM view, the inbox label. Named, not implied.

What it produces

The exact output format. A draft reply in a specific style, a structured Notion entry, a JSON line.

What it won't do

The guardrails. Don't auto-send. Don't quote without sign-off. Don't touch the production database.

How we know it worked

The success condition. A canary field on every output. A weekly four-minute spot check we send you.

Pricing

Three tiers. Same architecture. Different scale.

One agent for a single workflow, a small team with an orchestrator, or a fleet covering most of the back-office. All prices exclude VAT. Build is one-off; monthly covers hosting, model usage, reliability checks and continued tuning.

Single Agent

One agent. One job description.

£1,495build

£299/month hosted, monitored, tuned.

  • One specialist agent, one written job description
  • Hosted on a UK VPS or in your own Workspace tenant
  • Memory layer, reliability gate, weekly spot-check report
  • One workflow — inbox triage, quote prep, follow-ups, scheduling, lead enrichment
  • Up to 2,000 agent runs per month
Book a discovery call

Agent Fleet

Five or more agents with full on-call.

£9,995build

£1,250–2,500/month hosted, monitored, on-call.

  • Five or more specialist sub-agents with orchestration
  • Dedicated Slack channel for handover and exception review
  • Eval framework + golden dataset maintained for each agent
  • Quarterly architecture review and skill expansion
  • Up to 30,000 agent runs per month
Book a discovery call

Model usage at typical SME loads is included in the monthly figure. Pricing is reviewed every six months as model costs change. Agent run counts are guidelines, not hard caps — we tell you before any meaningful overage rather than passing it through silently.

How it works

From discovery call to a working agent — five steps.

Most single-agent builds go from discovery to live in two to three weeks. Team and Fleet tiers add a tuning sprint and run six to eight weeks.

Discovery

45-minute call. Walk us through a normal week. We pull out the candidate workflows and rank them by hours-recovered.

Job description

One-page written spec per agent — watches, reads, produces, won't do, success condition. You sign it off before we build.

Build & prototype

Two weeks. We grow the agent on your real data inside a sandbox, not a generic demo. You see drafts before any go-live.

Pilot run

Two weeks live, in suggest-only mode. Every output gets a canary field. We meet weekly to read one run end-to-end.

Hand-over

Move to autonomous mode where appropriate. Reliability gate review at day 90. Quarterly tune-ups thereafter.

90-day reliability gate

Why most AI workflows die in 30 days — and how we stop yours.

The failure pattern is always the same. Day 1 works. Day 9 something changes silently. Day 14 the output is technically a response but substantively useless. Day 23 a customer notices. Day 30 you kill it and blame AI. Every Launchpad Agent ships with three defences against this.

Three defences, built in from day one.

Not a bolt-on, not a separate audit — every agent we ship has these. If yours doesn't, it's not done.

At day 90 we sit down with you and go through the reliability gate together. If the agent isn't earning its monthly fee in time saved, we tell you. We don't string clients along on dead workflows.

  • Canary output — every run includes a verifiable field (timestamp of the most recent source, count of items processed, hash of the input). If the canary stops moving, we know.
  • Silent-failure alert — if the agent finds nothing to do, it sends an alert, not an empty output. Empty outputs are the most dangerous failure mode in AI.
  • Weekly four-minute spot check — you read one full output end-to-end every week. We send the URL. It catches drift the canary can't.
UK-first by design

Built on UK rules — not retrofitted.

Your agent reads real business data — emails, documents, customer records. That carries the same compliance load any internal system does, and we treat it that way. 72+ years of combined experience across NHS, Police and MOD, including SC-cleared personnel for regulated-sector work.

  • UK GDPR + DPA 2018 alignment
  • UK-hosted VPS or your own tenant
  • Per-agent permission scopes
  • Read-only sub-agents by default
  • Versioned memory with full audit trail
  • Per-engagement DPA available on request
  • Eval framework + golden dataset retained
  • SC-cleared lead on regulated-sector work
Frequently asked

The questions we get on the first call.

If yours isn't here, the contact form gets to a human inside one working day.

How is this different from ChatGPT or Claude with a few custom instructions?

A chat session forgets you the moment you close the tab. Launchpad Agent runs on a schedule, reads named sources, writes to named outputs, has a memory layer that survives sessions, and reports back on a canary field every run. You don't talk to it — you delegate to it.

The other difference is the architecture. A single Claude tab is a generalist. Launchpad Agent is a small team — an orchestrator that decides what to do, plus specialist sub-agents that each do one thing well. That separation is what stops the "drifted off-task" failure mode.

What does the agent actually run on?

By default, a dedicated UK VPS we provision and manage. The agent is a Hermes-based deployment with our own opinion-set layered on top (the five-part job description, the orchestrator pattern, the reliability gate). The model is whichever current-generation frontier model best fits the workflow — usually Claude Opus or Sonnet for language-heavy work, GPT-class for structured planning. We treat models as swappable; you don't get locked in.

If you prefer, we'll deploy inside your own Microsoft 365, Google Workspace, or Azure tenant. The architecture is the same; the hosting boundary moves.

What can it not do?

It can't replace work that requires physical presence, regulated professional judgement, or commercial negotiation. It can prep the email; it shouldn't sign the contract. It can draft the quote; it shouldn't issue it without sign-off on anything over your threshold.

We write the "won't do" line into the job description before the build starts. If a workflow doesn't have a clean "won't do" boundary, that's a sign the workflow isn't ready for an agent — and we'll tell you that before quoting.

What happens at the 90-day reliability gate?

A one-hour review. We go through every agent against its job description and its success condition. Hours recovered, canary stability, exceptions raised, corrections you made, weekly spot-check results. If the agent isn't earning its monthly fee in time saved, we tell you — and either rewrite the job description, retire the workflow, or refund the last month.

We don't keep clients on retainers that aren't paying back. It's how the model survives.

Will it learn our voice and our terminology?

Yes. Every agent runs against a memory layer that includes your terminology, your preferred phrasing, your past corrections, and your decisions log. The first two weeks of any build is heavy on this — we sit alongside the people doing the work and capture the tacit knowledge that lives in their heads.

If your business has a substantial body of internal knowledge (documents, transcripts, decision logs) that needs to be made queryable, that's a KnowledgeForge conversation — the context-engineering retainer that sits underneath the agent layer.

What's the difference between this and a Zapier or Make automation?

Zapier and Make are rule engines — "if X happens, do Y." They're brilliant for deterministic workflows where the answer is always the same shape. Launchpad Agent is for judgement work — "is this a real enquiry or a duplicate? Which draft response best matches our voice? Should this quote use the standard template or the premium one?"

In practice the two work together. The agent makes the judgement call; Zapier or Make moves the result to the right system. We'll often deploy both.

Can I start small and scale up?

That's the recommended path. Start with one agent on the workflow that's costing you most time. Run it for 90 days through the reliability gate. If it earned its keep, add a second — and you're into Team tier without redoing the foundation. The architecture is the same at every tier; the agent count, runs and on-call cover are what change.

What if I want to leave?

You own the job descriptions, the memory layer, the eval dataset, and the agent definitions. We hand them over in a portable format (Markdown for the JDs and memory, JSON for the evals, plain Python or TypeScript for the agent definitions). The hosting moves with you, or we run it out for 90 days while you find an alternative. No lock-in, no exit fee.

Ready to stop re-explaining yourself to an AI?

Book a 45-minute discovery call. We'll walk through your week, pick the candidate workflows, and tell you which tier fits — same call, no follow-up sales script.

Book a discovery call