Fundamental Research Labs Lands $33M for AI Spreadsheet Agent Shortcut That Beats Human Analysts

Fundamental Research Labs, an MIT spin-out that originally built AI agents to play Minecraft, has raised $33 million in Series A funding to scale its autonomous spreadsheet agent, Shortcut. The tool promises to turn a single natural-language prompt into a fully audited financial model—complete with DCF valuations, sensitivity analyses, and Monte Carlo simulations—in minutes. Demos that went viral this summer claim Shortcut already outperforms first-year analysts at top firms, forcing Windows users to confront a stark question: if an AI can build a spreadsheet end-to-end, why open Excel at all?

What Shortcut Does

Shortcut presents an Excel-like grid interface but replaces manual cell-by-cell construction with multi-agent AI orchestration. A user uploads source documents—a 10-K filing, a PDF with assumptions—and types a prompt such as: “Build a five-year DCF model, use their latest 10-K, then run sensitivity on revenue growth and WACC.” Shortcut’s agents then parse the documents, extract hard numbers, infer accounting relationships, populate the workbook with formulas, and output a finished model along with audit trails that link every input back to its source.

The system uses what Fundamental Research calls PIANO-inspired architecture, assigning distinct agents to subtasks like data ingestion, formula construction, and simulation runs. It currently connects to Anthropic’s Claude language model, though the company has stated it is working on its own frontier models.

The $33 Million Bet on Agentic Productivity

The Series A, led by Prosus with participation from Stripe CEO Patrick Collison, brings the startup’s total funding to over $40 million. A previous $9 million seed round drew backing from a16z Speedrun and Eric Schmidt. The cash infusion will expand Shortcut’s capabilities and accelerate development of the company’s broader platform, which also includes Fairies, a consumer assistant that schedules tasks across apps.

Sandeep Bakshi, an investment partner at Prosus, told TechCrunch that the team’s “ability to attract some of the brightest minds in the world, and turn that talent into real-world products, makes this a uniquely compelling venture opportunity.” CEO Dr. Robert Yang, a former MIT faculty member, sees productivity apps as a stepping stone toward building physical robots. “Eventually, we want to solve physical problems and move towards working on embodiment,” he said.

Performance Claims Under Scrutiny

Shortcut’s most eye-catching numbers come from company demos and social posts. The startup claims the agent scores over 80% on Microsoft Excel World Championship cases and completes them roughly 10× faster than human competitors. In a viral post, it asserted a 89.1% head-to-head win against first-year analysts from McKinsey and Goldman Sachs when judged blindly by managers, and the humans were given ten times more time.

These metrics are company-reported and rely on internal tests. No independent third-party benchmark or adjudicated contest has yet verified them, so the figures should be treated as provisional. Similarly, the use of Anthropic’s Claude is reported by news outlets but lacks a formal technical whitepaper confirming model versioning or on-premise alternatives.

Why Agentic Spreadsheets Are So Hard to Get Right

Turning a high-level prompt into a trustworthy financial model isn’t just about generating formulas. It demands a chain of technical feats:

Document understanding: Parsing inconsistent PDF tables, footnotes, and OCR noise from filings is a brittle, error-prone process.
Task decomposition: The system must break the prompt into subtasks, schedule agents, and reconcile conflicting outputs—requiring retry logic and fallback mechanisms.
State management: Spreadsheets are deeply stateful. Changing a file path or UI layout can break brittle automations unless the agent can adapt mid-task.
Auditability: Every hard-coded cell must trace back to its source. Shortcut’s UI shows source links and change logs, a necessary but not yet battle-tested feature for regulated industries.
Error detection: Generative models can hallucinate plausible-but-wrong formulas. Defensive verification—reconciliation checks, assertion tests—must be baked in.

Risks for Enterprise Adoption

For CIOs and finance leaders, the productivity promise is tempered by real risks:

Accuracy and hallucination: Early user reports indicate cells sometimes require full audit before use. Even small formula mislinks could cascade into material errors.
Regulatory exposure: Models that feed regulatory filings demand deterministic, versioned audit trails that agentic systems aren’t proven to provide.
Data residency: Shortcut touts on-device processing for sensitive documents, but customers must verify whether attachments ever reach the cloud and how encryption keys are managed.
Vendor lock-in: Turning core workflows over to a third-party agent creates dependency. Exiting must be painless, with robust Excel export and intact formulas.
Skills erosion: Over-reliance on agents could dull the modeling muscles needed to catch subtle mistakes.
Model drift: As hosted models update, results may change unpredictably—a nonstarter in finance where reproducibility is paramount.

A Pragmatic Adoption Blueprint for Windows Shops

Organizations can capture Shortcut’s speed gains while managing risk with a phased approach:

Pilot on non-critical models. Start with internal forecasts or budget templates where errors carry low external impact.
Build audit checks. Run parity tests against human-built models; script reconciliation steps that flag inconsistencies.
Keep humans in the loop. Require documented review and sign-off before any output feeds external reporting.
Verify data flows. Confirm whether files leave your tenant, how long data persists, and how deletion requests work.
Demand export guarantees. Workbooks must export to native Excel with trace metadata intact.
Cross-train staff. Analysts should be able to reproduce models independently, ensuring no single point of failure.

The Threat—and Opportunity—for Microsoft

Shortcut doesn’t just plug into Excel; it aims to be the destination where spreadsheets are born. If agentic interfaces gain traction, Microsoft faces a direct challenge: users might spend less time in Excel itself, opening it only to review AI-generated outputs. Yet Microsoft is far from defenseless. Copilot for Excel already assists with formulas, charts, and Python integration, and the company’s deep engineering resources could embed similar multi-agent orchestration natively. The upcoming battle won’t be about who can build a prettier grid, but who delivers the most trustworthy, auditable, and governable AI-first spreadsheet.

For Windows users, the immediate lesson isn’t that Excel will vanish. It’s that spreadsheet work is bifurcating: the UI remains familiar, but the heavy lifting increasingly shifts to AI agents. In that new world, the spreadsheet becomes a canvas for verification and oversight rather than primary construction.

What Comes Next

Shortcut’s public demos are a compelling proof of concept, but the gap between a viral demo and enterprise-grade reliability is measured in quarters or years. The product’s pricing—starting with a free trial and scaling to $40/month for Pro—positions it as a premium SaaS tool for finance teams and consultancies. Broader enterprise adoption will require SOC 2 compliance, SSO, data-loss prevention integrations, and hardened audit controls.

Watching eyes should treat the most dramatic claims as marketing until neutral benchmarks arrive. The real test will be whether Shortcut can deliver consistent accuracy across messy, real-world data sets—not just curated demos. If it can, the way financial professionals interact with spreadsheets will change. If it can’t, it’ll be a powerful assistant, not yet an analyst replacement.

For now, Windows shops can experiment carefully. Run a pilot, keep your governance hat on, and remember that even the smartest agent still needs a skeptical human reviewer.