📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google emphasizes that the core of modern software development isn’t the AI model, but the surrounding harness and context engineering. The model accounts for only about 10% of the system’s behavior, shifting focus to configuration, verification, and judgment.

A new Google whitepaper titled The New SDLC With Vibe Coding states that the AI model constitutes only about 10% of the overall system behavior, shifting the focus to the harness and context engineering. This challenges the common perception that advances in models alone will drive software innovation, emphasizing instead the importance of configuration, verification, and judgment in AI development.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, reports that 85% of professional developers now use AI coding agents regularly, with 51% doing so daily. It notes that roughly 41% of all new code is AI-generated, but the key insight is that the model’s role is minor.

The authors argue that the behavior of AI agents depends predominantly on the harness — the prompts, rules, tools, and context around the model — which accounts for about 90% of the system’s effectiveness. Evidence from benchmarks shows that changing only the harness can significantly improve performance, even with the same model.

They emphasize that cost and reliability are driven more by how the AI is configured and integrated than by the choice of model, advocating for a disciplined approach called agentic engineering that emphasizes verification, structured context, and continuous oversight.

At a glance

reportWhen: published early 2026

The developmentGoogle’s new whitepaper highlights that the most critical part of AI-assisted SDLC is the harness and context engineering, not the AI model itself.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This shift in understanding redefines how organizations should invest in AI. Instead of chasing the latest model, companies should focus on building robust harnesses and context management. This approach can lead to more cost-effective, reliable, and secure AI systems, as the whitepaper highlights that most failures are configuration issues rather than model limitations. It suggests that the future of AI in software engineering lies in structured engineering practices rather than solely in model innovation.

Amazon

AI code verification tools

As an affiliate, we earn on qualifying purchases.

Background on Evolving AI Development Practices

Over the past year, the AI industry has heavily promoted new models as the primary drivers of innovation. However, the whitepaper from Google signals a paradigm shift, emphasizing that the real work is in how models are integrated and controlled. The concept of vibe coding — quick prompts with minimal oversight — is contrasted with agentic engineering, which involves formal specs, testing, and verification. This reflects a broader trend toward disciplined AI practices that prioritize reliability and cost-efficiency.

Prior to this, the industry largely viewed model improvements as the main lever for progress, but recent experiments show that tweaking harnesses and context can yield greater performance gains at lower costs.

“The biggest shift isn’t the model itself but how we engineer around it, focusing on verification and context.”
— Addy Osmani

Amazon

AI development environment with context engineering

As an affiliate, we earn on qualifying purchases.

Uncertainties About Practical Implementation

It remains unclear how quickly organizations will adopt this disciplined approach at scale, and what specific tools or frameworks will emerge to support it. Further research is needed to quantify the cost savings and reliability improvements across different industries and use cases.

Amazon

automated testing software for AI systems

As an affiliate, we earn on qualifying purchases.

Next Steps for AI-Driven SDLC Adoption

Organizations are likely to begin investing more in developing structured harnesses, verification protocols, and context management tools. Industry standards and best practices may evolve to emphasize configuration and testing over model selection. Future research and case studies will clarify how this approach impacts productivity, cost, and security in real-world deployments.

Amazon

AI model harness configuration tools

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

The whitepaper indicates that the model’s behavior is heavily influenced by how it is configured, with the harness, prompts, tools, and context playing a larger role in determining output.

What is agentic engineering?

Agentic engineering refers to a disciplined approach that involves formal specs, verification, structured context, and oversight, moving beyond quick prompts to reliable AI systems.

How does this shift affect AI development costs?

Focusing on harness and context engineering can reduce costs by decreasing token burn, improving reliability, and lowering maintenance and security expenses, compared to vibe coding approaches.

Will this change how AI tools are built and sold?

Yes, it suggests that tools and frameworks supporting harness creation, context management, and verification will become more central, shifting value away from model innovation alone.

What are the main challenges in adopting this approach?

The main challenges include developing standardized practices, training teams in structured engineering, and integrating these methods into existing workflows.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Évian and the Fallout: What Europe Actually Wants From Amodei, Hassabis, and Altman

Author

CheckingMarket Team

Share article

The model is only 10%