Jagged Intelligence: The Hidden Risk of Windows Copilot and AI Overtrust

Melanie Mitchell warns that treating LLMs like Microsoft Copilot as human minds leads to dangerous overtrust. Her concept of 'jagged intelligence' explains why these models excel at some tasks but fail unexpectedly at others, posing governance risks for Windows IT. The article outlines practical strategies to mitigate these risks through verification, training, and policy.

A Windows IT administrator asks Copilot to generate a PowerShell script for bulk user provisioning. The output looks flawless—syntax clean, commands logical. But buried inside is a parameter that silently disables multi-factor authentication for new accounts. The admin, trusting the AI’s apparent expertise, deploys it across the organization. Within hours, a breach occurs.

This scenario is not hypothetical. It is the inevitable result of treating large language models (LLMs) like Microsoft Copilot as human-like minds rather than what they truly are: powerful, brittle pattern-matchers with no understanding of the world. Cognitive scientist Melanie Mitchell has spent years warning against this anthropomorphic trap, coining the term “jagged intelligence” to describe how AI systems can appear brilliant one moment and absurdly incompetent the next. For Windows users and IT professionals, understanding this jaggedness—and the governance it demands—is no longer optional. It is a frontline defense against cascading failures.

The Illusion of Understanding

Melanie Mitchell, a professor at the Santa Fe Institute and author of Artificial Intelligence: A Guide for Thinking Humans, argues that the central mistake in current AI discourse is assuming fluency equals intelligence. LLMs, she explains, are “trained on vast corpora of human-generated text, allowing them to mimic patterns of language extraordinarily well. But they do not possess the mental models, intentions, or common-sense reasoning that humans apply to every situation.”

This distinction is critical. When you ask Copilot to summarize a document, it doesn’t “read” it; it predicts tokens based on statistical regularities. It can produce coherent legal arguments, yet fail to answer whether a car can fit into a parking space. The intelligence is jagged—highly capable along certain dimensions, virtually absent along others. And because the peaks are so impressive, we overlook the valleys.

Microsoft itself has been careful in its official language. Copilot is described as “a powerful productivity tool that uses large language models” and is “designed to assist, not replace, human decision-making.” But in daily practice, the line blurs. When an AI outputs polished code, executives summaries, or security recommendations, the temptation to trust it entirely is immense. That trust, unchecked, becomes the threat.

The Jagged Frontier in Windows Environments

The concept of a “jagged frontier” of AI capability was empirically demonstrated in a 2023 study by researchers at Harvard Business School and Boston Consulting Group. They found that LLMs like GPT-4 excel at tasks well-represented in their training data—creative writing, certain coding tasks—but stumble on tasks requiring novel reasoning or domain-specific knowledge. For Windows IT, the frontier is dangerously unpredictable.

Consider common administrative tasks:
- Group Policy Configuration: Copilot can draft a GPO based on a natural-language prompt. It might include correct settings for password complexity but overlook dependencies that break network access for non-domain devices.
- Troubleshooting Scripts: A generated diagnostic script may correctly identify a known error code but apply a fix valid only for Windows Server 2022, crashing a 2019 instance.
- Security Analysis: Summary of a threat report might be articulate yet omit a key indicator of compromise because the statistical model did not weight it heavily enough.

These are not random bugs. They stem from the fundamental architecture of LLMs: autoregressive token prediction with no grounding in physical or logical reality. The model has never managed a real Active Directory, never felt the consequences of a misconfigured firewall, never experienced the nuance of a hybrid Azure environment. It simply composes the statistically most probable continuation of your prompt.

The Anthropomorphic Trap

Humans are hardwired to attribute agency. When Copilot responds in fluent paragraphs, we perceive a mind behind the words. Mitchell calls this the “Eliza effect,” after the 1960s chatbot that convinced many users it understood their emotions. Modern LLMs amplify this a hundredfold. They can pass the bar exam, yet fail to consistently generate unique random numbers. They can translate languages but cannot understand that a five-pound bag of feathers is just as heavy as a five-pound bag of steel.

This cognitive bias leads to what Mitchell terms “mode collapse of trust”: users begin to treat AI-generated outputs as authoritative, skipping verification. In a Windows enterprise, that means an admin might apply a Copilot-suggested registry edit without checking if it applies to the current OS build. It means a developer could integrate an AI-generated code snippet that introduces a subtle race condition, assuming the “smart” system wouldn’t produce errors.

The risk is compounded by how Microsoft markets Copilot. Its seamless integration into Windows 11, Office 365, and Azure creates an aura of infallibility. When an AI assistant can write an email, summarize a meeting, and generate code from comments, the natural conclusion is that it’s “thinking.” It’s not. It’s doing next-token prediction, and its jaggedness means that the next token can be catastrophically wrong.

Real-World Consequences

Already, there are documented cases of LLM overtrust causing damage. Lawyers have filed briefs containing fabricated case citations generated by ChatGPT. Software engineers have deployed AI-suggested code with critical vulnerabilities. In one high-profile incident, an AI-generated security policy recommendation omitted essential encryption requirements because the prompt lacked the word “compliance.”

For Windows-specific systems, the potential is even starker. An IT helpdesk chatbot powered by Copilot could confidently give an end user instructions that elevate their privileges incorrectly. A network administrator might use Copilot to analyze log files and receive a plausible but incorrect diagnosis, delaying response to a live intrusion. The jaggedness means that high-stakes areas—security, compliance, infrastructure—are where the valleys are deepest and most dangerous.

Governance: Treating AI as a Tool, Not a Colleague

The solution is not to abandon AI assistants. It is to build governance frameworks that treat them as what they are: unreliable narrators with no skin in the game. Mitchell’s research implies a set of principles for Windows environments:

Always Verify, Never Assume: All Copilot outputs must be treated as drafts, not final artifacts. PowerShell scripts should be run in sandboxes first; GPO suggestions must be diffed against baselines; security recommendations should be cross-checked with Microsoft Defender data.
Anticipate Brittleness: Train users to probe for jagged edges. If Copilot writes a code, ask it to explain each line; if it can’t, the code may be fragile. If it summarizes a document, test it on specific factual questions the summary implied.
Limit Autonomy: Copilot should not be allowed to execute commands or apply configurations without explicit human approval. Even “confirmation mode” can become a rubber stamp if alerts are too frequent; design systems where AI suggestions are staged for review.
Contextual Hardening: In sensitive domains, constrain prompts to reduce the attack surface. For instance, when querying about firewall rules, require Copilot to cite specific Microsoft documentation and compare against existing policies.
Audit and Log Everything: Every interaction with Copilot should be logged, including prompts, responses, and the user’s decision. This creates a feedback loop to identify patterns of overtrust and improve training.

Microsoft is developing tools to address these concerns. Purview Compliance Manager can help audit AI interactions, and Azure Policy can enforce guardrails on AI-generated scripts. But technology alone won’t solve a problem rooted in human psychology. IT leaders must cultivate a culture of “healthy skepticism” toward AI outputs, reinforced by mandatory training and real-world examples of failures.

The Path Forward for Windows AI

Windows is becoming an AI-first platform. Copilot is the centerpiece, but underlying services—Click-To-Do, Recall, AI-driven search—extend the jagged frontier into every corner of the OS. Build 22635.4225 (KB5043186), released to the Beta Channel, deepened this integration by allowing Copilot to interact directly with local files and settings. These advances promise productivity but also multiply the points of failure.

Melanie Mitchell’s work is a call to humility. “We are easily fooled by superficial fluency,” she wrote in a 2023 paper. “The danger is not that AI will become too smart, but that we will think it is smarter than it is, and hand over responsibilities it cannot handle.”

The Windows IT community sits at the front line of this challenge. Every day, administrators make decisions that affect thousands of users. A single misplaced trust in an AI-generated command can cascade. The remedy is not to reject AI—it is to understand its jagged nature and build systems that respect it.

By acknowledging that LLMs are tools with profound limitations, we can harness their strengths without succumbing to their weaknesses. The goal is not to make AI more humanlike, but to make our use of it more rational. In the end, the security of Windows environments depends not on the intelligence of machines, but on the wisdom of the humans who wield them.

Windows Versions

Microsoft Services

Jagged Intelligence: The Hidden Risk of Windows Copilot and AI Overtrust