AI ‘Botsitting’ Costs: 11 Hours Saved vs 6.4 Hours of Human Supervision

AI tools are saving digital workers about 11 hours a week while also forcing them to spend roughly 6.4 hours supervising, correcting, rerunning, and contextualizing AI output, according to a June 2026 Work AI Institute report based on 6,000 workers in the United States, United Kingdom, and Australia. That is the awkward bargain now sitting underneath the corporate AI boom: the work is faster, but it is not necessarily lighter. The office has not eliminated drudgery so much as promoted it into a new managerial layer, where employees spend their days coaxing, auditing, and laundering machine output into something a colleague can trust. The result is a productivity story that looks impressive at the individual level and far less convincing once it reaches the balance sheet.

AI drafting and human review dashboard screens with security/compliance icons and productivity timers in a dark office.The Automation Dividend Is Real, but It Arrives With a Handler​

The first mistake in reading the new data is to dismiss the productivity gains as hype. Three-quarters of surveyed workers said AI makes them more productive, and the reported 11 hours saved per week is not a rounding error. Anyone who has used modern generative tools to summarize a messy document, draft a first-pass email, translate a spreadsheet into prose, or produce scaffolding code knows the feeling: a task that used to occupy an afternoon can sometimes collapse into minutes.
But the second mistake is the one executives are more likely to make. Saving time on a task is not the same as saving time in a workflow. The report’s central finding is not that AI fails; it is that AI often succeeds only after humans perform a new category of labor around it.
That labor now has a name: botsitting. It includes feeding the model enough context to understand the job, checking whether its answer is plausible, fixing mistakes, rewriting awkward output, rerunning prompts, and deciding when the machine is bluffing. The Work AI Institute found that workers spend 37 percent of their AI time on this invisible support work and 36 percent actually using AI to produce work.
That symmetry should make every CIO pause. If a tool requires nearly as much labor to supervise as it removes from the original task, the organization has not automated a process. It has redistributed the process into a human-machine loop whose costs are easy to miss because they are scattered across calendars, Slack threads, code reviews, and late-night cleanups.

The Office Did Not Get Rid of Management; It Pushed Management Downward​

Paul Leonardi of UC Santa Barbara, one of the report’s co-authors, frames the problem bluntly: companies are expecting individual contributors to act as managers of AI systems. That is not just a clever line. It describes a structural shift in white-collar work.
For decades, management meant coordinating people, resolving ambiguity, assigning priorities, and checking quality. Generative AI inserts a tool that can draft, synthesize, and produce, but it does not remove ambiguity. It often increases it, because the machine’s output can look finished before it is sound.
The junior analyst, engineer, marketer, paralegal, or operations worker is therefore asked to become a miniature production manager. They must know what to delegate, how to brief the machine, when to challenge it, how to validate its answer, and how to package the result for others. That is a sophisticated skill set, and many companies are treating it as if it were an instinct.
This is why the botsitting figure matters more than the headline time savings. A worker may save 11 hours on old chores, but if six or more hours are consumed by AI supervision, the net gain becomes smaller, more uneven, and more cognitively taxing. The old task may have been tedious, but at least it was legible. The new task is partly clerical, partly editorial, partly technical, and partly risk management.
That is not a bad trade in every case. In skilled hands, AI supervision can produce better work faster. But it is a different job than the one many employees were hired to do, and it deserves to be measured, trained, and managed as such.

The Spreadsheet of the Future Has a Human Hidden in Every Cell​

The most revealing number in the report may be the gap between personal productivity and organizational performance. While 75 percent of individuals reported a productivity boost, only 13 percent of organizations said they had seen significant business gains from AI adoption. That discrepancy is the whole enterprise AI debate in miniature.
Individuals experience AI at the task level. A worker sees a meeting summary appear instantly, a draft proposal take shape, or a chunk of code emerge from a prompt. The benefit is direct and emotionally persuasive.
Organizations experience AI at the system level. They care whether deals close faster, software ships with fewer defects, support queues shrink, compliance improves, and costs fall without quality collapsing. Those outcomes require more than individual enthusiasm. They require process redesign.
The business world has seen this movie before. The PC, email, cloud software, smartphones, and collaboration apps all promised productivity leaps. Each delivered real gains, but each also created new obligations: more messages, more meetings, more dashboards, more security reviews, more notifications, more “quick” requests because the tools made quick requests cheap.
Generative AI is following the same pattern, only faster. It reduces the cost of producing drafts, summaries, code, analysis, images, and plans. But when the cost of producing something falls, the volume of production rises. The bottleneck moves from creation to evaluation.
That is where the hidden human appears. Someone must decide whether the AI-generated output is accurate, useful, compliant, original, secure, and appropriate. In a healthy organization, that work is designed into the process. In a sloppy one, it lands on whichever employee happens to receive the machine-made artifact.

“Workslop” Is Not a Meme; It Is a Quality-Control Failure​

The report’s example of a junior software engineer pasting thousands of lines of AI-generated code before bed captures the new office pathology. The immediate problem is not that the worker used AI. Developers have always copied, adapted, scaffolded, and generated boilerplate. The problem is that the output entered a shared workflow without enough understanding attached to it.
When a senior engineer has to untangle code that the author cannot explain, the team has not saved time. It has transferred work from the generator to the reviewer. In software, that may show up as broken builds, security flaws, unmaintainable abstractions, or code that passes superficial tests while failing real-world use. In other office contexts, the same pattern appears as plausible but wrong research, bloated slide decks, generic strategy memos, and customer emails that sound polished while saying very little.
This is the phenomenon critics increasingly call workslop: AI-generated material that looks like work, occupies the attention of colleagues, and requires cleanup before it can become useful. It is not simply low-quality output. It is low-quality output that masquerades as finished work and imposes a tax on everyone downstream.
For WindowsForum readers, the analogy to IT operations is obvious. A script that “mostly works” can be more dangerous than no script at all if it runs against production machines. A generated PowerShell command that looks plausible but mishandles permissions, registry keys, or device policies can create a mess that takes longer to diagnose than the original manual task. The same is true of AI-written documentation, incident summaries, or configuration guidance that omits a critical caveat.
The office problem, in other words, is not that AI makes mistakes. Humans make mistakes too. The problem is that AI can make mistakes at industrial speed, in a tone of absolute confidence, and in formats that invite other people to trust it.

Microsoft’s Copilot Era Depends on the Same Unpaid Audit Layer​

This debate lands squarely in Microsoft’s world because Windows, Microsoft 365, Teams, Outlook, Edge, GitHub, Azure, and Security Copilot are becoming the places where enterprise AI meets everyday work. Microsoft’s pitch is not merely that Copilot can answer questions. It is that AI can sit inside the flow of work, drawing on email, files, meetings, chats, tickets, repositories, and business data.
That integration is powerful precisely because context is the missing ingredient in so many failed AI sessions. A generic chatbot can draft an answer. A workplace AI system that understands permissions, documents, calendars, and organizational history can theoretically produce something closer to useful output on the first try.
But integration does not abolish verification. It changes what verification means. If Copilot summarizes a Teams meeting, someone still needs to know whether the summary missed a decision made in the last five minutes. If it drafts a customer response, someone still needs to know whether the answer conflicts with policy. If it produces a PowerShell remediation, someone still needs to know whether it is safe for the target environment.
This is where IT administrators should be skeptical of “AI adoption” metrics that celebrate usage alone. More prompts do not automatically mean better work. More generated output can mean more surface area for mistakes, more data exposure risk, and more review burden on senior staff.
The Windows enterprise has spent years learning that automation without governance is just a faster way to make a mistake. Group Policy, Intune, endpoint detection, conditional access, software deployment rings, and change control all exist because scale magnifies error. AI belongs in that same mental category. It is not a magic intern. It is an automation layer that must be permissioned, monitored, tested, and constrained.

The Productivity Paradox Is Becoming a Governance Problem​

The Work AI Institute report says many workers spend substantial time gathering the right files, documentation, and tacit knowledge required for AI to produce high-quality output. That detail matters because it identifies the organizational failure behind much of the botsitting burden. AI is not struggling only because models are imperfect. It is struggling because companies are messy.
Most businesses do not have pristine knowledge bases. They have stale SharePoint folders, overlapping Slack and Teams channels, duplicate policies, tribal knowledge, half-updated CRM records, and documents whose titles made sense to someone three reorganizations ago. When AI is dropped into that environment, it inherits the mess.
That makes enterprise AI less like installing a new app and more like turning on a spotlight in a cluttered garage. The model can retrieve and recombine information, but it cannot always know which version of a policy is authoritative, which spreadsheet is obsolete, which engineer owns a service, or which exception became standard practice last quarter.
Workers then become the missing metadata layer. They tell the AI what matters, correct its assumptions, attach the right files, and explain the organizational context that should have been encoded somewhere else. Botsitting is partly a model problem, but it is also an information architecture problem.
For sysadmins and IT leaders, this is the practical takeaway: the quality of AI output depends heavily on the quality of the digital workplace beneath it. Permissions, labeling, retention, search, document hygiene, and identity management are no longer back-office chores. They are now part of the AI productivity stack.

The Risk Is Not Laziness; It Is Unexplainable Work​

One of the more alarming findings in the report is that 41 percent of workers said they sometimes deliver AI-generated work they could not explain if asked. That is a cultural warning light.
The common caricature is that workers use AI because they are lazy. Sometimes that is true, as it has always been true with every tool. But the larger danger is subtler: workers are being pushed into higher output expectations without being given enough time, training, or incentives to understand what they are shipping.
If an employee is judged on volume, speed, and AI adoption, they will optimize for those signals. They will produce more drafts, more summaries, more code, more tickets, more slide decks, and more “updates.” If the organization does not also reward understanding, accuracy, and restraint, it will get a flood of plausible output and a shrinking pool of people willing to say, “I don’t know.”
That is especially dangerous in technical environments. An administrator who cannot explain a configuration change should not deploy it. A developer who cannot explain a generated function should not merge it. A security analyst who cannot explain why a model flagged an event should not treat the output as evidence without further investigation.
AI can assist expert judgment, but it cannot replace accountability. The accountable party remains the human or the organization that ships the work. That reality is easy to forget when the machine produces a confident answer in seconds.

The Best AI Users Will Look Less Like Prompt Wizards and More Like Editors​

The early corporate AI era celebrated the “prompt engineer,” a figure who could coax better answers from a model through careful wording. That skill still matters, but the botsitting data points toward a more durable role: the editor.
An editor does not merely generate text. An editor understands audience, purpose, accuracy, tone, risk, sourcing, and consequence. In a software context, the equivalent is the reviewer who understands architecture and failure modes. In IT operations, it is the admin who knows which automation is safe, which environment is brittle, and which “quick fix” will become next month’s outage.
The best AI users will therefore be people who can decompose work, provide context, verify output, and decide what not to automate. They will be good at saying no to the machine. They will know when a generated answer is good enough, when it needs another pass, and when the fastest route is to do the work themselves.
That is a less glamorous vision than the autonomous-agent demos that dominate vendor conferences. It is also more realistic. Most office work is not a clean sequence of tasks waiting to be automated. It is a web of judgment calls, legacy constraints, interpersonal context, compliance requirements, and half-explicit assumptions.
AI can help navigate that web, but only if humans remain skilled enough to notice when it is pulling the wrong thread.

The Winners Will Redesign Work Instead of Counting Prompts​

The companies that benefit most from AI will not be the ones that simply pressure employees to “use AI more.” That strategy risks generating activity without value. It also risks exhausting the very workers whose judgment makes AI useful in the first place.
A better strategy starts by identifying workflows where AI can remove a real bottleneck rather than create a new review queue. That means looking beyond individual anecdotes and measuring cycle time, error rates, rework, customer outcomes, security incidents, and employee load. If the tool saves an analyst two hours but creates three hours of cleanup for a manager, the dashboard should show that.
Training also has to evolve. Many corporate AI training programs still focus on basic prompting and acceptable-use policies. Those are necessary, but insufficient. Workers need domain-specific examples of what good AI-assisted work looks like, what must be verified, what cannot be delegated, and how to document the role AI played.
The hard part is organizational discipline. AI makes it tempting to produce more of everything: more emails, more reports, more tickets, more code, more marketing copy, more meeting notes. But productivity is not the same as output volume. Sometimes the most valuable AI policy will be a norm that says not every generated artifact deserves to be sent.
This is where managers have to earn their salaries. If AI turns every employee into a partial manager of machines, then actual managers must redesign expectations around that reality. They need to account for review time, reward quality, and protect employees from becoming unpaid cleanup crews for poorly deployed automation.

The Windows Admin’s Version of Botsitting Is Already Here​

For IT pros, the botsitting problem is not theoretical. It shows up when users paste AI-generated fixes into support tickets. It shows up when help desk staff rely on chatbot answers that skip environment-specific constraints. It shows up when executives ask why Copilot cannot instantly produce perfect reports from a decade of inconsistent file storage.
It also shows up in security. AI-generated phishing emails, synthetic documents, automated reconnaissance, and low-effort malware variants increase the amount of machine-made material defenders must inspect. At the same time, defensive AI tools generate alerts, summaries, and suggested actions that analysts must validate. The result is a two-sided automation race in which humans remain the escalation path.
Windows environments are especially exposed because they are broad, heterogeneous, and deeply integrated into business processes. A midsize organization may have Windows 10 and Windows 11 endpoints, Entra ID, legacy Active Directory, Intune, third-party agents, line-of-business apps, local admin exceptions, VPN profiles, printer dependencies, and compliance controls. An AI assistant can help reason across that estate, but it can also flatten important distinctions.
The practical standard should be simple: no AI-generated administrative action should be trusted merely because it is syntactically correct. Commands need test environments. Scripts need review. Policy changes need staged rollout. Documentation needs ownership. The same habits that make good admins good admins are the habits that make AI useful rather than dangerous.
In that sense, botsitting is not an embarrassing side effect. It is the visible form of professional responsibility. The problem is not that humans must supervise automation. The problem is pretending that supervision is free.

The Six-Hour Robot Shift Is the Number Executives Should Not Ignore​

The Work AI Institute’s report should not be read as an anti-AI manifesto. It is more useful than that. It describes the cost structure of AI work after the novelty wears off and the tool becomes part of the office day.
The numbers suggest that AI has crossed the adoption threshold but not the organizational redesign threshold. Workers are using it. Workers are getting value from it. Workers are also absorbing a large amount of hidden labor that many companies have not priced, measured, or managed.
That creates a familiar corporate risk. Leadership sees the upside in aggregate: faster drafts, lower friction, more output. Employees experience the operational reality: more checking, more context gathering, more responsibility for tools that do not understand the business as well as the sales deck implied.
The next phase of AI adoption will be less about access and more about accountability. Who owns the output? Who verifies it? Who absorbs the rework? Who decides when AI is inappropriate? Who gets credit for the time saved, and who gets blamed when the machine-made work fails?
If companies cannot answer those questions, the six-hour robot shift will keep expanding. It will become another layer of office work that everyone performs and no one budgets for.

The New Office Bargain Needs Rules Before It Needs More Demos​

The most concrete lesson from the botsitting data is that AI’s value depends on the system around it. A capable model inside a chaotic workflow produces chaos faster. A capable model inside a disciplined workflow can become a real advantage.
That distinction should shape procurement, training, and management. Buying another AI license is easy. Building the conditions under which AI output can be trusted is harder, slower, and more valuable.
  • Companies should measure AI’s effect on full workflows, not just the time saved on individual tasks.
  • Workers should be expected to explain AI-assisted work before they ship it to colleagues, customers, or production systems.
  • Managers should treat verification, context gathering, and correction as real labor rather than invisible friction.
  • IT teams should connect AI rollouts to identity, permissions, data governance, retention, and endpoint management.
  • Organizations should reward employees who reduce low-value output, not just those who generate more material with AI.
  • AI tools should be introduced with clear rules for when human review is mandatory and when automation is not appropriate.
The promise of workplace AI is not dead because workers spend hours babysitting bots; it is becoming more concrete, more expensive, and more dependent on human judgment than the launch demos suggested. The companies that win will be the ones that stop treating AI as a replacement for office work and start treating it as a demanding new participant in that work — one that can accelerate good systems, expose bad ones, and punish anyone who confuses fluent output with finished thinking.

References​

  1. Primary source: Los Angeles Times
    Published: Fri, 12 Jun 2026 10:00:00 GMT
  2. Related coverage: glean.com
  3. Related coverage: ciodive.com
  4. Related coverage: ilo.org
  5. Related coverage: bcg.com
  6. Related coverage: computerworld.com
 

A June 2026 Glean Work AI Institute report found that office workers in the United States, United Kingdom, and Australia spend an average of 6.4 hours a week supervising AI systems, even as many say those tools make individual tasks faster. The finding should puncture the cleanest version of the AI productivity story. The issue is no longer whether generative AI can draft, summarize, code, classify, or search; it is whether organizations understand the new layer of human labor required to make those outputs usable. AI is not just automating office work. It is quietly reorganizing office work around inspection, correction, context feeding, and blame avoidance.

Office worker reviewing an AI compliance audit draft on dual monitors with human “quality gate” in view.The Productivity Miracle Has a Labor Shadow​

The first wave of enterprise AI messaging sold time. Copilots would shorten meetings, agents would clear queues, chatbots would turn messy prompts into polished work, and office employees would be freed from low-value toil. That claim was never wholly false; millions of workers have already discovered that a decent model can produce a rough draft, summarize a thread, or translate a pile of notes faster than a human starting from zero.
But time saved at the task level is not the same thing as time saved at the organizational level. The new reporting around “botsitting” captures the difference: workers are not merely using AI, they are managing it. They prompt, wait, check, correct, rerun, paste between systems, explain the obvious, and then absorb the reputational risk if the machine’s work makes it into a client deck, a legal memo, a support response, or a codebase.
That is why the 6.4-hour figure matters. It is not a stray anecdote from a cranky knowledge worker who hates change. It is a measurement of a hidden operating cost that many AI business cases still treat as a rounding error. Nearly a working day each week is not “friction.” It is a new job function, often performed by people whose job descriptions, performance reviews, and calendars have not caught up.
The Chicago Tribune framing — AI cutting hours of office work while creating a new kind of busywork — gets at the contradiction. AI can make one task feel magical and the day feel heavier. The worker who used to write the first draft now supervises the first draft, audits its assumptions, converts it into the organization’s tone, and explains why the result cannot simply be trusted.

Botsitting Is What Happens When AI Becomes Everyone’s Intern​

The term botsitting sounds comic, but it is useful because it restores the human to the center of the workflow. The fantasy of the autonomous agent is a system that receives a goal and returns a result. The reality, in many offices, is closer to an overconfident intern with instant recall, shaky judgment, and no institutional memory unless someone painstakingly supplies it.
That dynamic creates a peculiar kind of fatigue. Traditional busywork is often boring because it is repetitive. AI busywork is exhausting because it demands vigilance. The worker must stay alert enough to catch subtle errors, hallucinated details, missing context, formatting drift, policy violations, and confident nonsense dressed in executive prose.
This is not the same as learning a new keyboard shortcut or adopting a better search tool. Generative AI introduces probabilistic output into environments that still demand accountability. A spreadsheet formula either calculates or fails; a model may produce something plausible, incomplete, legally risky, or simply not aligned with how the company actually works.
The human therefore becomes the quality gate. In many organizations, that quality gate is not formally recognized. It is wedged into existing roles under the assumption that AI assistance is a net reduction in work. That assumption may hold for narrow, well-scoped tasks. It breaks down when AI is sprayed across departments without redesigned processes, training, data access, and governance.

The Personal Productivity Gap Is Now the Enterprise Problem​

One of the most important tensions in the Glean findings is the gap between individual and organizational benefit. A large majority of surveyed workers reportedly said they use AI at work, and many said it makes them more productive. Yet far fewer said their organizations are performing significantly better because of it. That is the productivity paradox in miniature.
The explanation is not mysterious. A worker may save 20 minutes drafting a memo and then spend 15 minutes checking it, five minutes reformatting it, and another 10 minutes clarifying the source material the model mangled. Even when the individual still comes out ahead, the organization may not. The saved time may disappear into review loops, tool switching, redundant approvals, and meetings about how to use AI “responsibly.”
Worse, productivity may become more uneven. The employee who already understands the domain, the company, and the risks can use AI as acceleration. The employee who lacks that context may use AI to generate more work for reviewers. In that world, the productivity gains accrue to the already capable, while the cleanup burden often lands on managers, senior analysts, security teams, legal reviewers, and IT.
That has a familiar ring for anyone who lived through earlier enterprise technology cycles. Email was supposed to reduce meetings; it created inbox work. Collaboration suites were supposed to centralize knowledge; they multiplied channels. Low-code tools were supposed to empower departments; they created shadow systems that IT later had to secure and rationalize. AI is following the same arc, only faster and with higher stakes.

The Real Tax Is Context​

The phrase “AI cleanup” makes the problem sound like error correction, but the deeper tax is context. Most office work is not hard because sentences are hard to write. It is hard because the worker knows which sentence can be said, which number is safe to use, which customer history matters, which policy applies, and which unstated constraint will blow up the plan.
AI systems do not automatically inherit that context. They need access to the right documents, permissions, metadata, workflow states, and business rules. When they do not have it, employees become the connective tissue. They paste snippets from one system into another, summarize the backstory, restate constraints, and clean up the output when the model optimizes for generic plausibility rather than local truth.
This is where WindowsForum readers should see the enterprise IT angle clearly. The model is only the visible surface. Beneath it are identity, access control, endpoint management, data classification, records retention, auditing, network boundaries, and integration architecture. If those foundations are weak, AI does not eliminate complexity. It exposes it.
That is why a worker can be using four or more AI tools and still feel less supported. More tools can mean more interfaces to babysit, more policy ambiguity, more copy-and-paste risk, and more output formats to reconcile. If the “agentic” future depends on humans moving data between agents, the future has already failed its own automation pitch.

Microsoft’s Copilot Ambition Runs Into the Same Office Physics​

Microsoft has been unusually well positioned to sell AI into the workplace because it already owns so much of the work surface: Windows, Microsoft 365, Teams, Outlook, SharePoint, OneDrive, Edge, Entra, Intune, Defender, Power Platform, and Azure. The Copilot strategy is built on a compelling idea: AI becomes more useful when it lives where the work already happens.
That is the right direction, but it does not repeal office physics. A Copilot that can summarize a Teams meeting is useful. A Copilot that summarizes the wrong meeting, misses the decision that happened in chat, or cannot distinguish a tentative idea from an approved action item still needs a human editor. The more authoritative the AI output appears, the more dangerous a quiet error becomes.
Microsoft’s challenge is therefore not just model quality. It is trust calibration. Users need to know when Copilot is drafting, when it is retrieving, when it is inferring, and when it is acting. Administrators need to know what data it can touch, what it logs, how permissions are inherited, and how mistakes can be investigated. Security teams need to understand whether AI expands the blast radius of overshared files and stale access rights.
For Windows administrators, this is not an abstract productivity debate. AI assistants on managed endpoints will touch the same old enterprise problems: least privilege, device posture, data leakage, retention, compliance, and user training. A bad AI rollout is not just a morale issue. It is a governance issue wearing a productivity badge.

The Cleanup Crew Is Often the Most Valuable Staff​

The National CIO Review’s headline — the AI cleanup crew is getting tired — points to a management problem hiding in plain sight. The people who can clean up AI work are often the people with the most institutional knowledge. They know the customer, the system, the exception, the security policy, the executive preference, and the history behind the numbers.
Those employees are valuable precisely because they can tell when AI output is wrong. But that ability makes them magnets for invisible labor. They become reviewers of AI-generated drafts, explainers of AI mistakes, fixers of agentic workflows, and informal trainers for colleagues who have been told to “use AI more” without being given much guidance.
That is a recipe for resentment. When employees are asked to absorb the cognitive load of AI supervision while leadership celebrates automation savings, the bargain starts to look dishonest. The worker experiences AI as another layer of responsibility; the organization reports it as productivity improvement.
Retention risk follows naturally. If the most competent people are assigned the burden of turning machine output into accountable work, they may begin to look for employers that recognize that labor. The irony is sharp: companies may deploy AI to reduce dependence on scarce expertise, only to burn out the experts required to make the AI useful.

AI Fatigue Is Not Anti-Technology Sentiment​

It is tempting for vendors and executives to dismiss complaints about botsitting as resistance to change. That would be a mistake. Office workers are not rejecting AI because they love manual drudgery. Many are using it enthusiastically. The fatigue comes from the gap between the marketed experience and the lived workflow.
In demos, AI operates in a clean room. The source documents are available, the prompt is well formed, the permissions are aligned, the output is polished, and the user accepts the result. In real offices, the data is messy, the files are duplicated, the policies are ambiguous, the output needs review, and the user is interrupted by five other systems demanding attention.
This gap is especially painful because AI tools often encourage overproduction. A system that can generate five drafts, 20 ideas, or 200 lines of code in seconds creates a new burden of selection. Someone has to decide what is good, what is safe, what is redundant, and what is subtly wrong. The bottleneck moves from production to judgment.
That shift matters culturally. Knowledge workers derive status not only from finishing tasks but from owning judgment. If AI turns more employees into supervisors of machine work while reducing space for original thought, organizations may gain throughput and lose engagement. The office may become faster and more alienating at the same time.

The ROI Spreadsheet Has Been Too Polite​

AI business cases often count subscription costs, infrastructure costs, implementation costs, and projected labor savings. They are less likely to count review time, rework, prompt tinkering, compliance checks, duplicate tool usage, and the opportunity cost of senior employees correcting junior employees’ AI-assisted work. That omission makes the ROI look cleaner than the workplace.
This is not a call to stop deploying AI. It is a call to stop pretending that deployment is adoption and adoption is value. The number of active users tells executives that people are touching the tool. It does not tell them whether the tool improved a workflow, reduced cycle time, improved quality, lowered risk, or merely shifted effort into unmeasured cleanup.
A more honest AI scorecard would measure the full path from request to accepted result. How many steps did the AI remove? How many did it add? How often did humans redo the work? Which roles absorbed the review burden? Which outputs could be trusted with light editing, and which required expert reconstruction?
Without that accounting, companies risk optimizing for visible activity. Employees will use AI because they are expected to use AI. They will generate more artifacts because generation is easy. The organization will then drown in plausible drafts, summaries, and recommendations that still require human sorting.

Disconnected Tools Are Turning Workers Into Middleware​

The most revealing complaints about workplace AI often sound less like complaints about intelligence and more like complaints about integration. The model may be clever, but it does not know where the latest contract lives. The assistant can summarize a document, but not connect that summary to the ticketing system. The chatbot can answer a policy question, but not open the HR workflow that resolves it.
When systems are disconnected, humans become middleware. They copy data, translate formats, reconcile contradictory records, and make judgment calls across software boundaries. AI can make that work faster in places, but it can also make the handoffs more numerous. A worker who once completed a task in three familiar systems may now consult two AI assistants, verify output in another tool, and paste the final version back into the original workflow.
For IT leaders, this should shift the adoption conversation away from “Which AI tool should we buy?” and toward “Which workflow are we redesigning?” A chatbot bolted onto a broken process produces a faster broken process. An agent without reliable permissions, APIs, logging, and rollback is not a labor-saving device; it is an operational liability.
The most successful AI deployments are likely to be boring in exactly this way. They will have fewer splashy prompts and more disciplined plumbing. They will connect to governed data, respect identity boundaries, surface uncertainty, and make human review easier rather than pretending it can be eliminated.

Security Teams Will Be Asked to Clean Up, Too​

Every wave of workplace technology creates a shadow backlog for security. AI is no different. Employees experimenting with public tools may paste sensitive data into systems the company has not approved. Departments may procure AI services outside IT review. Generated content may carry confidential fragments, licensing concerns, or policy violations. Agents may be granted broader access than they need because narrow access is harder to configure.
The botsitting burden therefore has a security twin. Someone has to classify what AI can see, decide which tools are approved, monitor usage, investigate incidents, and educate users about safe behavior. If that work is not funded and staffed, it lands on already stretched security and IT operations teams.
There is also a subtler risk: AI can launder uncertainty into confidence. A polished answer may hide weak sourcing. A generated script may run with privileges the user does not fully understand. A summarized incident may omit the anomaly that mattered. The cleanup crew in security is not just correcting prose; it is protecting the organization from automation that makes bad assumptions look official.
This is why governance cannot be treated as a brake on innovation. Good governance is what allows AI to scale without forcing every worker to become a full-time skeptic. The goal is not to smother experimentation. It is to make safe, useful paths easier than risky workarounds.

The Agentic Office Still Needs Adult Supervision​

The next phase of AI marketing is already centered on agents: systems that do not merely answer but act. They will file tickets, update records, schedule follow-ups, draft responses, query databases, and execute multistep workflows. In theory, this should reduce botsitting. In practice, it could intensify it if organizations skip the hard design work.
An agent that acts incorrectly is more dangerous than a chatbot that writes incorrectly. A flawed paragraph can be edited. A flawed action may change a record, notify a customer, trigger a workflow, or expose data. The more autonomy a system receives, the more important observability, permissions, approval gates, and rollback become.
This is where the “AI will replace busywork” narrative needs revision. AI may replace some manual steps, but it will also create supervisory work. The question is whether that supervisory work is designed, supported, measured, and compensated — or whether it is pushed onto employees as an invisible expectation.
The best agentic systems will not ask users to babysit every move. They will make clear what they did, why they did it, what data they used, and where uncertainty remains. They will escalate appropriately. They will fail safely. They will reduce the user’s mental load rather than converting the user into a nervous air-traffic controller for software robots.

CIOs Need to Treat Botsitting as a Design Defect​

The practical lesson for CIOs is blunt: if employees are spending nearly a day a week managing AI, the organization has not automated work so much as redistributed it. That may still be worthwhile in some cases, but it must be understood as a design choice. Hidden labor is still labor.
A mature AI program should begin with workflow inventory. Which tasks are AI-assisted today? Which ones produce accepted results faster? Which ones create rework? Which teams are acting as informal reviewers? Which data gaps force users to supply context manually? Which tools overlap? Which outputs are too risky to trust without expert review?
That analysis will often reveal that the right answer is not another model. It may be better retrieval, cleaner permissions, fewer redundant tools, stronger templates, clearer usage rules, better training, or a narrower deployment. In enterprise IT, boring improvements frequently beat glamorous procurement.
The other necessary move is managerial honesty. If AI supervision is part of the job, say so. Train for it. Measure it. Reward it. Do not call a worker “resistant” because they are doing the quality-control work that keeps the company out of trouble.

The Windows Desktop Is Becoming the AI Control Room​

For Windows users, this shift will be felt most directly at the desktop. AI is moving into operating systems, browsers, office suites, development environments, security consoles, and collaboration tools. The old desktop was a place where users operated applications. The new desktop is becoming a place where users supervise software that operates other software.
That changes what usability means. A good interface is no longer just one that exposes features cleanly. It must expose agency cleanly. Users need to see what the AI knows, what it can do, what it is about to do, and what it has already done. Administrators need controls that are granular enough to prevent oversharing but simple enough to maintain.
This also changes endpoint management. AI features will arrive through monthly app updates, cloud-side changes, browser integrations, and licensing toggles. Organizations that once worried about standardizing Office macros will now worry about standardizing AI behavior. Policy drift could become a major source of confusion: one worker’s assistant can access a repository, another’s cannot, and a third is using an unsanctioned external tool because it gives better answers.
Windows has always been the messy middle of enterprise work. That makes it the natural proving ground for AI’s real impact. If AI can reduce context switching, respect enterprise controls, and make work easier on managed Windows endpoints, the productivity story strengthens. If it adds another pane, another prompt, and another review burden, users will notice.

The Useful Lesson Hidden in the Busywork​

The botsitting debate should not be read as proof that AI is a bust. It is proof that AI is entering the hard part of enterprise adoption. The easy part was showing that models can generate useful artifacts. The hard part is embedding them into accountable systems where quality, security, compliance, and morale matter.
That distinction is important because it separates vendor promise from operational reality. Vendors are incentivized to highlight capability. Enterprises must manage consequence. A model that can do 80 percent of a task is impressive in a demo; in production, the remaining 20 percent may contain nearly all the risk.
Organizations that understand this will stop treating human review as a temporary flaw that disappears with the next model. Some review burden will shrink as models improve and integrations mature. But judgment, accountability, and institutional context will remain central. The future of AI work is not human-free. It is human-shaped, whether executives admit it or not.
The goal should be to make that human role better. AI should give workers leverage, not a second shift of invisible supervision. That requires disciplined architecture, clear governance, realistic metrics, and respect for the people doing the cleanup.

The Office AI Bill Is Coming Due​

The concrete message for Windows shops, CIOs, and department leaders is not that AI should be paused. It is that AI’s hidden labor must be brought onto the balance sheet before the next wave of agents expands it.
  • Organizations should measure AI work from request to accepted output, not merely count tool usage or generated drafts.
  • IT leaders should treat disconnected AI tools as a workflow problem, because employees become the integration layer when systems do not talk to each other.
  • Managers should recognize AI review and correction as real labor, especially when it falls on senior employees with scarce institutional knowledge.
  • Security teams should be involved before AI tools spread across departments, because cleanup after data exposure or unsafe automation is far costlier than early governance.
  • Microsoft 365 and Windows AI deployments should be judged by whether they reduce context switching and rework, not by whether they add another visible assistant to the desktop.
  • Agentic AI should earn autonomy gradually, with logging, permissions, approval gates, and rollback mechanisms treated as core features rather than enterprise add-ons.
The next year of workplace AI will be less about astonishment and more about accounting. The winners will not be the organizations that generate the most AI-assisted output, but the ones that redesign work so the machines remove burdens instead of disguising them. If the industry gets that right, AI may still deliver the productivity gains promised in the slide decks; if it gets it wrong, the office of the future will be staffed by tired humans quietly cleaning up after software that was supposed to set them free.

References​

  1. Primary source: Chicago Tribune
    Published: Mon, 15 Jun 2026 19:22:50 GMT
  2. Independent coverage: The National CIO Review
    Published: Mon, 15 Jun 2026 14:43:09 GMT
 

Back
Top