GitHub Reliability Crisis: Why Developer Trust Is Microsoft’s Biggest Risk

  • Thread Author
GitHub’s reliability crisis has become a Microsoft crisis because GitHub is no longer just a website where developers store code; it is a global production dependency for software teams, open-source maintainers, CI/CD pipelines, security tooling, and now AI coding agents. The latest flashpoint is Ghostty creator and HashiCorp co-founder Mitchell Hashimoto saying the platform is failing him “every single day,” a deeply personal rebuke from one of GitHub’s earliest and most committed users. Combined with GitHub’s own recent availability reports, the Zig project’s move to Codeberg, and Microsoft’s decision to fold GitHub more tightly into its CoreAI strategy, the question is no longer whether GitHub has outages. The question is whether Microsoft understands that developer trust is GitHub’s real infrastructure.

Cybersecurity operations scene with stressed staff as a cracked globe shows availability incidents, lightning, and warning icons.Overview​

Microsoft’s acquisition of GitHub in 2018 was controversial from the moment it was announced. The deal, valued at $7.5 billion in stock, placed the world’s most important developer collaboration platform inside a company that had spent years trying to rehabilitate its image with open-source communities. Microsoft had changed dramatically under Satya Nadella, but many developers still remembered the older Microsoft: proprietary, combative, and deeply suspicious of open platforms it did not control.
For several years, the deal looked better than skeptics expected. GitHub continued to grow, GitHub Actions became a major CI/CD platform, GitHub Sponsors helped fund maintainers, and GitHub Copilot turned the service into one of Microsoft’s most visible AI success stories. The company’s pitch was simple: Microsoft would provide scale, enterprise reach, and cloud infrastructure while GitHub preserved its developer-first identity.
That balance now looks much more fragile. GitHub’s 2026 availability reports describe repeated incidents affecting GitHub.com, Git operations, APIs, Actions, Codespaces, Dependabot, Copilot, Teams integrations, and search-backed experiences. In isolation, any one of those failures could be explained as the cost of operating a massive distributed system. Together, they look like a pattern: GitHub’s operational complexity has outgrown its reliability posture.
The emotional weight of Hashimoto’s departure matters because it reframes the issue from abstract uptime percentages to lived developer experience. His Ghostty terminal project is not a casual side repository; it is a serious open-source effort with a large user base and a demanding development workflow. When someone with that level of history says GitHub is no longer a place for serious work, Microsoft has a reputational problem that no Copilot demo can smooth over.

Why This Outage Story Feels Different​

The loss of emotional goodwill​

Developers tolerate many things from platforms they love. They forgive temporary downtime, awkward redesigns, confusing settings, and even corporate acquisitions if the core workflow remains fast and dependable. What they do not forgive easily is the feeling that a trusted tool has become hostile to the work itself.
That is why the Hashimoto post landed so hard. He was not writing like a tourist discovering a bad week on GitHub; he was writing like someone ending a long relationship. His frustration was not merely technical but emotional, and that distinction is important because GitHub’s moat has always been partly psychological.
For years, GitHub was where software happened. It was the place where a bug report became a patch, where a side project became a company, and where open-source reputation became legible through commits, issues, pull requests, and stars. If that sense of permanence erodes, GitHub becomes just another SaaS vendor with an incident page.
  • GitHub’s brand is trust-based, not just feature-based.
  • Open-source maintainers are high-leverage users, even when they are not the highest-paying customers.
  • Developer anger spreads quickly because engineers compare notes publicly.
  • A platform migration by one major project gives others permission to consider the same move.

From downtime to workflow failure​

Traditional uptime measurements can understate the damage caused by partial failure. A code host may technically be “up” while pull requests do not load, Actions queues stall, search returns nothing, or merge queues corrupt expected branch state. For developers, the practical result is the same: work stops.
This is especially true for modern GitHub workflows. A single pull request can depend on code review, status checks, branch protection, Actions runners, test artifacts, permissions, notifications, search indexing, webhooks, and deployment gates. When one subsystem fails, the user often experiences the whole platform as broken.
That is why 90.21% uptime, as claimed by Hashimoto’s personal tracking of negative work impact, is so damaging even if it is not the same as GitHub’s formal SLA measurement. It captures what many users care about most: “Could I do my job today?” If the answer is repeatedly no, the contractual definition of uptime becomes a footnote.

GitHub’s Technical Debt Has Become Visible​

The platform is now many platforms​

GitHub started as a social layer around Git repositories, but today it is far more than a source code host. It is a CI/CD system, package registry, security scanner, project management suite, documentation hub, identity layer, AI assistant surface, and enterprise compliance platform. That expansion created enormous value, but it also multiplied the number of ways GitHub can fail.
GitHub’s own recent explanations point to familiar distributed-systems failure modes: cache invalidation gone wrong, load balancer misconfiguration, Redis-related issues, datastore authentication failures, compute-provider dependencies, overloaded Elasticsearch clusters, and retry amplification. None of those are shocking on their own. What matters is that they are hitting core developer paths often enough to create a perception of systemic brittleness.
The April merge queue incident is especially troubling because it affected repository state rather than merely page loading. GitHub said affected squash merges could produce incorrect merge commits in certain merge queue conditions, impacting hundreds of repositories and thousands of pull requests. Even without data loss, branch correctness is sacred territory for a developer platform.
  • Git operations must be treated as a reliability tier above ordinary web features.
  • Pull request integrity must be protected against subtle automation regressions.
  • Search dependencies should not make unrelated UI paths feel unusable.
  • Actions capacity must be isolated from experimental and AI-heavy workloads.
  • Status transparency must reflect user-visible degradation, not only platform-wide collapse.

Blast radius is the real metric​

GitHub’s public remediation language now emphasizes isolation, reduced hidden coupling, better caching, fewer single points of failure, and more graceful degradation. That is exactly the right vocabulary. It is also an implicit admission that the platform’s internal dependencies have allowed too many failures to spread too far.
The hardest engineering problem here is not that GitHub has bugs. Every large service has bugs. The harder problem is that GitHub is now so central to software delivery that a medium-sized internal fault can cascade into a global productivity event.
Microsoft should treat GitHub like critical infrastructure, not like a fast-moving AI product surface. That means boring investments: capacity planning, dependency isolation, rollback discipline, chaos testing, incident drills, and conservative changes around merge, identity, storage, and CI queues. The glamorous work is Copilot; the existential work is making sure pull requests and Actions keep functioning under pressure.

AI Has Changed the Load Profile​

Agentic coding is not normal traffic​

GitHub says software development patterns have shifted rapidly, especially since late 2025, with agentic workflows increasing repository creation, pull request activity, API usage, automation, and large-repository workloads. That explanation is plausible. AI coding agents do not behave like human developers browsing a repository over coffee.
An agent can create branches, open pull requests, post comments, trigger CI, query APIs, generate review noise, and repeat failed actions at machine speed. Multiply that across millions of users, enterprise experiments, and automated coding products, and GitHub faces a load profile closer to bot traffic than traditional collaboration. The platform that was built for human-scale software development is now being asked to mediate machine-scale software generation.
This is where Microsoft’s AI strategy collides with GitHub’s reliability promise. Copilot may be a business triumph, but it also encourages more automation, more requests, more CI runs, and more generated code activity. If GitHub becomes both the home of the code and the execution arena for AI agents, its infrastructure burden changes dramatically.
  • AI agents generate more events per developer than traditional workflows.
  • Retry behavior can amplify outages when automated systems keep hammering failing services.
  • Generated pull requests increase review and CI pressure, even when many are low quality.
  • API quotas and abuse systems must distinguish useful automation from destructive noise.
  • Enterprise AI adoption will stress GitHub differently than hobbyist experimentation.

The AI-first perception problem​

The technical argument for AI-driven growth may be valid, but perception matters. Developers increasingly hear Microsoft talk about AI, Copilot, agents, and CoreAI while they experience sluggish pull requests, broken Actions, or unreliable search. That mismatch creates the impression that Microsoft is prioritizing investor-friendly AI narratives over the daily work of developers.
It may not be that simple. Reliability work is expensive, invisible, and slow to communicate. Feature launches are easier to sell than database isolation, queue redesign, or multi-cloud resilience. Still, Microsoft has to understand that developers judge platforms by outcomes, not org charts.
The phrase “availability first, then capacity, then new features” is exactly what GitHub users need to hear. But after months of visible incidents, the words must be backed by measurable improvement. Trust is not restored by a roadmap; it is restored by quiet weeks where nothing breaks.

Enterprise Customers Face a Different Kind of Risk​

GitHub is part of the production line​

For enterprises, GitHub downtime is not merely annoying. It can delay releases, block security patches, interrupt compliance workflows, and strand development teams during critical windows. A stalled Actions queue can mean a delayed hotfix; a degraded API can break internal tooling; an unreliable merge queue can undermine confidence in release governance.
Large companies also build layers of automation around GitHub. They connect it to identity systems, ticketing platforms, deployment pipelines, artifact repositories, security scanners, and internal dashboards. When GitHub degrades, the impact can travel far beyond developers staring at a browser tab.
That is why GitHub’s enterprise challenge is more severe than consumer-facing frustration. Microsoft has spent years selling GitHub Enterprise, Advanced Security, Copilot for Business, and Azure-adjacent developer workflows as part of a modern software factory. If the factory gate jams, CIOs and CTOs start asking uncomfortable questions.
  • Release management teams need predictable CI capacity.
  • Security teams need Dependabot and CodeQL workflows to remain dependable.
  • Compliance teams need auditable platform behavior during incidents.
  • Platform engineering teams need APIs that degrade clearly, not mysteriously.
  • Executives need assurance that AI growth will not cannibalize core reliability.

SLAs do not capture developer interruption​

Enterprise contracts may provide service credits, but service credits do not ship software. The real cost of a GitHub outage is lost coordination, delayed decisions, broken context, and the risky workarounds teams adopt under pressure. Developers may retry jobs, bypass checks, postpone reviews, or move urgent patches through less-controlled paths.
Those behaviors create secondary risk. A team blocked by Actions during a severe incident may choose to skip a noncritical test suite, only to discover later that the skipped test would have caught a regression. A platform outage becomes not just downtime but a quality and security concern.
Microsoft’s enterprise sales machine is powerful, but it cannot hand-wave that operational reality forever. If GitHub wants to remain the default for Fortune 500 development, it needs a reliability story that is as mature as Azure’s best enterprise commitments. That means better incident communication, clearer service separation, stronger guarantees for paid tiers, and tooling that lets customers design fallback paths.

Open Source Is Rediscovering Portability​

Zig and Ghostty are warning flares​

The Zig programming language’s migration to Codeberg in late 2025 was a direct challenge to GitHub’s cultural authority. Zig’s leadership criticized GitHub’s engineering culture, Actions reliability, UI performance, AI emphasis, and the broader implications of Microsoft ownership. Even readers who found the tone harsh could not ignore the substance: a serious language project decided GitHub’s convenience was no longer worth the dependency.
Ghostty’s planned departure compounds that signal. Unlike a corporate migration hidden inside procurement, open-source migrations are public acts of governance. They tell contributors where the canonical repository lives, where issues should be filed, where pull requests belong, and what platform values the project wants to support.
The hard truth for GitHub is that open source does not need perfect alternatives to start moving. It only needs enough credible alternatives for maintainers to believe the inconvenience is justified. Codeberg, Forgejo, GitLab, self-hosted Gitea, and other options do not have to beat GitHub feature-for-feature to benefit from GitHub fatigue.
  • Codeberg offers a nonprofit-aligned alternative that appeals to open-source governance values.
  • Forgejo and Gitea support self-hosted autonomy for projects that want control.
  • GitLab remains a full-stack DevSecOps competitor with strong enterprise recognition.
  • Mirrors can reduce migration pain by preserving GitHub visibility while moving canonical development.
  • Funding and issue migration remain friction points that keep many projects on GitHub.

Network effects can weaken slowly, then suddenly​

GitHub’s strongest defense is its network effect. Developers already have accounts, organizations already have permissions, tooling already assumes GitHub URLs, and contributors already know the workflow. That inertia is enormous.
But network effects are not invincible. They weaken when influential users begin treating the default as optional. If enough projects keep GitHub as a mirror while moving canonical work elsewhere, the symbolic center of open source becomes less fixed.
This does not mean GitHub is about to collapse. It remains the largest code-hosting platform and the default choice for countless projects. The more realistic danger is subtler: GitHub remains huge but loses its aura as the inevitable home of serious open-source work.

Microsoft’s Ownership Question Is Back​

The acquisition bargain is under review​

When Microsoft bought GitHub, it implicitly asked developers to believe in a bargain. GitHub would gain scale and resources, while Microsoft would gain developer credibility. The community’s fear was that the platform would eventually become absorbed into Microsoft’s corporate priorities.
For a while, Microsoft avoided the worst-case scenario. GitHub retained a distinct identity, Nat Friedman was widely respected in developer circles, and the company shipped meaningful improvements. Copilot then gave GitHub an enormous strategic role inside Microsoft’s AI ambitions.
The 2025 decision to bring GitHub into Microsoft’s CoreAI orbit changed the optics. With Thomas Dohmke’s departure and GitHub’s leadership reporting more directly into Microsoft’s AI organization, the old promise of independence became harder to maintain. Even if the operational teams remain dedicated, the public sees a platform once defined by open-source collaboration being reorganized around AI leverage.
  • GitHub’s independence was part of the post-acquisition reassurance.
  • CoreAI integration suggests strategic centralization.
  • Copilot’s success increases pressure to optimize GitHub around AI monetization.
  • Developers worry that reliability work is losing political priority.
  • Microsoft’s broader product quality debates now attach themselves to GitHub.

Xbox, Windows, and the quality narrative​

The Windows Central framing connects GitHub’s troubles to larger frustrations with Microsoft’s stewardship of Windows and Xbox. That comparison is emotionally potent but needs careful handling. GitHub’s reliability problems are not the same as Windows 11 UX complaints or Xbox strategy turmoil.
Still, the common thread is hard to ignore: users of major Microsoft platforms increasingly feel that core experience quality is competing with corporate pushes around AI, subscriptions, ads, services, and ecosystem strategy. In Windows, that can look like unwanted prompts or inconsistent interface changes. In Xbox, it can look like confusing platform direction. In GitHub, it looks like a development platform that sometimes cannot reliably support development.
This is a dangerous narrative for Microsoft because it attacks Nadella’s strongest achievement: making Microsoft feel trustworthy again to developers. If GitHub becomes evidence that Microsoft eventually absorbs and degrades beloved products, the reputational cost will exceed any single outage report.

Competitors Now Have an Opening​

GitLab, Codeberg, and the self-hosting argument​

GitHub’s competitors do not need to defeat Microsoft head-on to gain from this moment. They need to offer credible answers to specific frustrations: better reliability, less AI pressure, stronger self-hosting, clearer governance, or more transparent incident handling. Different users will value different combinations.
GitLab can appeal to enterprises that want an integrated DevSecOps platform with self-managed options and a mature CI/CD story. Codeberg can appeal to open-source maintainers who want nonprofit governance and a platform culturally distant from Big Tech AI priorities. Forgejo and Gitea can appeal to projects that want lighter, self-controlled infrastructure.
The migration calculus differs by user type. A hobbyist can move quickly. A language project must consider contributors, issue history, build infrastructure, funding, and distribution. A multinational enterprise must consider contracts, identity, audit, compliance, training, and integration with internal developer platforms.
  • Enterprises may evaluate multi-host strategies rather than full replacement.
  • Open-source projects may move canonical repositories while keeping GitHub mirrors.
  • Security-conscious teams may prefer self-hosted control for sensitive code.
  • AI-skeptical communities may choose platforms with stricter policy alignment.
  • Tooling vendors may add better support for non-GitHub forges if demand rises.

GitHub still has enormous advantages​

It would be foolish to declare GitHub finished. Its ecosystem depth remains unmatched. The platform has the largest contributor base, mature permissions, strong marketplace integrations, enterprise adoption, package services, security tooling, and a familiar collaboration model.
GitHub also has Microsoft’s resources behind it. If leadership truly prioritizes reliability, the company can invest at a level most competitors cannot match. Its move toward more capacity, better isolation, public-cloud expansion, and eventually multi-cloud resilience could produce a stronger platform over time.
The competitive danger is not immediate mass abandonment. It is optionality. Once teams begin designing workflows that can survive without GitHub, Microsoft loses some of the lock-in that made GitHub so strategically valuable.

What Microsoft Must Fix First​

Reliability is a product feature​

Microsoft and GitHub should treat reliability not as backend hygiene but as a first-class product feature. Developers do not experience “availability architecture”; they experience whether a pull request opens, a workflow starts, a push succeeds, and a review can be completed. The user-facing language should reflect that reality.
A serious recovery plan needs more than apologies. It needs measurable goals, public accountability, and visible reduction in incident frequency. GitHub has already begun publishing more availability details, but the next phase should make reliability legible to both individual maintainers and enterprise administrators.
The most important work is likely unglamorous. GitHub needs sharper dependency boundaries, safer rollout systems, better simulation of high-load agentic workflows, stricter controls around merge-sensitive paths, and incident communication that tells users exactly which workflows are safe and which are not.
  • Stabilize Git, pull requests, and Actions before shipping major new AI workflows.
  • Separate critical human development paths from high-volume agent traffic.
  • Publish clearer service-level reporting for user-visible workflows.
  • Give enterprises stronger fallback patterns for runners, queues, and API disruptions.
  • Prove improvement over months, not days, before asking developers to trust the roadmap.

The status page needs to match reality​

A recurring developer complaint around SaaS outages is that the status page does not reflect what users are experiencing. GitHub says it is improving status transparency, and that effort is essential. When developers cannot tell whether a failure is local, regional, account-specific, or platform-wide, they waste time debugging ghosts.
Better status reporting should include affected workflows, approximate scope, mitigation guidance, and whether retries are harmful. During automation-heavy incidents, telling users not to retry aggressively can be just as important as acknowledging the outage. The platform should communicate like a collaborator, not like a legal department.
Microsoft also needs to resist the urge to over-message AI while under-message reliability. If Copilot is on every banner but incident communication feels delayed or incomplete, the trust gap widens. The recovery campaign should be boring, specific, and relentless.

Strengths and Opportunities​

GitHub still has major strengths, and the current crisis could become a turning point if Microsoft treats it as an operational reset rather than a public-relations flare-up. The platform’s scale gives it data competitors lack, its enterprise footprint gives it funding, and its developer ecosystem gives it a path back if reliability visibly improves.
  • GitHub remains the default collaboration platform for a vast share of developers and organizations.
  • Microsoft has the capital and engineering depth to rebuild weak reliability layers.
  • Copilot gives GitHub a strategic role that can justify major infrastructure investment.
  • Public availability reporting can become a trust asset if it stays candid and detailed.
  • Azure integration can provide capacity advantages when paired with proper isolation.
  • Open-source backlash can sharpen GitHub’s priorities by making user pain impossible to ignore.
  • A successful reliability turnaround would strengthen Microsoft’s developer credibility at a critical AI moment.

Risks and Concerns​

The risks are substantial because GitHub’s problems sit at the intersection of technical debt, AI-driven load, corporate strategy, and community trust. Microsoft can fix systems faster than it can fix belief, and belief is what turns a platform from a vendor into a home.
  • Developer trust may erode faster than reliability metrics recover.
  • AI workloads may continue growing faster than infrastructure redesigns can absorb.
  • Open-source maintainers may normalize GitHub mirrors instead of GitHub-first development.
  • Enterprise customers may demand stronger contractual guarantees or multi-provider workflows.
  • CoreAI integration may reinforce fears that GitHub’s mission has changed.
  • Partial outages may remain undercounted if metrics do not reflect workflow-level failure.
  • Competitors may gain cultural momentum even without matching GitHub’s full feature set.

Looking Ahead​

The next few months matter​

The most important signal will not be a single apology, executive post, or architectural diagram. It will be whether developers experience fewer interruptions through the rest of 2026. GitHub needs a sustained period of calm, especially around Actions, pull requests, search, APIs, and merge queues.
Microsoft’s fiscal and strategic context also matters. The company is under intense pressure to show AI returns, and GitHub Copilot is one of its strongest examples of AI monetization. That makes it even more important for Microsoft to prove that AI growth will fund reliability rather than consume it.
  • Watch GitHub’s monthly availability reports for fewer incidents and clearer root-cause follow-through.
  • Watch major open-source projects to see whether Ghostty and Zig become isolated cases or early movers.
  • Watch Copilot and agentic coding announcements for signs of traffic isolation and abuse controls.
  • Watch enterprise messaging for stronger commitments around Actions, APIs, and critical workflow uptime.
  • Watch Microsoft’s organizational choices to see whether reliability leaders gain visible authority.

The platform must earn back the default​

GitHub’s greatest asset is that developers do not want to leave. Even angry users often criticize it because they remember how good it felt when GitHub was fast, simple, dependable, and culturally central. That lingering affection is a gift, but it has an expiration date.
The path forward is clear but difficult. GitHub must slow down where it matters, isolate what breaks, communicate honestly, and make the boring parts of software collaboration feel dependable again. Microsoft can still turn this crisis into proof that it understands the developer world it bought in 2018.
GitHub is not doomed, but it is being tested in a way that cuts to the heart of Microsoft’s modern identity. If Microsoft wants developers to believe in its AI future, it must first make sure the platform where those developers build, review, test, secure, and ship software works when they need it. The future of coding may involve agents, copilots, and enormous automation, but the foundation is still trust — and right now, trust is the service GitHub most urgently needs to restore.

Source: Windows Central After Xbox and Windows, now GITHUB is in crisis, "failing me, every single day, and it is personal" — Microsoft, what are you doing?
 

Back
Top