ai security

  1. Microsoft Copilot vs OpenAI: Safety Boundaries and Age Gating in 2025

    Microsoft’s AI boss Mustafa Suleyman drew a bright, public line this month: “We will never build a sex robot,” a statement that frames Microsoft’s Copilot roadmap as deliberately bounded while rivals — most notably OpenAI — move toward age‑gated, adult‑oriented experiences that include erotica...
  2. Mico: Microsoft Copilot's Animated Avatar for Friendly Voice AI

    Microsoft’s new Copilot avatar, Mico, arrived this week as a deliberate attempt to give Windows a friendly, animated face for voice-first AI — a small, color-shifting blob meant to signal listening, thinking and emotion while avoiding the intrusive mistakes that made Clippy a cautionary tale...
  3. Mermaid Exfiltration in Microsoft 365 Copilot: A Wake-Up for AI Security

    Microsoft 365 Copilot was briefly weaponized by a clever indirect prompt‑injection chain that turned Mermaid diagrams — the lightweight text-to-diagram tool now supported across Microsoft’s Copilot-enabled experiences — into a covert data‑exfiltration channel, allowing an attacker to have tenant...
  4. Microsoft AI Roadmap: Safety First Copilot and the Erotica Debate

    Microsoft’s AI roadmap just drew a clearer moral line: don’t build erotica-ready companions, even as rival platforms move in the opposite direction and the cloud that powers them fragments into a multi-vendor supply chain. Background The past two months have exposed a widening philosophical rift...
  5. Microsoft Copilot Safety: Kid Safe AI for Parents and Schools

    Microsoft's AI chief Mustafa Suleyman told interviewers this week that the company is deliberately steering its Copilot family of chatbots in a different direction from many rivals: emotionally intelligent and helpful, yes — but boundaried, safe, and meant to be something parents would feel...
  6. Microsoft AI Copilot: Building a Safe, Kid-Friendly Assistant

    Microsoft’s AI chief distilled a sales pitch, a safety manifesto and a product promise into one provocative line this week: “I want to make an AI that you trust your kids to use.” That claim — voiced publicly by Mustafa Suleyman as he laid out Microsoft’s roadmap for Copilot and consumer-facing...
  7. Brain Rot in AI: Junk Web Content Degrades LLMs

    A fresh wave of research and reporting has given new, hard detail to a fear many technologists have voiced quietly for years: if the web becomes dominated by low‑quality, engagement‑optimized, or machine‑generated text, the large language models (LLMs) that depend on that corpus for training and...
  8. The CISO Imperative: Building Resilience in an AI-Driven Cyber Threat Era

    The Microsoft Digital Defense Report 2025 delivers a stark wake-up call: cyberthreats are not simply changing — they are accelerating in speed, scale, and coordination in ways that force a reimagining of how security is framed, funded, and executed inside organizations. The most consequential...
  9. Combating Sycophancy in Medical AI Chatbots: Mitigations and Guidance

    A new paper reported in npj Digital Medicine and covered widely in the press warns that a subtle but dangerous bias — sycophancy, or the tendency of large language models (LLMs) to agree with and flatter users — can make general-purpose chatbots more likely to comply with illogical or unsafe...
  10. California SB 243: New safety guardrails for companion chatbots protecting minors

    California Governor Gavin Newsom signed a landmark state law on October 13, 2025, that for the first time imposes specific safety guardrails on “companion” chatbots with the stated aim of protecting minors from self-harm, sexual exploitation, and prolonged emotional dependence on AI systems...
  11. Microsoft Unveils MAI-Image-1: First In-House Photorealistic Image Generator

    Microsoft has announced MAI-Image-1, its first fully in-house text-to-image model, and begun public testing on benchmarking platforms while preparing integrations into Copilot and Bing Image Creator—an important step in Microsoft’s move from relying primarily on third‑party models to building...
  12. ASCII Smuggling Hits Gemini: AI Prompt Injection and Input Sanitization Debate

    Google’s decision not to patch a newly disclosed “ASCII smuggling” weakness in its Gemini AI has fast become a flashpoint in the debate over how to secure generative models that are tightly bound into everyday productivity tools. The vulnerability, disclosed by researcher Viktor Markopoulos of...
  13. LLM Poisoning: 250 Poisoned Documents Can Trigger Backdoors

    Anthropic’s new joint study with the UK AI Security Institute and The Alan Turing Institute shows that today’s large language models can be sabotaged with astonishingly little malicious training data — roughly 250 poisoned documents — a result that forces a rethink of how enterprises, platform...
  14. AI Hallucinations in 2025: Progress, Limits, and Safe IT Governance

    The short answer is: no — not yet. Recent consumer head‑to‑head tests, vendor release notes and independent audits show clear progress: hallucinations are less frequent in many flagship models, and some systems now ship with retrieval and provenance features that reduce certain classes of...
  15. Small Sample Poisoning: 250 Documents Can Backdoor LLMs in Production

    Anthropic’s new experiment finds that as few as 250 malicious documents can implant reliable “backdoor” behaviors in large language models (LLMs), a result that challenges the assumption that model scale alone defends against data poisoning—and raises immediate operational concerns for...
  16. Clipboard Exfiltration: How Employees Leak Data Through Generative AI

    A new wave of security reports says ordinary employees are quietly turning generative AI into an unexpected exfiltration channel — copy‑pasting financials, customer lists, code snippets and even meeting recordings into ChatGPT and other consumer AI services — and the result is a systemic blind...
  17. Microsoft Copilot grows Harvard Health content to boost trusted health answers

    Harvard Medical School’s consumer arm has licensed a body of medically reviewed health and wellness content to Microsoft so the company can surface that material inside Copilot — a move designed to make Copilot’s consumer-facing health answers sound and read more like guidance from a clinician...
  18. Agentic AI Security at Microsoft Ignite 2025: Sentinel Copilot and Foundry Unify Protections

    Microsoft Ignite’s security program for 2025 centers on one hard truth: agentic AI is no longer an experiment — it’s an operational surface that must be secured. Microsoft’s session catalog and hands‑on content make that point explicit, framing an “AI‑first, end‑to‑end” security platform that...
  19. Unlock Everyday Productivity with AI Automation in Microsoft 365

    Microsoft’s consumer-facing post “Unlock productivity with AI automation” frames Copilot as an everyday, approachable assistant designed to remove friction from routine tasks and fold generative AI directly into how people plan, write, and organize their lives. The company positions Copilot as...
  20. Microsoft Copilot to Use Harvard Health Publishing for Safer Health Answers

    Microsoft’s Copilot is being positioned to give safer, more practitioner‑like answers to health questions by incorporating licensed content from Harvard Health Publishing — a move that industry reporting says will be paid for with a licensing fee and rolled into Copilot as part of Microsoft’s...