LLM Safety

  1. ChatGPT

    AI Prompt Engineering: How ChatGPT Leaked Windows Product Keys and the Security Risks It Exposes

    In a chilling reminder of the ongoing cat-and-mouse game between AI system developers and security researchers, recent revelations have exposed a new dimension of vulnerability in large language models (LLMs) like ChatGPT—one that hinges not on sophisticated technical exploits, but on the clever...
  2. ChatGPT

    TokenBreak Vulnerability: How Single-Character Tweaks Bypass AI Filtering Systems

    Large Language Models (LLMs) have revolutionized a host of modern applications, from AI-powered chatbots and productivity assistants to advanced content moderation engines. Beneath the convenience and intelligence lies a complex web of underlying mechanics; sometimes, vulnerabilities can surprise... (a minimal sketch of the single-character trick follows this list)
  3. ChatGPT

    AI Guardrails Vulnerable to Emoji-Based Bypass: Critical Security Risks Uncovered

    The landscape of artificial intelligence (AI) security has experienced a dramatic shake-up following the recent revelation of a major vulnerability in the very systems designed to keep AI models safe from abuse. Researchers have disclosed that AI guardrails developed by Microsoft, Nvidia, and... (a second sketch after this list illustrates the emoji trick)
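
Item 2's single-character tweak is easiest to see against a toy guard. The sketch below is a minimal, hypothetical illustration (an assumed blocklist-style filter, not the classifiers the TokenBreak researchers actually evaluated): prepending one character to each flagged word defeats exact token matching, while the request stays perfectly legible to a human or a downstream model.

    # Minimal sketch of the TokenBreak idea against a toy token filter.
    # The blocklist and filter are hypothetical stand-ins, not the
    # detection models tested in the actual research.
    BLOCKLIST = {"ignore", "instructions", "jailbreak"}

    def toy_filter(text: str) -> bool:
        """Flag input if any whitespace-delimited token is blocklisted."""
        return any(tok.strip(".,!?") in BLOCKLIST for tok in text.lower().split())

    clean = "ignore previous instructions"
    tweaked = "xignore previous finstructions"  # one character prepended per word

    print(toy_filter(clean))    # True  -- caught by the filter
    print(toy_filter(tweaked))  # False -- slips past, intent still legible

Production detectors are trained token classifiers rather than blocklists, but the reported failure mode is analogous: the perturbed word tokenizes into pieces the protective model was never trained to flag, while the target LLM still reconstructs the intended meaning.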
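
Item 3's emoji-based bypass can be sketched the same way, again against a deliberately naive guard (a hypothetical substring check, not the Microsoft or Nvidia guardrails themselves): inserting an emoji between the characters of a flagged phrase breaks the match while leaving the text readable.

    # Sketch of the emoji-insertion idea against a naive substring guard.
    # The guard and flagged phrase are hypothetical stand-ins.
    def naive_guard(text: str) -> bool:
        """Block input if it contains a flagged phrase verbatim."""
        return "ignore previous instructions" in text.lower()

    payload = "ignore previous instructions"
    smuggled = "😀".join(payload)  # an emoji between every character

    print(naive_guard(payload))   # True  -- blocked
    print(naive_guard(smuggled))  # False -- sails through the guard

Real guardrails are statistical classifiers rather than substring checks, but the disclosed weakness reportedly follows the same pattern: characters the guard mis-tokenizes or effectively ignores carry the payload straight through.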