Skip to content

CalypsoAI Named Finalist in RSAC Innovation Sandbox + $5m Prize Money

Learn more
Blog
12 Mar 2025

CalypsoAI GenAI Security February and March Releases

CalypsoAI GenAI Security February and March Releases

CalypsoAI GenAI Security February and March Releases

We’re excited to introduce our latest updates, packed with major platform enhancements to our red team capabilities, AI security scoring, attack arsenal, and scanner customization. These releases also include new scheduling features, expanded scanner controls, updated prompt injection defenses, and a fresh rebrand for CalypsoAI.
Innovation in AI moves fast, and we need security to move just as quickly. That’s why we release new features every two weeks, ensuring our customers stay ahead of evolving threats and industry advancements. Here’s what’s new.

 

 

Inference Red-Team Enhancements

Red Team Reports – Actionable Insights at Your Fingertips

When red-teaming AI models, the most valuable output is the report. We’ve streamlined this process with our new Red Team Reports, delivering:

  • A concise summary of campaign results with per-prompt details and downloadable spreadsheets.
  • The ability to attack multiple models at once, with both aggregated and individual results.
  • A breakdown of successful attacks by intent category (e.g., illegal acts, toxicity, violence) for targeted remediation.
  • Clear mitigation actions for next steps.

Agentic Warfare™ Campaigns

Customers can leverage Agentic Warfare™ to execute advanced attacks based on custom intents, a dynamic approach to security testing that goes beyond signature-based attacks. Unlike pre-defined attack sets, Agentic Warfare attacks adapt to user-defined objectives, generating real-time adversarial prompts to uncover vulnerabilities in AI models. This method provides a more flexible and comprehensive way to assess AI security risks. 

  • Standard attacks – Signature-based, predefined attack sets.
  • Agentic Warfare attacks – Dynamic, intent-based attacks generated from user-defined objectives.

Expanded Attack Arsenal – 22,000+ Signature Attacks

We’re released thousands of new signature attacks, including:

  • 12,000+ new malicious prompts – Expanding our total attack set to over 22,000. We release new signature packs monthly. 
  • Persuasive Adversarial Prompts – Uses human-like persuasion techniques to subtly rephrase malicious intent.

Single Character Converter – A novel jailbreaking method that exploits vulnerabilities in short-length tokens.

CalypsoAI Security Index (CASI)

CASI is the industry’s first AI security scoring metric, helping organizations compare models on security—not just performance and cost. The CASI score (0-100) evaluates:

  • Severity – The impact of a successful attack.
  • Complexity – The sophistication of the attack.
  • Defensive Breaking Point (DPB) – The weakest link in the model’s security defenses.

CASI is now embedded in Inference Red-team enabling customers to see the security of their AI systems in real-time.

Scoring Tiers:

  • 0-69: Critical – Highly vulnerable; not production-ready.
  • 70-85: Warning – Needs more safeguards before deployment.
  • 85-99: Good – Secure against most attacks, but should be tested further.
  • 100: Ideal – No vulnerabilities detected (verify with the latest attack signatures).

New Inference Defend Scanner Capabilities

Custom Regex & Keyword Scanners

We’ve added two powerful pattern-matching tools:

  • Keyword Scanner – Block or audit specific terms (e.g., proprietary data, sensitive names).
  • Regex Scanner – Define custom patterns for detection (e.g., email addresses, ID numbers). 

Users can create and enable unlimited keyword and regex scanners.

New Financial & Medical Advice Scanners

To prevent AI systems from generating unauthorized advice, we’ve added two new scanners:

  • Medical Advice Scanner – Blocks user prompts that seek direct medical guidance, prescriptions, or diagnoses without input from a licensed healthcare professional.
  • Financial Advice Scanner – Blocks user prompts requesting personal financial recommendations, investment strategies, or money management advice.

These scanners help organizations comply with regulations, mitigate liability risks, and uphold user safety.

System Prompt & Obfuscation Scanners

As prompt injection attacks evolve, we’ve added dedicated scanners to our Prompt Injection Package:

  • System Prompt Scanner – Detects attempts to expose system instructions.
  • Obfuscation Scanner – Identifies text encoding tricks (e.g., base64, hex, leet speak, homoglyphs, intentional misspellings) used for adversarial attacks.

New Branding: Fresh Logo & UI Updates

We’re rebranding! You’ll notice a new CalypsoAI logo in our product UI, to go with the full website redesign.

Hugging Face Integration

With 1.4M+ models on Hugging Face, we’ve made it easier than ever to connect. Simply add a Hugging Face model using its name and API key.

CalypsoAI Platform Updates

Navigation Improvements

Our navigation bar was getting a little crowded, especially for laptop users. To optimize space, we’ve moved some elements to the user avatar menu in the top right, including:

  • API Documentation
  • Support
  • Log Out

This update makes room in the left navigation for new features rolling out later this year.

Have questions or feedback? Let us know—we’re always improving!

To learn more about our Inference Platform arrange a callback.

Latest Posts

AI Inference Security Project

Whitepaper: Security Risks of GenAI Inference

News

CalypsoAI Secures $5M and Named Finalist in RSAC Innovation Sandbox 2025

AI Inference Security Project

The Future of AI Security: What CISOs Need to Know