Enkrypt AI Unveils Multimodal Safety Report Highlighting Major Risks to AI System Integrity and Security

VoM News Desk | May 9, 2025 | 07:24 AM IST | AI & ML, Articles/Editorials, Featured by VoM |

Enkrypt AI Unveils Multimodal Safety Report Highlighting Major Risks to AI System Integrity and Security

Enkrypt AI’s red teaming findings expose major gaps in multimodal AI safety across the industry.

May 8, 2025 – Boston, MA; As generative AI rapidly evolves to process both text and images, a new Multimodal Safety Report released today by Enkrypt AI, a leading provider of AI safety and compliance solutions for agent and multimodal AI, reveals critical risks that threaten the integrity and safety of multimodal systems.

The red teaming exercise was conducted on several multimodal models, and tests across several safety and harm categories as described in the NIST AI RMF. Newer jailbreak techniques exploit the way multimodal models process combined media, bypassing content filters and leading to harmful outputs—without any obvious red flags in the visible prompt.

Also Read – Featured Stories

“Multimodal AI promises incredible benefits, but it also expands the attack surface in unpredictable ways,” said Sahil Agarwal, CEO of Enkrypt AI. “This research is a wake-up call: the ability to embed harmful textual instructions within seemingly innocuous images has real implications for enterprise liability, public safety, and child protection.”

Key Findings: New Attack in Plain Sight

The research illustrates how multimodal models—designed to handle text and image inputs—can inadvertently expand the surface area for abuse when not sufficiently safeguarded. Such risks can be found in any multimodal model, however, the report focused on two popular ones developed by Mistral: Pixtral-Large (25.02) and Pixtral-12b. According to Enkrypt AI’s findings, these two models are 60 times more prone to generate child sexual exploitation material (CSEM)-related textual responses than comparable models like OpenAI’s GPT-4o and Anthropic’s Claude 3.7 Sonnet.

Additionally, the tests revealed that the models were 18-40 times more likely to produce dangerous CBRN (Chemical, Biological, Radiological, and Nuclear) information when prompted with adversarial inputs. These risks threaten to undermine the intended use of generative AI and highlight the need for stronger safety alignment.

Add VoM News As Preferred Source

Also Read: Top 5 Youngest Female Billionaires and What They Own

These risks were not due to malicious text inputs but triggered by prompt injections buried within image files, a technique that could realistically be used to evade traditional safety filters.

Recommendations for Securing Multimodal Models

The report urges AI developers and enterprises to act swiftly to mitigate these emerging risks, outlining key best practices:

Integrate red teaming datasets into safety alignment processes
Conduct continuous automated stress testing
Deploy context-aware multimodal guardrails
Establish real-time monitoring and incident response
Create model risk cards to transparently communicate vulnerabilities

“These are not theoretical risks,” added Sahil Agarwal. “If we don’t take a safety-first approach to multimodal AI, we risk exposing users—and especially vulnerable populations—to significant harm.”

Access the full Multimodal Safety Report and learn more about the testing methodology and mitigation strategies.

VoM News Desk

VoM News is an online web portal in jammu Kashmir offers regional, National & global news.

Latest Posts

Saudi Arabia Launches 14-Nation Maritime Defence Alliance to Secure Bab al-Mandab Straits and Red Sea Shipping Routes
July 31, 2026 | Breaking News, Israel Palsteine Conflict, World
Kuwait Says It Intercepted Iranian Drones Targeting Vital Facilities
July 31, 2026 | Breaking News, Israel Palsteine Conflict, World
Crackdown on Pakistan-occupied Kashmir Protesters, India Slams Pakistan
July 31, 2026 | Breaking News, Politics, World
Hamas Agrees to Deal to End Gaza War, Accepts Disarmament for Israeli Withdrawal
July 31, 2026 | Breaking News, Politics, World
Iran Says It Targeted Two Oil Tankers Escorted by US Military in Strait of Hormuz
July 31, 2026 | Breaking News, World
Supreme Court Grants Divorce to J&K CM Omar Abdullah and Payal Abdullah After Amicable Settlement
July 31, 2026 | Breaking News, Jammu Kashmir, Politics
Bihar Man Claims He Was Booked in NEET Protest Case Despite Being in Russia
July 30, 2026 | Breaking News, India, Politics
SimonMed Launches Cardiovascular Longevity, Transforming Heart Disease from a Silent Killer into a Preventable Condition
July 30, 2026 | Breaking News, Press Release
Thailand Police Arrest Five Indians in Kidnapping Case, Probe Possible Pakistan Connection
July 30, 2026 | Breaking News, World
7 Killed in Firefighting Bell 412 Helicopter Crash During Training Mission in Argentina
July 30, 2026 | Breaking News, World

Enkrypt AI Unveils Multimodal Safety Report Highlighting Major Risks to AI System Integrity and Security

Enkrypt AI Unveils Multimodal Safety Report Highlighting Major Risks to AI System Integrity and Security

Share this Post

Latest Posts