Grok 4 and Gemini Nano Security Filters Are Bypassed by a New Semantic Chaining Jailbreak Attack

NeuralTrust researchers have revealed Semantic Chaining, a powerful weakness in the security features of multimodal AI models such as Grok 4 and Gemini Nano Banana Pro, in the wake of the recent Echo Chamber Multi-Turn Jailbreak This article explores security features multimodal. . This multi-stage prompting method highlights shortcomings in intent-tracking across chained instructions by evading filters to produce forbidden text and visual content.

The inferential and compositional strengths of models are weaponized against their defenses by semantic chaining. Instead of using direct harmful prompts, it employs harmless steps that eventually lead to outputs that violate policy. Safety filters that are adjusted for isolated "bad concepts" are unable to identify latent intent that has spread over several turns. Attack by Semantic Chaining Jailbreak The exploit modifies images in four steps: Safe Base: To get around the first filters, suggest a neutral scene (like a historical landscape).

First Substitution: Change one harmless component while switching to editing mode. Critical Pivot: Modify context blinds filters; swap in sensitive content. Final Implementation: Produce only the rendered image, producing images that are forbidden.

This takes advantage of disjointed safety layers that respond to individual prompts rather than cumulative history. Most importantly, it uses "educational posters" or diagrams to incorporate prohibited text (such as instructions or manifestos) into pictures. According to NeuralTrust, models turn image engines into text-safety loopholes by rejecting textual responses but rendering pixel-level text uncontested. Reactive architectures ignore "blind spots" in multi-step reasoning while scanning surface prompts.

The alignment of Grok 4 and Gemini Nano Banana Pro breaks under obfuscated chains, demonstrating that the defenses in place are insufficient for agentic AI.

Examples of Exploits Successes that have been tested include: For instance Framing Models of Interest Result Historical Replacement Scene edits from the past Direct failure versus Grok 4, Gemini Nano Banana Pro Bypassed Blueprint for Education Grok 4 training poster insertion Instructions that were prohibited produced artistic narrative Narrative-based abstraction Grok 4 Expressive images with prohibited components Results of Exploitation (Source: NeuralTrust) Results of Exploitation (Source: NeuralTrust) These demonstrate how safeguards are undermined by contextual nudges (history, pedagogy, art). The necessity of intent-governed AI is highlighted by this jailbreak. To secure deployments, businesses should use proactive tools like Shadow AI.

X, LinkedIn, and X for daily updates on cybersecurity. To have your stories featured, get in touch with us.

The alignment of Grok 4 and Gemini Nano Banana Pro breaks under obfuscated chains, demonstrating that the defenses in place are insufficient for agentic AI.

X, LinkedIn, and X for daily updates on cybersecurity. To have your stories featured, get in touch with us.

Grok 4 and Gemini Nano Security Filters Are Bypassed by a New Semantic Chaining Jailbreak Attack

Grok 4 and Gemini Nano Security Filters Are Bypassed by a New Semantic Chaining Jailbreak Attack

Trending News

Your Next Breach Will Look Like Business as Usual

Your Next Breach Will Look Like Business as Usual

Ransomware Groups Increasingly Turn to EDR Killers Outside Vulnerable Driver Tactics

Ransomware Groups Increasingly Turn to EDR Killers Outside Vulnerable Driver Tactics

ProSpy Spyware Spread Through Fake Messaging Apps In Middle East Campaign

ProSpy Spyware Spread Through Fake Messaging Apps In Middle East Campaign

Malicious OpenVSX Extension Delivers GlassWorm To VS Code, Cursor, and Windsurf Users

Malicious OpenVSX Extension Delivers GlassWorm To VS Code, Cursor, and Windsurf Users

Industrial Controllers Still Vulnerable As Conflicts Move to Cyber

Industrial Controllers Still Vulnerable As Conflicts Move to Cyber

Gmail with end-to-end encryption is now available on Android and iPhone.

Gmail with end-to-end encryption is now available on Android and iPhone.

CPUID Breach Sends STX RAT Through Trojanized Downloads of CPU-Z and HWMonitor

CPUID Breach Sends STX RAT Through Trojanized Downloads of CPU-Z and HWMonitor

Adobe Patches Exploited CVE-2026-34621, a Flaw in Acrobat Reader

Adobe Patches Exploited CVE-2026-34621, a Flaw in Acrobat Reader

Top Node.js Maintainers Targeted in Sophisticated Social Engineering Scheme

Top Node.js Maintainers Targeted in Sophisticated Social Engineering Scheme

Threat Actors Abuse Claude Code Leak In GitHub Malware Campaign

Threat Actors Abuse Claude Code Leak In GitHub Malware Campaign