Google Enhances AI Security with Layered Defenses Against Prompt Injection Attacks

June 23, 2025

In a proactive move to bolster the security of its generative artificial intelligence (AI) systems, Google has unveiled a suite of safety measures designed to mitigate emerging threats, particularly indirect prompt injections. According to Google’s GenAI security team, these injection attacks, distinct from direct prompt injections, involve malicious commands hidden within external data sources, which can trick AI systems into performing harmful actions.

To address this evolving cybersecurity challenge, Google is adopting a “layered” defense strategy intended to complicate and increase the cost of executing attacks against its frameworks. The corporation has implemented multiple countermeasures across its AI operations, including the integration of purpose-built machine learning (ML) models to identify malicious inputs and various system-level safeguards. Notably, the flagship Gemini model has been equipped with enhanced guardrails designed to protect against these vulnerabilities.

Among the critical enhancements are prompt injection content classifiers that filter harmful instructions from inputs, and a security thought reinforcement mechanism that incorporates special markers into untrusted data. This innovative approach ensures that models divert from potentially malicious commands embedded in the data. Google has also applied markdown sanitization techniques alongside suspicious URL redaction, leveraging Google Safe Browsing to neutralize possible threats.

Despite these advancements, researchers warn that malicious actors are increasingly deploying sophisticated, adaptive strategies designed to circumvent established defenses. The inability of certain AI models to correctly distinguish between legitimate user commands and deceptive instructions embedded in retrieved data represents a significant challenge. Following a recent peer-reviewed study, researchers highlighted the need for comprehensive security measures that must extend across all layers of AI systems to effectively counteract threats posed by both external attackers and internal misalignments.

AI Act cybersecurity threats Google Machine Learning Prompt Injection

Latest NEWS

Microsoft tightens Edge’s Internet Explorer mode after reports of exploit chain

October 13, 2025
Astaroth banking trojan leverages GitHub to restore command-and-control, McAfee says

October 13, 2025
Smishing texts impersonate New York tax agency to steal inflation refund details

October 13, 2025
Unauthenticated flaw in Gladinet CentreStack and Triofox (CVE-2025-11371) exploited in the wild

October 11, 2025
Researchers: Stealit malware uses Node.js single-executable feature to spread

October 11, 2025

Google Enhances AI Security with Layered Defenses Against Prompt Injection Attacks

Latest NEWS

Microsoft tightens Edge’s Internet Explorer mode after reports of exploit chain

Astaroth banking trojan leverages GitHub to restore command-and-control, McAfee says

Smishing texts impersonate New York tax agency to steal inflation refund details

Unauthenticated flaw in Gladinet CentreStack and Triofox (CVE-2025-11371) exploited in the wild

Researchers: Stealit malware uses Node.js single-executable feature to spread