News

How Adversarial ML Can Turn An ML Model Against Itself

Discover the main types of adversarial machine learning attacks and what you can do to protect yourself.

Published

1 year ago

March 25, 2024

how adversarial ml can turn an ml model against itself

Machine learning (ML) is at the very center of the rapidly evolving artificial intelligence (AI) landscape, with applications ranging from cybersecurity to generative AI and marketing. The data interpretation and decision-making capabilities of ML models offer unparalleled efficiency when you’re dealing with large datasets. As more and more organizations implement ML into their processes, ML models have emerged as a prime target for malicious actors. These malicious actors typically attack ML algorithms to extract sensitive data or disrupt operations.

What Is Adversarial ML?

Adversarial ML refers to an attack where an ML model’s prediction capabilities are compromised. Malicious actors carry out these attacks by either manipulating the training data that is fed into the model or by making unauthorized alterations to the inner workings of the model itself.

How Is An Adversarial ML Attack Carried Out?

There are three main types of adversarial ML attacks:

Data Poisoning

Data poisoning attacks are carried out during the training phase. These attacks involve infecting the training datasets with inaccurate or misleading data with the purpose of adversely affecting the model’s outputs. Training is the most important phase in the development of an ML model, and poisoning the data used in this step can completely derail the development process, rendering the model unfit for its intended purpose and forcing you to start from scratch.

Evasion

Evasion attacks are carried out on already-trained and deployed ML models during the inference phase, where the model is put to work on real-world data to produce actionable outputs. These are the most common form of adversarial ML attacks. In an evasion attack, the attacker adds noise or disturbances to the input data to cause the model to misclassify it, leading it to make an incorrect prediction or provide a faulty output. These disturbances are subtle alterations to the input data that are imperceptible to humans but can be picked up by the model. For example, a car’s self-driving model might have been trained to recognize and classify images of stop signs. In the case of an evasion attack, a malicious actor may feed an image of a stop sign with just enough noise to cause the ML to misclassify it as, say, a speed limit sign.

Model Inversion

A model inversion attack involves exploiting the outputs of a target model to infer the data that was used in its training. Typically, when carrying out an inversion attack, an attacker sets up their own ML model. This is then fed with the outputs produced by the target model so it can predict the data that was used to train it. This is especially concerning when you consider the fact that certain organizations may train their models on highly sensitive data.

How Can You Protect Your ML Algorithm From Adversarial ML?

While not 100% foolproof, there are several ways to protect your ML model from an adversarial attack:

Validate The Integrity Of Your Datasets

Since the training phase is the most important phase in the development of an ML model, it goes without saying you need to have a very strict qualifying process for your training data. Make sure you’re fully aware of the data you’re collecting and always make sure to verify it’s from a reliable source. By strictly monitoring the data that is being used in training, you can ensure that you aren’t unknowingly feeding your model poisoned data. You could also consider using anomaly detection techniques to make sure the training datasets do not contain any suspicious samples.

Secure Your Datasets

Make sure to store your training data in a highly secure location with strict access controls. Using cryptography also adds another layer of security, making it that much harder to tamper with this data.

Train Your Model To Detect Manipulated Data

Feed the model examples of adversarial inputs that have been flagged as such so it will learn to recognize and ignore them.

Perform Rigorous Testing

Keep testing the outputs of your model regularly. If you notice a decline in quality, it might be indicative of an issue with the input data. You could also intentionally feed malicious inputs to detect any previously unknown vulnerabilities that might be exploited.

Adversarial ML Will Only Continue To Develop

Adversarial ML is still in its early stages, and experts say current attack techniques aren’t highly sophisticated. However, as with all forms of tech, these attacks will only continue to develop, growing more complex and effective. As more and more organizations begin to adopt ML into their operations, now’s the right time to invest in hardening your ML models to defend against these threats. The last thing you want right now is to lag behind in terms of security in an era when threats continue to evolve rapidly.

Related Topics:Artificial Intelligence Machine Learning

Up Next

True Gamers & Takefluence Reveal New Ambassador Program

Don't Miss

Advancing MENA Health Through AI Vascular Age Analysis

Click to comment

News

Google Releases Veo 2 AI Video Tool To MENA Users

The state-of-the-art video generation model is now available in Gemini, offering realistic AI-generated videos with better physics, motion, and detail.

Published

1 week ago

April 16, 2025

Nour Nasir

Google

Starting today, users of Gemini Advanced in the MENA region — and globally — can tap into Veo 2, Google’s next-generation video model.

Originally unveiled in 2024, Veo 2 has now been fully integrated into Gemini, supporting multiple languages including Arabic and English. The rollout now brings Google’s most advanced video AI directly into the hands of everyday users.

Veo 2 builds on the foundations of its predecessor with a more sophisticated understanding of the physical world. It’s designed to produce high-fidelity video content with cinematic detail, realistic motion, and greater visual consistency across a wide range of subjects and styles. Whether recreating natural landscapes, human interactions, or stylized environments, the model is capable of interpreting and translating written prompts into eight-second 720p videos that feel almost handcrafted.

Users can generate content directly through the Gemini platform — either via the web or mobile apps. The experience is pretty straightforward: users enter a text-based prompt, and Veo 2 returns a video in 16:9 landscape format, delivered as an MP4 file. These aren’t just generic clips — they can reflect creative, abstract, or highly specific scenarios, making the tool especially useful for content creators, marketers, or anyone experimenting with visual storytelling.

Also Read: Getting Started With Google Gemini: A Beginner’s Guide

To ensure transparency, each video is embedded with SynthID — a digital watermark developed by Google’s DeepMind. The watermark is invisible to the human eye but persists across editing, compression, and sharing. It identifies the video as AI-generated, addressing concerns around misinformation and media authenticity.

While Veo 2 is still in its early phases of public rollout, the technology is part of a broader push by Google to democratize advanced AI tools. With text-to-image, code generation, and now video creation integrated into Gemini, Google is positioning the platform as a full-spectrum creative assistant.

Access to Veo 2 starts today and will continue expanding in the coming weeks. Interested users can try it out at gemini.google.com or through the Gemini app on Android and iOS.