Connect with us

News

MIT’s “PhotoGuard” Protects Images From Unauthorized AI Edits

The technology invisibly alters select pixels to throw off algorithmic AI models.

Published

on

mit's photoguard protects images from unauthorized ai edits
MIT CSAIL

As AI continues to develop rapidly, chatbots are gaining the power to create and manipulate images, with Shutterstock and Adobe currently leading the way. Despite the obvious power of AI algorithms, the technology has a few pitfalls, one of which is the unauthorized manipulation of copyrighted artwork and images.

MIT CSAIL thinks it has the answer to this growing problem in the form of PhotoGuard, a new technique that alters select pixels in an image to disrupt AI’s ability to understand what the image is.

The altered pixels are known as “perturbations” and are invisible to the human eye but easily seen by AI bots as they scan the color and position of every pixel in an image. Any edits AI tries to make to a protected image will also apply to the fake pixels, resulting in an unrealistic or broken final image, thanks to PhotoGuard.

Also Read: PicSo Review: A Popular AI-Based Text-To-Image App

“The encoder attack makes the model think that the input image is some other image,” explained MIT student and lead author of the paper, Hadi Salman. “Whereas the diffusion attack forces the diffusion model to make edits towards some target image”. The technique sounds complex but could potentially stop malicious actors from reverse engineering protected images by adding minor edits to circumvent copyright.

“A collaborative approach involving model developers, social media platforms, and policymakers presents a robust defense against unauthorized image manipulation. Working on this pressing issue is of paramount importance today,” Salman said in a recent press release. “And while I am glad to contribute towards this solution, much work is needed to make this protection practical. Companies that develop these models need to invest in engineering robust immunizations against the possible threats posed by these AI tools”.

Advertisement

📢 Get Exclusive Monthly Articles, Updates & Tech Tips Right In Your Inbox!

JOIN 21K+ SUBSCRIBERS

Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

News

Google Releases Veo 2 AI Video Tool To MENA Users

The state-of-the-art video generation model is now available in Gemini, offering realistic AI-generated videos with better physics, motion, and detail.

Published

on

google releases veo 2 ai video tool to mena users
Google

Starting today, users of Gemini Advanced in the MENA region — and globally — can tap into Veo 2, Google’s next-generation video model.

Originally unveiled in 2024, Veo 2 has now been fully integrated into Gemini, supporting multiple languages including Arabic and English. The rollout now brings Google’s most advanced video AI directly into the hands of everyday users.

Veo 2 builds on the foundations of its predecessor with a more sophisticated understanding of the physical world. It’s designed to produce high-fidelity video content with cinematic detail, realistic motion, and greater visual consistency across a wide range of subjects and styles. Whether recreating natural landscapes, human interactions, or stylized environments, the model is capable of interpreting and translating written prompts into eight-second 720p videos that feel almost handcrafted.

Users can generate content directly through the Gemini platform — either via the web or mobile apps. The experience is pretty straightforward: users enter a text-based prompt, and Veo 2 returns a video in 16:9 landscape format, delivered as an MP4 file. These aren’t just generic clips — they can reflect creative, abstract, or highly specific scenarios, making the tool especially useful for content creators, marketers, or anyone experimenting with visual storytelling.

Also Read: Getting Started With Google Gemini: A Beginner’s Guide

To ensure transparency, each video is embedded with SynthID — a digital watermark developed by Google’s DeepMind. The watermark is invisible to the human eye but persists across editing, compression, and sharing. It identifies the video as AI-generated, addressing concerns around misinformation and media authenticity.

While Veo 2 is still in its early phases of public rollout, the technology is part of a broader push by Google to democratize advanced AI tools. With text-to-image, code generation, and now video creation integrated into Gemini, Google is positioning the platform as a full-spectrum creative assistant.

Access to Veo 2 starts today and will continue expanding in the coming weeks. Interested users can try it out at gemini.google.com or through the Gemini app on Android and iOS.

Continue Reading

#Trending