
By futureTEKnow | Editorial Team
Google has just rolled out a new feature for its Gemini AI platform, making it possible for users to transform photos and text into short, 8-second video clips with sound. This update is currently available to subscribers of the Gemini AI Ultra and Pro plans, and it’s accessible through the Gemini chat interface on the web, with a mobile app update coming soon.
Upload a photo and add a text description.
Gemini generates a 720p landscape MP4 video—up to 8 seconds long, complete with sound.
The process is streamlined within the Gemini chat, making it user-friendly for both creators and casual users.
✨📦✨ Special delivery! A new Gemini feature just dropped. Make photos come alive by turning them into videos with sound.
— Google Gemini App (@GeminiApp) July 10, 2025
This isn’t Google’s first foray into AI-powered video creation. The technology was initially showcased as part of Veo 3, Google’s advanced video-generation model, and was previously limited to Flow, a standalone filmmaking tool. Now, by integrating it into Gemini, Google is democratizing access to AI video creation for a much wider audience.
Google’s move comes as the competition heats up in the AI video space. Rivals like OpenAI, Runway, Alibaba, and Kuaishou are all racing to launch their own generative video tools. By embedding this capability directly into Gemini, Google is aiming to keep pace and set new standards for AI-driven creative tools.
To prevent misuse, Google has implemented strict content guidelines:
No videos using images of celebrities, politicians, or public figures.
Prohibited from generating content that promotes violence or bullying.
Despite these safeguards, the technology is still evolving. Early testers found that while Gemini excels at animating nature scenes, drawings, and objects, it struggles with more complex tasks. For instance, attempts to create talking videos from photos sometimes resulted in altered facial features or even changes in race. Simple prompts—like making a plant sway or animating a cat—worked well, but more ambitious requests, such as making a person breakdance, often produced awkward or unintended results.
A Google spokesperson emphasized that the AI is not programmed to change appearances and that improvements, especially for face animation, are in the pipeline.
This update is a significant step for anyone interested in AI-powered content creation. The ability to quickly generate short videos from photos and text opens up new possibilities for storytelling, marketing, and social media engagement. As the technology matures, expect even more sophisticated video generation capabilities to become available to a broader audience.
Gemini AI now converts photos and text into 8-second videos with sound
Available to Ultra and Pro subscribers via web, with mobile support coming soon
Strict content guidelines prevent misuse and protect privacy
Best results currently come from simple animations of objects and nature
Face and complex motion animation are still under development
Google’s latest update signals a new era in AI-assisted creativity, putting powerful video generation tools directly into users’ hands.

Explore the cutting-edge ways AI is enhancing Lean Six Sigma, from real-time process insights to predictive controls, ushering in a new era of operational excellence and efficiency.

Facing supply chain challenges in 2025? High-performing teams leverage AI for risk management, demand forecasting, supplier analytics, and end-to-end visibility to ensure business continuity and resilience.

Craft an AI-powered supply chain Center of Excellence that unifies control tower visibility, analytics, and inventory optimization into one strategic hub. Explore this blueprint to learn how a modern supply chain CoE drives resilience, smarter decisions, and operational excellence in the age of AI.

Supply chain leadership is being redefined by AI, intelligent automation, and agentic decision-making, demanding leaders who can engineer end-to-end intelligence rather than simply manage workflows. This article explores how next-generation supply chain leaders will combine data, algorithms, and human judgment to build resilient, adaptive, and high-performing global operations.

Bridgit Mendler’s Northwood Space is pioneering mass-produced ground stations, enabling scalable, high-speed connectivity for the new era of satellite networks and megaconstellations.

SpaceX aims to nearly double launches from Vandenberg in 2025, facing support from federal agencies but strong objections from the state and local communities.

Traditional Medicare will pilot AI-assisted prior authorization in 2026 across six states, focusing on high-risk outpatient services. Clinicians retain final say, but incentives and access concerns loom as CMS tests fraud reduction and “gold card” exemptions. Here’s what providers and patients should know.

OpenArt’s new “one-click story” compresses scripting, visuals, and edits into ready-to-post short videos—fueling viral growth and a fresh IP debate. We break down how it works, adoption signals, what’s next (multi-character, mobile), and practical guardrails creators and brands should follow to stay original and compliant.

OpenAI’s o3 swept the Kaggle AI chess tournament, defeating xAI’s Grok 4–0. The victory fueled the intense rivalry between Altman and Musk, reshaping AI benchmarks.

NASA and Google’s AI-powered Crew Medical Officer Digital Assistant enables autonomous diagnoses for astronauts on Mars missions, redefining remote healthcare for space and Earth.

Pinterest’s CEO confirms that fully agentic AI shopping is years away, as the platform invests in AI-powered tools to enhance discovery, inspiration, and personalized shopping experiences for millions.

Shopify’s new AI shopping tools are transforming e-commerce, letting agents and chatbots deliver smooth, personalized shopping and checkout experiences across platforms. Learn how these innovations reshape online retail.
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Thanks for visiting futureTEKnow.