Google DeepMind CEO demonstrates Genie 2, world-building AI model that could train robots


Google DeepMind CEO Demis Hassabis demonstrated Genie 2, a world-building AI model that could be used to train robots. In the same 60 Minutes segment, correspondent Scott Pelley tested Astra, DeepMind’s AI assistant, which can recognize objects and generate narratives from visual input. In the demonstration, Astra interpreted artwork and built stories around it, including the emotional context of the scenes it was shown.

The segment also featured two of DeepMind’s generative models. Veo 2 produces photorealistic video from text prompts. Genie 2 turns a static image into an interactive 3D environment that users or AI agents can explore and interact with.

Hassabis described practical applications for this technology in entertainment, gaming, and the training of AI systems and robots. Simulated environments can supply large amounts of training data, reducing the need for costly real-world data collection. Hassabis also mentioned exploring the use of Google’s geographic data to improve AI systems’ understanding of the real world, potentially making static images interactive and immersive.

