Google I/O 2024 showcased the tech giant’s latest advancements and this year had a clear focus on artificial intelligence. The event was packed with announcements, unveiling new AI-powered features, enhancements in search technology, and updates to Google’s suite of apps. Let’s dive into the key highlights from the event.
Highlights from Google I/O 2024
Gemini Nano with Multimodality
Google’s AI model, Gemini Nano, received a significant upgrade and a longer name. It is now known as Gemini Nano with Multimodality. The improved version of the model can process and synthesize information from a wide range of sources. You can give it input in the form of text, photos, audio, videos, and even live feeds from your phone’s camera. The AI can summarize the content or answer questions about it. Google put it on display by scanning a bookcase with a camera and had the AI catalog all the books, delivering a comprehensive list.
Advanced Photo Search Capabilities
Google Photos introduced a robust new feature called Ask Photos, powered by the Gemini AI. The tool allows users to search their photo library with unprecedented accuracy. For instance, you can find your car by telling Ask Photos your license plate number.
AI Integration in Google Workspace
Google is integrating AI across its Workspace suite, including Gmail, Google Drive, Docs, Sheets, and Slides. A new AI assistant, powered by Gemini, will help users draft emails, summarize documents, and answer questions. This assistant, which you can personalize with a name, acts as a productivity companion, helping to coordinate communications, manage project files, and keep track of to-do lists.
The AI can also assist students by explaining homework problems or summarizing meeting notes. Additionally, Circle to Search, an AI tool launched earlier this year, is being improved to assist with academic tasks like solving math problems.
New AI Models and Features
Google introduced two new models in the Gemini AI lineup. Gemini 1.5 Flash, optimized for speed and low latency, is designed for quick response tasks. Meanwhile, Project Astra, a visual chatbot, enhances Google Lens capabilities, allowing users to ask questions about their surroundings using their phone camera. Astra can identify objects, provide contextual information, and even suggest creative ideas.
Creativity Tools from Google Labs
Google Labs showcased its new suite of creative tools, including VideoFX, a generative video model based on Google DeepMind’s Veo. This tool can create 1080p videos from text prompts, offering more flexibility in video production. ImageFX, another tool, generates high-resolution images with improved accuracy.
MusicFX, an AI music generator, introduced DJ Mode, allowing musicians to create loops and samples from text prompts. The feature was highlighted during a live performance by musician Mark Rebillet, adding a fun, creative flair to the keynote.
Evolution in Search
Google’s search capabilities are changing with new AI-driven features. AI-organized search results provide more readable and concise information, while AI overviews summarize information from multiple sources, appearing at the top of search results. These summaries aim to deliver answers without requiring users to click on additional links.
Multi-Step Reasoning is another innovative feature, offering layered information on complex topics. For example, planning a trip can include finding hotels, setting itineraries, and suggesting restaurants, all presented in an organized manner.
If you missed out you can check out the full keynote and much more at the official Google I/O website, just like last years event.