Google has officially kicked off the Gemini 2.0 era with the launch of the all-new Gemini 2.0 Flash. The company claims that Gemini 2.0 Flash outperforms Gemini 1.5 Pro on key benchmarks while running up to twice as fast as its predecessor.
As such, Gemini 2.0 Flash will become Google's flagship AI model, competing directly with offerings from OpenAI and other big names in the market. In addition to improved performance and low latency, Gemini 2.0 Flash comes with native support for multimodal output, including natively generated images mixed with text and steerable multilingual text-to-speech (TTS) audio. The model also accepts multimodal inputs such as images, video, and audio, and is tightly integrated with native tools, including Google Search, code execution, and more.
In simple terms, Gemini 2.0 Flash stands out for its ability to process multiple types of input (text, images, video, audio) and produce diverse outputs, including images and speech. The previous generation, 1.5 Flash, could only generate text and was not suited to more demanding tasks. With 2.0 Flash, Google claims the model is not only fast but also highly flexible, thanks to its ability to call tools such as Google Search and connect to external APIs.
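To make the tool-calling idea concrete, here is a minimal sketch of how a developer might ask Gemini 2.0 Flash a question with the built-in Google Search tool enabled, using Google's google-genai Python SDK. The model ID gemini-2.0-flash-exp, the placeholder API key, and the example prompt are assumptions for illustration, not details from Google's announcement.

```python
# Hypothetical sketch: querying Gemini 2.0 Flash with the native Google Search
# tool enabled, via the google-genai Python SDK.
# The model ID "gemini-2.0-flash-exp" and the API key are assumed placeholders.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents="Summarize today's top technology headlines.",
    config=types.GenerateContentConfig(
        # Turn on the built-in Google Search tool so the model can ground its answer.
        tools=[types.Tool(google_search=types.GoogleSearch())],
    ),
)

print(response.text)
```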
Developers can try the experimental release of Gemini 2.0 Flash in Google AI Studio and Vertex AI today. Additionally, Google is rolling out a free beta of the new Multimodal Live API, which supports real-time audio, streaming video input, and combined use of multiple tools.
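As a second illustration, the sketch below shows what a simple multimodal request might look like for a developer trying the model out: sending an image alongside a text prompt and reading back the text response. Again, the google-genai SDK usage, the model ID gemini-2.0-flash-exp, and the file name chart.png are assumptions for illustration only.

```python
# Hypothetical sketch: a multimodal request that pairs an image with a text prompt.
# The model ID, API key, and input file are assumed placeholders.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

# Load a local image to send alongside the text prompt.
with open("chart.png", "rb") as f:
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/png"),
        "Describe what this chart shows in two sentences.",
    ],
)

print(response.text)
```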
The new Gemini 2.0 Flash model will be available to users through the Gemini experience on desktop and the web, with mobile apps to follow soon. Google plans to announce general availability of Gemini 2.0 Flash in January 2025.
Along with Gemini 2.0 Flash, Google also announced a number of prototypes exploring the agentic capabilities of Gemini 2.0.
With its multimodal capabilities and native tool integration, Gemini 2.0 Flash opens up exciting possibilities for both developers and users.