one year on
Google unveils Gemini 1.0, claims state-of-the-art performance and native multimodality
Google announces Gemini 1.0 in three sizes — Ultra, Pro, Nano — with Ultra claiming a 90% score on MMLU, the first model to beat human experts, and Pro rolling into Bard today.
Google today introduced Gemini 1.0, its most capable AI model yet, claiming state-of-the-art performance on 30 of 32 widely-used academic benchmarks. The model comes in three sizes: Ultra for highly complex tasks, Pro for scaling across a wide range of tasks, and Nano for on-device. Gemini Ultra scored 90.0% on the MMLU benchmark, the first model to outperform human experts. The model is described as ‘natively multimodal,’ trained from the start on text, code, audio, image and video.
Gemini Pro is already powering Bard, Google’s chatbot, in what the company calls the biggest upgrade since Bard launched. The Pro version will be available to developers via API on December 13. Gemini Nano runs on the Pixel 8 Pro, enabling features like Summarize in Recorder and Smart Reply in Gboard. Gemini Ultra is undergoing safety checks and will be available to select customers early next year, followed by a new ‘Bard Advanced’ experience.
Some observers remain skeptical. Analyst Andy Thurai noted Google is ‘clearly playing catch-up with OpenAI’ and pointed to the delayed broader rollout of Ultra as a sign of caution. The company emphasized responsibility, calling Gemini’s safety evaluations the most comprehensive for any Google model to date.
The record
Constellation Research analyst said Google is trying to be a one-stop shop for large enterprises to train LLMs on Google Cloud, but cautioned that Gemini is playing catch-up with OpenAI and questioned how well it will perform in the real world.
One year later — open only if you can handle spoilers
While Gemini failed to immediately dethrone GPT-4 in market mindshare, it established Google as a credible multimodal competitor in the long run. Ultra's MMLU score became a widely cited benchmark, though real-world adoption lagged behind the hype. The Gemini line eventually powered a broad range of Google products, but the 2023 rollout was seen as the first step in a multi-year effort.