Google Introduced Gemini: A Revolutionary Multimodal AI Model

Editor in Chief at ApiX-Drive

Reading time: ~1 min

Google introduces Gemini, its newest AI breakthrough, capable of processing text, images, and videos. This innovative model is being progressively incorporated into Google's ecosystem, initially through the Bard chatbot. Gemini can be used in more than 170 regions, and the capabilities of the AI model will be available to developers through the Google Cloud API.

Future plans for Google's Gemini include its integration into Google's search engine, ads platform, and the Chrome browser. The most advanced version of the AI model is planned for launch in 2024 after a thorough safety assessment. Gemini is unique in its training across various data forms, such as texts, images, video, and audio, and is available in three distinct versions: Ultra, Nano, and Pro, each tailored for varying performance levels and efficiency.

Google is enhancing its Bard chatbot by integrating the Gemini Pro model, designed to improve its analytical and planning skills. Additionally, a special version of Gemini Pro is being incorporated into AlphaCode, a coding tool from Google DeepMind. In 2024, Bard will be equipped with the most sophisticated version, Gemini Ultra, which will also be available via a cloud API.