Google Introduces the Gemini AI Model

 Google Unveils Gemini: Its Most Advanced Multimodal AI Model


Google has introduced Gemini, its most sophisticated artificial intelligence model capable of processing diverse types of information, including video, audio, and text.

The magnitude of Google's ambitions is evident in the announcement, positioning Gemini as the company's "largest and most capable AI model." The introduction heralds a "Gemini era," envisioning the integration of this advanced model across various settings, from major corporations to consumer devices like the Google Pixel 8 Pro.

Unlike conventional AI models limited to specific user prompts, such as images or text exclusively, Gemini stands out as a "multimodal" model, offering a more comprehensive approach to information processing, as stated by Google. The tech giant claims that Gemini exhibits enhanced reasoning abilities and a nuanced understanding of information compared to its predecessors.

Alphabet CEO Sundar Pichai emphasized the significance of this new era of models, describing it as one of the most substantial science and engineering endeavors undertaken by the company. The announcement reflects Google's commitment to staying at the forefront of AI development, with Gemini positioned to rival the advancements introduced by Microsoft-backed OpenAI's ChatGPT, launched approximately a year ago.

Google added a portion of the new Gemini model technology to its AI assistant Bard on Wednesday, and said it planned to release its most advanced version of Gemini through Bard early next year.

Alphabet said it is making three versions of Gemini, each of which is designed to use a different amount of processing power. The most powerful version is designed to run in data centers, and the smallest will run efficiently on mobile devices, the company said.

Gemini is the largest AI model that the company’s Google DeepMind AI unit has helped make, but it is “significantly” cheaper to serve to users than the company’s prior, larger models, DeepMind Vice President, Product Eli Collins told reporters.

“So it’s not just more capable, it’s also far more efficient,” Collins said. The latest model still requires a substantial amount of computing power to train, but Google is improving on its process, he added.

Alphabet also announced a new generation of its custom-built AI chips, or tensor processing units (TPUs). The Cloud TPU v5p is designed to train large AI models, and is stitched together in pods of 8,960 chips.

The new version of its customer processors can train large language models nearly three times as fast as prior generations. The new chips are available for developers in “preview” as of Wednesday, the company said.

Comments