Gemini Nano to Get Multimodal Capabilities; Coming to Pixel Later This Year

Google has announced that it is bringing multimodal capabilities to Gemini Nano. With these capabilities, Gemini Nano will be able to understand context not only from text input but also from images, sounds, and spoken language.

Google’s ongoing I/O 2024 event has been dominated by AI announcements. Along with updates such as Veo, an AI video generator meant to rival OpenAI’s Sora, and Gemini 1.5 Flash, Google has also announced that it is bringing multimodal capabilities to Gemini Nano, its on-device LLM. This means Gemini Nano will be able to accept audio, images, and files in addition to text input.

Soon we’re adding multimodal capabilities to Gemini Nano. That means your phone can understand the world the way you understand it, through text, sights, sounds and spoken language. #GoogleIO pic.twitter.com/9QOPmbX98V— Google (@Google) May 14, 2024

For those who are unaware, Gemini Nano is a small, lightweight LLM that performs AI tasks on-device. Google announced Gemini Nano in December 2023 along with Gemini Ultra and Gemini Pro. As of now, Gemini Nano is available only on the Google Pixel 8 series and Samsung Galaxy S24. In its current state, however, Gemini Nano accepts only text input.

With multimodal capabilities, Gemini Nano will be able to draw contextual information from sounds, images, and spoken language in addition to text. As for availability, Google says that it will roll out multimodal capabilities to Gemini Nano starting with Pixel later this year.

Anmol Sachdeva

With 6 years of experience as a writer and editor in the tech media industry, Anmol is an enigmatic savant in all kinds of tech. He loves to scour the internet for new information. When not conjuring words, Anmol can be found watching Manchester United matches or glued to his MacBook watching re-runs of his favorite TV shows for the umpteenth time.
