#Google Gemini
34 Stories
Google’s Experimental Gemini Model Tops the Leaderboard, But Stumbles in My Tests
View quick summary
Google's upcoming Gemini-exp-1114 model has topped the LMArena leaderboard, outranking OpenAI and Anthropic models. However, in my reasoning tests, the model fails to answer tricky questions. In my book, it's still behind OpenAI's o1 models.
I Finally Gave Google’s NotebookLM a Shot; Here’s How to Use It
View quick summary
If you have not tried Google's NotebookLM AI tool yet, you must check it out. It lets you upload your personal notes and documents, and you can chat with the notebook and gain new insights. You can even listen to an AI-generated podcast discussing your personal notes.
Android users can now use Gemini Live for free. Google is rolling out the feature in English first. Support for more languages is coming soon.
Gemini Gems: How to Make Your Own Custom Gemini AI Chatbot
View quick summary
Google has finally rolled out custom Gems to Gemini. It allows you to create custom AI chatbots for your specific use case. You can add custom instructions to steer the model for your specific role. That said, you can't upload files to add extra knowledge.
I Tried Out Gemini Live; It Can’t Compete with ChatGPT Advanced Voice Mode
View quick summary
Google's Gemini Live was touted as the answer to ChatGPT's Advanced Voice Mode. Turns out, Gemini Live is a glorified text-to-speech engine, backed by an LLM. It can't understand emotions, the mood of the speaker, or the tone and tenor of the speech. That said, the new Gemini Live experience supports interruptions, but the experience is broken.
How to Use Gemini Live on Any Android Phone Right Now
View quick summary
Google's answer to OpenAI Advanced Voice Mode is finally here. Dubbed Gemini Live, it allows users to have a natural, free-flowing conversation with the AI model. You can interrupt it, pause the conversation, and resume it later on. Currently, Gemini Live is available in 10 voices. If you are an Android user, you can start using it today by subscribing to Gemini Advanced.
Gemini Now Supports File Uploads & Here’s How It Stacks Up Against ChatGPT
View quick summary
You can now upload documents to Gemini and ask questions or perform data analysis. Keep in mind, you need a Gemini Advanced subscription to be able to upload local files. In my testing, Gemini Advanced performed as good as ChatGPT. But ChatGPT offers file uploads for free and there is support for a wide variety of file formats as well.
Google Gemini App Arrives in India with Support for 9 Regional Languages
View quick summary
Google has finally launched the Gemini app in India. It brings support for 9 Indian languages. Apart from that, Indian users can also access Gemini Advanced features such as file uploads and data analysis. In addition, you can chat with Gemini in the Google Messages app as well.
Gemini Nano to Get Multimodal Capabilities; Coming to Pixel Later This Year
View quick summary
Google has announced that it is bringing multimodal capabilities to Gemini Nano. With multimodal capabilities, Gemini Nano will be able to understand contextual information not only from text input but also from audio, images, and spoken language.
Google Search AI Overviews Now Available to Everyone in the US
View quick summary
Google's AI overviews, previously known as SGE, are now available to all users in the US. This feature offers concise, AI-summarized answers and is designed to make searching faster and more efficient. It's available on web, Android, and iOS. Google also introduced multi-step reasoning, allowing for more complex questions and personalized results.
Google Photos Gets a Gemini Upgrade with the ‘Ask Photos’ Feature
View quick summary
Google has announced in Google I/O 2024 that the Google Photos app will get a new Ask Photos feature this Summer which will allow you to search for photos in a more personalized way.
You Can Chat with Gemini Directly in Chrome’s Search Bar; Here’s How
View quick summary
You can now chat with Gemini right from Chrome's address bar. You just have to type "@" and select "Chat with Gemini" to summon the AI chatbot. Enter your prompt and hit Enter. It will take you to Gemini's portal where you can find the answer. The feature is currently available to Chrome desktop users only.
Gemini on Android Will Display Results Over Other Apps Soon
View quick summary
Gemini user experience could soon get better with the ability to respond over other apps. The current Gemini experience is tardy because whenever you ask questions, the app force switches and takes you from whatever app you have opened to the Gemini app to show the results
Google Gemini to Support Music Streaming Apps Soon
View quick summary
There's a hidden feature that has been spotted inside the Gemini Settings page which lets you "Choose your default media provider". It will help users select a streaming service as the default, allowing Gemini to start accepting commands related to music streaming, such as "Play my liked songs".
How to Access and Use Gemini 1.5 Pro API Right Now
View quick summary
Google has finally given access to Gemini 1.5 Pro API to all users. You can generate the API key from Google AI Studio and start calling the API for both text and image inputs. The API access is currently free as part of a public preview. We have also added some coding examples so go through our guide to understand the documentation.
How to Use Google Gemini to Summarize YouTube Videos
View quick summary
Using Google Gemini, you can summarize YouTube videos instantly and save up time. On Android, head over to the Gemini app -> profile icon -> Extensions -> toggle on YouTube extension -> then paste the YouTube video link in the Gemini text box -> hit send. On iOS, go to the Google app -> switch to Gemini -> follow same steps as Android. You can also visit Gemini on the web using the browser of your choice and summarize YouTube videos quickly.
Apple’s AI Features on iPhones Could Be Powered by Google Gemini After All
View quick summary
Apple and Google are actively negotiating to build Google's Gemini Artificial Intelligence engine into an iPhone. This could be one of the blockbuster deals that could shake up the AI industry. The Google partnership could help Apple to deliver a range of generative AI features on its devices. While Apple and Google are actively negotiating, a final deal will not be announced until at least June.
You Can Now Edit and Modify Google Gemini Responses; Here’s How
View quick summary
Google's latest update for Gemini allows users to modify and regenerate specific portions of a response according to their preferences. The feature can be used to replace portions of text with a different prompt or you can simply ask Gemini to regenerate selected text or make it shorter or longer. The new feature is available for everyone but is limited to the Gemini web app.
How to Use Gemini AI Chatbot on iPhone
View quick summary
You can easily use the Gemini AI chatbot on your iPhone using the Google app or a web browser. All you need is a compatible device and a personal Google account. You can use the free version or upgrade to Gemini Advanced to unlock all the features.
Claude 3 Opus vs GPT-4 vs Gemini 1.5 Pro AI Models Tested
View quick summary
As has become a tradition now, I love testing new AI models, and when Anthropic released its new Claude 3 models, I knew I had to test out their tall claims. So, I compare the Claude 3 Opus model against OpenAI's GPT-4 and Google's Gemini 1.5 Pro model in this extensive comparison. Does Claude have the magic to beat ChatGPT and Gemini? Find out.
AI Models in India to Require Govt Approval; What are the Implications?
View quick summary
The Indian IT Ministry (MeitY) has issued a new advisory for tech companies offering AI models and services in India. The advisory asks tech firms to get approval before deploying "untested" AI models in India. The government later clarified that the advisory applies to large companies like Google, OpenAI, Microsoft, etc., and not startups. The advisory also requests companies to embed permanent metadata in generated data to easily identify the first originator.
How to Sign Up for Gemini 1.5 Pro Waitlist to Get Early Access
View quick summary
You can sign up for the Gemini 1.5 Pro waitlist via Google AI Studio and get early access to the flagship model with a context window of 1 million tokens. The model is currently in preview, so Google is offering access for free to test and evaluate the model. That said, there is no API available for Gemini 1.5 Pro yet, just like Gemini 1.0 Ultra.
We got our hands on the Gemini 1.5 Pro model via Google AI Studio, and after probing the model on a multitude of tests, we can say that Google has finally delivered an immensely powerful AI model. It's easily on par with GPT-4 model by OpenAI and surpasses Google's largest Gemini 1.0 Ultra model. It's excellent at advanced reasoning, can process videos, handles large corpus of data in a single window, and you can do so much more. Read our detailed comparison between Gemini 1.5 Pro, Gemini 1.0 Ultra, and GPT-4.
After Gemini generated some inaccurate and offensive images, Google has been accused of anti-white bias by critics from many quarters. In response, Google has temporarily turned off image generation of people in Gemini. Moreover, many accuse Google of aggressively tuning the model to represent diversity which seems to have backfired. So what explains this debacle and Google's overall approach to AI? Read on to find out.
Gemini Advanced Gets a ChatGPT-like Code Interpreter
View quick summary
Google is consistently adding new features to Gemini Advanced and now it has brought the ability to edit and run Python code inside the chatbot itself. For students and developers, it's a helpful addition as it allows you to run code directly without moving to Google Colab or other services. Unlike ChatGPT's versatile Code Interpreter, the Python interpreter in Gemini is only limited to coding tasks. To access the feature, you need to subscribe to Gemini Advanced which costs $20 per month, but Google is offering two months of free trial along with Google One benefits.
Google Introduces Gemini 1.5 Pro with a Massive 1 Million Context Window
View quick summary
After releasing Gemini Advanced with the Ultra 1.0 model last week, Google has announced its next-generation model called Gemini 1.5 Pro. The new model is built on the Mixture-of-Experts (MoE) architecture and supports a massive context window of up to 1 million tokens. It performs nearly the same as the much larger Gemini 1.0 Ultra model. The AI model is currently in limited preview and developers can test it on AI Studio after joining the waiting list.
Gemini Ultra vs GPT-4: Google Still Lacks the Secret Sauce
View quick summary
Touted to be the most capable model by the Google DeepMind team, the Gemini Ultra 1.0 model has been finally released. Gemini Advanced is powered by the Ultra model so we compared it with OpenAI's GPT-4 model. In our tests, we found that Gemini Advanced still lacks rigorous commonsense reasoning capability, hence, it's nowhere close to GPT-4. However, in creating writing, it does a better job than GPT-4. In coding tasks, it's indeed an improvement, but with tools like Code Interpreter in GPT-4, you are likely to get better results in ChatGPT.
How to Replace Google Assistant with Gemini AI on Your Android Phone
View quick summary
Google has officially started to roll out its Gemini Android app. However, it is currently limited to US users, and if you are not from the US, you will need to make a US Google account. Once you make it, you will be able to find the app listed on Play Store. Now, to set the Gemini AI as your primary voice assistant, open the Gemini app -> tap on Get started -> select I agree. That should automatically set it as your voice assistant. However, if you want to switch between Gemini and Google Assistant, open the Gemini app -> tap on profile icon -> Settings -> Digital assistants from Google -> select the desired assistant.
What is Gemini Advanced and How to Get Subscription
View quick summary
Google has launched a subscription plan called Gemini Advanced that is powered by the Gemini Ultra 1.0 model. Unlike the free Gemini (formerly Bard), Gemini Advanced is much better at highly complex tasks and performs close to OpenAI's GPT-4 model. It costs $20 per month, but you get two months of free trial. Gemini Advanced also bundles 2TB of storage, and Google One benefits. And soon, Gemini Advanced users will be able to access Gemini AI in apps like Gmail, Docs, and more.
Google Rebrands Bard to Gemini; Debuts Ultra 1.0 Model & Android App
View quick summary
Google is continuously positioning itself as the AI industry leader, and now, the search giant has rebranded Bard to Gemini in a bid to bring Google's family of best AI models to users. It has also introduced Gemini Advanced, which hosts the powerful Ultra model. However, you will have to pay $20 per month to access it. Moreover, Android users can now install the Gemini app on their smartphones whereas iPhone users can access Gemini from the Google app.
How to Access and Use Google Gemini API Key (with Examples)
View quick summary
Recently, Google released the API key for its Gemini Pro model. Currently, the company is offering text-only and text-and-vision models based on the Gemini Pro model. The best part is that Google is allowing users to test the API for free and without setting up Google Cloud billing, at least for now. To teach you how to set up the Gemini AI API key and use it, we have included three coding examples to showcase the API usage. You can test Gemini Pro's multimodal capability as well through the API, which is not yet available in Bard.
Google Gemini AI: Multimodal, GPT-4 Competitor, and More
View quick summary
Google has finally released a true multimodal AI model called Gemini. While multimodal features are not live yet, you can use Bard to check out the Gemini Pro model. Pixel users can also experience on-device AI with the Gemini Nano update. The most powerful Gemini Ultra model is quite impressive and beats the GPT-4 model on several benchmark tests. In multimodal tests too, Gemini Ultra dethrones GPT-4V model. However, the Ultra model has not been launched yet and is due to go live early next year.
How to Enable and Use Gemini Extensions
View quick summary
Extensions on Gemini allow you to connect to many Google services. In your conversations, you can chat with Gemini and pull data from YouTube, Google Flights, Gmail, Drive, Docs, and more. Keep in mind, your personal data from Gmail, Drive and Docs is not used for training Gemini. You can even use the YouTube extension to chat and summarize YouTube videos.