#AI - Page 5
429 Stories
You Can Chat with Gemini Directly in Chrome’s Search Bar; Here’s How
View quick summary
You can now chat with Gemini right from Chrome's address bar. You just have to type "@" and select "Chat with Gemini" to summon the AI chatbot. Enter your prompt and hit Enter. It will take you to Gemini's portal where you can find the answer. The feature is currently available to Chrome desktop users only.
Hellboy AI Character Design Controversy: Is Using AI Really That Bad?
View quick summary
It's no secret that a new Hellboy movie with the title of Hellboy: The Crooked Man is being made. In a recent update, accusations of using AI for character design have been made on the director of this movie which he has confirmed to be false and to be a major quoting error made by the media house who published the report. However, my question is, is the use of AI actually that bad? Read about what I think in this article.
OpenAI Finally Relents; You Can Disable Model Training in ChatGPT While Retaining Chat History
View quick summary
You can now opt-out of model training on ChatGPT via Settings and also retain chat history. Earlier, OpenAI used to disable chat history if you refused to share your data for model training. You can find the new data controls under ChatGPT Settings.
In Today’s AI Race, Don’t Gamble with Your Digital Privacy
View quick summary
As we are moving towards the AI era, things are developing at a breakneck pace. In all of this, we have to be mindful of our privacy and how to protect it. In this article, we look at the privacy policies of popular AI chatbots and how companies handle private conversations. We have also discussed how you can minimize your data footprint and opt out of model training.
ChatGPT Extends Its AI Memory Feature to All Paid Users
View quick summary
OpenAI is now making the Memory feature available to all paid ChatGPT Plus users. It can remember your preferences and key details about yourself to make the chat experience more personal. The feature is turned on by default, but you can disable it from the Settings page.
Nvidia DLDSR: The RTX AI Feature You Might Be Missing Out On
View quick summary
Nvidia DLDSR is a feature that can be enabled via the Nvidia Control Panel. It is present under Manage 3D Settings > Global Settings > DSR - Factors > DL Scaling. When enabled, DLDSR upgrades the quality of your games using AI-based super sampling. Effectively, it can turn the rendering resolution of the game higher than what is supported by your monitor. The feature can even be enabled alongside DLSS for even more AI-enhanced goodness.
Microsoft Unveils Phi-3 Mini Model, Small Enough to Run on Phones
View quick summary
Microsoft is on a roll with smaller language models. It introduced a new family of Phi-3 models with the smallest, Phi-3 Mini available right away. The smallest model is trained on 3.8B parameters and beats larger models like Gemma 7B, Mistral 7B, and Llama 8B.
How to Install and Run LLMs Locally on Android Phones
View quick summary
Do you want to run AI models locally on your Android phone? Well, with the MLC Chat app, you can now download LLMs like Gemma, Phi-2, Mistral, Llama 3, etc., and chat with them natively on Android devices. The token generation is slow though, especially on older Snapdragon-based Android devices.
Llama 3 vs GPT-4: Meta Challenges OpenAI on AI Turf
View quick summary
Meta released its Llama 3 models recently so we have taken the liberty to compare the 70B model with the flagship GPT-4 model. Surprisingly, Llama 3 performs as good as the GPT-4 model despite being a smaller model. In advanced reasoning tests, it demonstrates intelligence and does better than GPT-4 in following user instructions.
Meta Launches Llama 3 Model; Integrates AI Across Meta Apps
View quick summary
Meta has finally unveiled its next family of Llama 3 models in two sizes: 8B and 70B parameters. The company has launched its meta.ai portal where you can chat with Llama 3 models. Besides that, you can find Meta AI experiences on WhatsApp, Facebook, Instagram, Messenger, and more. Currently, the context length is 8K tokens, but Meta will be releasing larger models with increased context length and multimodal capability in the near future.
Microsoft Edge Too Wants You to Circle to Search Like Google
View quick summary
Microsoft is working on a Circle to Copilot feature that is similar to Circle to Search by Google. The feature is available to try out in the Edge browser on iPhone and iPad. The feature is still in development and there is no confirmation when it will come to Android devices.
AnythingLLM Lets You Chat With Documents Locally; Here’s How to Use It
View quick summary
If you are looking for an easy solution that lets you chat with your documents locally, you can check out AnythingLLM. It has a slick GUI interface and works even on consumer-grade PCs. No need to have high-end GPUs. Apart from that, you can ingest a variety of file formats including TXT, PDF, CSV, audio files, spreadsheets, and more.
Even a Mouse Now Has an AI Button, Thanks to Logitech
View quick summary
Nobody is safe from the clutches of AI as Logitech has now launched the new M750 AI Edition along with the Logitech Prompt Builder software. Combining ChatGPT integration with recipes, the builder can be assigned as a shortcut with certain Logitech mouse and comes up in a neat pop-up window. I had lots of fun with the app as it let me rewrite text, send custom e-mails, and even draft witty replies all with the power of ChatGPT! The app is free to use but does require the Logi Options App+.
How to Access and Use Gemini 1.5 Pro API Right Now
View quick summary
Google has finally given access to Gemini 1.5 Pro API to all users. You can generate the API key from Google AI Studio and start calling the API for both text and image inputs. The API access is currently free as part of a public preview. We have also added some coding examples so go through our guide to understand the documentation.
Gemini 1.5 Pro Now Listens to Audio and Is Available to All
View quick summary
You no longer have to wait to access the Gemini 1.5 Pro model. Google has made the model generally available to all users via Google AI Studio. You can access a context window of 1 million tokens without paying any fee. In addition to that, Gemini 1.5 Pro can now process audio files too, apart from videos and images.
Google’s Imagen 2 Model Can Generate Four-Second Video Clips
View quick summary
Google at the Cloud Next 2024 event made its Imagen 2 model generally available to enterprise customers. One of its new features is that it can generate short video clips aka live images of up to four seconds. In addition, the generation clips show different camera angles and motion. Users can also inpaint and outpaint images on Vertex AI.
Spotify AI Playlist is Here: Curate Music using Text Prompts
View quick summary
Spotify's new AI Playlist feature allows users to create personalized playlists by entering a prompt, which the AI will use to generate a playlist based on the user's listening history. The feature is currently available for Premium members in Australia and the UK, and will soon expand to other regions.
Soon You Might Have to Pay to Access Google’s AI Search Features
View quick summary
Google announced its SGE (Search Generative Experience) last year, and now, we are getting to see this entire thing become "premium". In other words, you will have to pay for the AI features integrated within Google Search. For those unaware, generative AI in Google Search allowed you to get a summary of your query, thereby providing all the information you need right at the top. Now, the new Financial Times report suggests that you will soon need to pay the premium $19.99/month fee of Google One AI to make use of it.
How to Use Google Gemini to Summarize YouTube Videos
View quick summary
Using Google Gemini, you can summarize YouTube videos instantly and save up time. On Android, head over to the Gemini app -> profile icon -> Extensions -> toggle on YouTube extension -> then paste the YouTube video link in the Gemini text box -> hit send. On iOS, go to the Google app -> switch to Gemini -> follow same steps as Android. You can also visit Gemini on the web using the browser of your choice and summarize YouTube videos quickly.
YouTube CEO Says OpenAI Could’ve Violated Platform Rules For Sora Training
View quick summary
Speaking to Bloomberg, the YouTube CEO, Neal Mohan said that OpenAI would violate YouTube's Terms of Service if the company used YouTube videos to train its Sora model. Google also uses some corpus of YouTube videos to train its Gemini model, but signs licensing contracts with individual creators. Earlier, the CTO of OpenAI, Mira Murati, refused to clarify whether Sora was trained on YouTube videos.
DALL-E Now Lets You Edit the AI Images You Generate, Here’s How
View quick summary
DALL-E can generate solid visuals but has lagged behind Midjourney in editing, thus limiting its creative appeal. However, thanks to OpenAI's new update, the tool now offers a powerful new editing interface that allows users refine portions of an images with modified prompts. The new interface boasts a "Select" option that offers precise targeting of areas for modification, enabling you to add, remove, or adjust elements.
ChatGPT’s Free Version Can Now Be Used Without Logging In
View quick summary
Just like Microsoft Copilot, you can now use ChatGPT without an account. You no longer need to sign in or create an account to use the free version of ChatGPT. The GPT-3.5 model is readily available to all free users including those without an account. Keep in mind, you won't get some features like custom instructions, chat history, chat sharing, and voice conversations without an account.
What Is an AI PC and Should You Buy It in 2024?
View quick summary
The new AI PC era is here. Basically, AI PCs have a special hardware component known as Neural Processing Unit (NPU). They are also designed with a better GPU than previous-generation PCs for the upcoming AI era of computing. While an AI PC can be worth it if there are features that can benefit your work, we are a long way away from the AI PC maturing. So, it's not worth buying an AI PC for now, unless you can take advantage of the features.
ChatGPT Now Cites the Sources for Its Answers
View quick summary
OpenAI's ChatGPT will now cite sources in its responses similar to Microsoft's Copilot, providing transparency and legitimacy in reponses. This feature is limited to paid versions: ChatGPT Plus, Team, and Enterprise unlike Copilot which offers it for free. The announcement for this was made by OpenAI through an X post on Friday.
Why Google Keep’s ‘Help Me Create’ Is the Best Feature for Procrastinators
View quick summary
Google is rolling out a new generative AI feature, this time inside the Google Keep app. The feature called "Help me create a list" takes input and can generate a list. It makes it easier to add things to your list on the get-go and skip the research part. The feature appears as you tap the "+" icon in Google Keep to create a new Note. The "Help me create a list" button in gradient appears on the bottom-right corner.
OpenAI’s Voice Engine Can Clone Human Voices From a 15-Second Sample
View quick summary
After introducing Sora, a remarkable text-to-video AI model, OpenAI has previewed its Voice Engine model that can clone voices with a single 15-second audio sample. It can translate voices into different languages with high accuracy. Since there are risks associated with the technology, the company is not releasing the speech synthesis model now. The company encourages society to understand the capabilities of AI and adapt to a new reality.
Elon Musk’s xAI Announces Grok-1.5 With 128K Context Length
View quick summary
Elon Musk's xAI firm has launched a new intermediate model called the Grok-1.5. It comes with improved reasoning capabilities and supports a large context length of 128K tokens. On the NIAH test, the model has shown great retrieval capability. xAI says early testers and existing Grok users will be able to access the model on X (formerly Twitter) in the coming days.
Snapchat Now Lets You Write Captions Using AI; Here’s How
View quick summary
Snapchat offers an AI Caption feature that uses AI to analyze the image in your Snap and generate a caption based on its content. The feature is currently only available in a limited number of regions, including the US and UK and can only be used by Snapchat+ members. You can create AI Captions on Snapchat by taking a Snap first and then navigating to T icon > AI captions.
GPT-5 Might Release in Summer 2024; ‘Materially Better’ Than GPT-4
View quick summary
The ChatGPT maker, OpenAI, is reportedly planning to release the GPT-5 model during the summer of this year. A new report says that OpenAI has already demoed the GPT-5 model to a few enterprise customers and one of the CEOs called it "materially better" than GPT-4. OpenAI is also working on AI agents that can perform complex actions.
Microsoft Appoints Google Deepmind Co-founder Mustafa Suleyman to Lead Its AI Efforts
View quick summary
In another reshuffle, Microsoft has hired Mustafa Suleyman, the co-founder of Google DeepMind, and Inflection to lead AI innovations at Microsoft. Suleyman will oversee the development of Copilot, Edge, and Bing and how AI can be leveraged to make meaningful product improvements.
8 Best AI Voice Generators in 2024
View quick summary
This list will offer some of the best options in the realm of AI powered tools that can reproduce human like voice from given text. These include options like the popular Speechify voice generator and other tools like Play.HT that generate almost natural voice-overs. You can try out any of the ones that we have mentioned in this list and see which one works best for you.
LimeWire AI API Review: Seamless Content Creation for Developers
View quick summary
If you are a developer building AI products, LimeWire AI API offers great flexibility and pricing. It hosts multiple Diffusion models from OpenAI, Stability AI, Google, etc. to deliver generative AI features. You can use the API to generate images, upscale images, inpaint, and oupaint images as well. In our testing, the API performed pretty well. Not to mention, the API implementation is straightforward and supports many popular languages.
Google Demos a New AI That Can Play Video Games with You
View quick summary
The Google DeepMind team has developed a generalized AI agent called SIMA that can perform actions in a wide range of virtual games. Google has trained the AI agent on nine video games including No Man's Sky, Teardown, Valheim, and more. It can carry out tasks even on unseen video games which makes it a unique AI agent.
OpenAI Blog Leaks GPT-4.5 Turbo; Sparks Interest
View quick summary
OpenAI might be working to release an intermediate GPT-4.5 Turbo model before launching the next-gen GPT-5 model. It was leaked through the OpenAI Blog page which was indexed by Bing and DuckDuckGo. The upcoming model promises better "speed, accuracy, and scalability" than the GPT-4 Turbo model. However, strangely, the captured description says the model will have a knowledge cutoff of up to June 2024.
You Can Now Upload Files to Copilot on Windows 11; Here’s How
View quick summary
Microsoft is slowly adding new features to Copilot on Windows 11. Copilot has finally received the file upload capability on Windows 11. You can upload a range of documents including PDF, DOC, XLS, PPT, TXT, and more. Copilot seamlessly ingests the documents and answers from the provided document accurately. From analyzing financial sheets to summarizing PDFs and understanding private code documentation, the feature can be immensely helpful to all kinds of users.
Elon Musk Says xAI to Open-Source Grok This Week
View quick summary
After filing a lawsuit against OpenAI for becoming a closed-source company, Elon Musk has now announced that xAI's Grok model will be open-sourced this week. Musk founded xAI in March last year to rival OpenAI and released its first preview model in November. However, Grok has not been received well and it's highly prone to hallucination. Open-sourcing the model may help the company improve Grok on many fronts.
Microsoft’s Seeing AI Mobile App Can Help Visually Impaired Users
View quick summary
Microsoft launched the Seeing AI app with AI features during its Ability Summit 2024. The app is powered by AI and allows low-vision users to make sense of the world by hearing descriptions of scenes, images, products, currency, and more. You can install the app on Android and iOS for free. It's currently available in 33 languages, and Microsoft is planning to bring support for 36 languages by the end of 2024.
You Can Now Edit and Modify Google Gemini Responses; Here’s How
View quick summary
Google's latest update for Gemini allows users to modify and regenerate specific portions of a response according to their preferences. The feature can be used to replace portions of text with a different prompt or you can simply ask Gemini to regenerate selected text or make it shorter or longer. The new feature is available for everyone but is limited to the Gemini web app.
Microsoft is Holding a Surface, Windows & Copilot AI Event Later This Month
View quick summary
Microsoft has announced a new event for March 21st where the company is likely to unveil Windows 11 improvements, new Copilot features, and refreshed Surface devices. The company may release refreshed versions of Surface Laptop 6 and Surface Pro 10. Microsoft is also preparing to showcase an advanced version of Copilot with a new feature called "AI Explorer".
How to Download and Run Google Gemma AI Model on PC and Mac
View quick summary
If you have a low-end computer, you can download and run Google's open-source Gemma model on Windows, macOS, and Linux. The model is just 1.5GB in size and takes up around 1.4GB RAM. For creative tasks in English, the model does a good job while running offline. You can download LM Studio and load the model to start using it right away.
How to Access Claude 3 API for Opus and Sonnet Models (With Examples)
View quick summary
Anthropic has immediately released APIs for its Claude 3 models including Opus and Sonnet. The company says API access for the smallest Haiku model is coming pretty soon. While the API pricing for the Claude 3 models is high, users and developers are keen to test the Opus model, which according to Anthropic, beats GPT-4 and Gemini 1.0 Ultra. We have also added some code examples so go through our detailed tutorial.
Anthropic Announces Claude 3 AI Models; Beats GPT-4 and Gemini 1.0 Ultra
View quick summary
Anthropic has released a new family of Claude 3 models -- Opus, Sonnet, and Haiku. The largest and most capable Claude 3 Opus model beats GPT-4 and Gemini 1.0 Ultra in all major benchmarks. According to Anthropic, all three models support a context window of 200K tokens and deliver 99% accuracy with great recall. Plus, they come with vision capability as well.