AI - Page 2
Trending Stories
I Tried Out Gemini Live; It Can’t Compete with ChatGPT Advanced Voice Mode
View quick summary
Google's Gemini Live was touted as the answer to ChatGPT's Advanced Voice Mode. Turns out, Gemini Live is a glorified text-to-speech engine, backed by an LLM. It can't understand emotions, the mood of the speaker, or the tone and tenor of the speech. That said, the new Gemini Live experience supports interruptions, but the experience is broken.
Grok recently got the ability to generate images on X. However, it seems there are no safety guardrails to prevent users from generating harmful and offensive images. In our testing, X image generator continues to generate images depicting violence, drugs, and explicit images of celebrities and public figures.
How to Use the Add Me Feature on Pixel 9 Phones
View quick summary
The latest Google Pixel 9 series of phones have arrived with a new Add Me feature that uses on-device AI to add you to group photos efficiently. If you don't have a tripod or are too shy to ask a stranger to capture the photo, this can work wonders. To use the feature, head over to the native Pixel camera app -> tap on Add Me from the bottom panel -> capture a group photo without you in the frame and keep an empty space to fit yourself in -> have the second shot captured with just you in the frame filling that empty space. And, just like that, AI stitches the two pictures together to add you into those group photos.
Don’t Sleep on Grok 2.0; It’s Powerful But Controversial
View quick summary
xAI has released its powerful Grok 2.0 AI model in beta. I tested the model on many reasoning questions, and it surprised me with its exceptionally strong performance. However, the model doesn't seem to have any safety guardrails. To learn more, go through our article.
How to Use Gemini Live on Any Android Phone Right Now
View quick summary
Google's answer to OpenAI Advanced Voice Mode is finally here. Dubbed Gemini Live, it allows users to have a natural, free-flowing conversation with the AI model. You can interrupt it, pause the conversation, and resume it later on. Currently, Gemini Live is available in 10 voices. If you are an Android user, you can start using it today by subscribing to Gemini Advanced.
Flux vs Midjourney: Is There a New AI Image Generator Champ?
View quick summary
A new AI model for generating images has just dropped, and it's gaining huge traction among the AI community. The new Flux model is open-source and it beats Midjourney in generating photorealistic images. Check out our comparison between Flux and Midjourney to see the difference in visual quality.
ChatGPT Got a Secret Update Last Week, And It’s Performing At Its Best
View quick summary
OpenAI has quietly released an updated GPT-4o model on ChatGPT and it's called "chatgpt-4o-latest". The model improves performance in coding, reasoning, and other key areas. The updated ChatGPT model has again taken the first spot on the LMSYS leaderboard, outranking all other AI models.
How to Use Flux AI Image Generator For Free
View quick summary
If you are looking for a free Midjourney alternative, you must check out Flux. It's a free and open-source model for generating lifelike AI images. You can use it for free on HuggingFace, NightCafe, BasedLabs, etc.
Project Strawberry Explained: Is ChatGPT Getting a Huge Upgrade?
View quick summary
Project Strawberry is an internal OpenAI research project that is said to bring human-level reasoning capability. It's related to the Q* project, according to Reuters. Not only that, Project Strawberry can also plan and perform a series of actions unlocking agentic workflows. The project is said to be proficient in solving the hardest math problems.
What is Artificial General Intelligence (AGI)? Explained
View quick summary
AGI is the ultimate AI system that can match or surpass human capabilities, be it intellectual reasoning or performing complex tasks that require cognitive thinking, just like humans. OpenAI says that AGI can be achieved within the next 10 years, whereas other AI researchers believe that AGI is still two to three decades away. While AGI can lead to many scientific discoveries, experts warn that it can also be catastrophic to humanity.
What Is an NPU? Explained
View quick summary
Nearly all chipmakers are incorporating an NPU into their SoC. To unlock new AI experiences and features, a dedicated NPU is essential. It can perform AI operations at a breakneck pace without hitting the battery life. Most of the upcoming AI features rely on the NPU to process such requests locally on the device.
8 Best AI Tools for Video Editing in 2024
View quick summary
If you are looking for both traditional and new-gen video editors that bring effortless AI editing tools, check out our list. We have picked the best AI video editors that allow you to edit videos using text-based transcription, remove unwanted objects and noise, enhance speech, add translated voice, and much more.
Llama 3.1 vs ChatGPT 4o: Meta AI Is Not So Intelligent After All
View quick summary
Meta claims that its largest Llama 3.1 405B model beats GPT-4o in several key benchmarks. However, in our reasoning tests, it remains behind GPT-4o. In fact, in my testing, Llama 3.1 405B ranks below Claude 3.5 Sonnet and Gemini 1.5 Pro. For detailed results, follow our article.
X (Twitter) is Training Grok AI with Your Data; Here’s How to Disable It
View quick summary
X (formerly Twitter) is secretly training its Grok AI on X posts by default and without seeking user consent. Thankfully, you can disable AI training from the web version of X. The dedicated setting is not available on the X app. You can follow our article and opt out of AI training on your X posts.
What is SearchGPT, the ChatGPT Search Engine? Explained
View quick summary
After months of anticipation, OpenAI has finally unveiled its AI-powered ChatGPT search engine, known as SearchGPT. It's designed to answer fast using authentic sources and clear attribution. Users can also find all the sources and engage with the web results. That said, it's available in a limited prototype which means users have to join the waitlist to receive access.
8 Best AI Photo Editor Tools in 2024
View quick summary
If you are on the lookout for the best AI tools for photo editing, follow our curated list of apps and services. With these new AI photo editors, you can remove or add new objects, change the background, isolate the subject, make fine adjustments, and do much more. You can even fix blurry images and upscale them to a higher resolution.
10 Best AI Tools for Research: Consensus, Scite, Elicit & More
View quick summary
If you are looking for AI-powered tools for research purposes, we have curated the 10 best tools for academic research. These tools help you find relevant papers from a vast database, lets you extract key insights, find the general trend, and much more. From literature review to visualization of papers and chatting with local research papers, you can do all of it using these AI tools.
How to Use Llama 3.1 405B AI Model Right Now
View quick summary
Llama 3.1 405B is the largest and most capable model from Meta. The company has also open-sourced it. You can start chatting with the model on Meta AI and WhatsApp if you are from the US. And for the rest of the regions, you can head to HuggingChat or Groq to access the 405B model for free.
How to Fix ChatGPT Error in Moderation
View quick summary
ChatGPT shows an 'Error in Moderation' and refuses to generate a response when the AI model thinks that your prompt is sensitive and might be used to generate harmful content. In such a scenario, it's recommended to tweak your prompt in such a way that it's not flagged by the AI model. Apart from that, you can re-login on ChatGPT to see if the problem has been fixed or not.
Vidnoz AI: The Effortless Tool to Create Stunning AI Videos in Minutes
View quick summary
If you are looking to create professional-quality AI videos at a fraction of the cost, I would recommend checking out Vidnoz AI. For budding content creators, Vidnoz AI offers hundreds of AI avatars, video templates, and AI voices. You can even generate AI scripts with a simple prompt. And if you want to personalize your video, you can create your own custom AI avatar and clone your voice as well.
Boost Your Image Quality to the Max with Aiarty Image Enhancer
View quick summary
If you are looking for an AI tool to upscale images, I would recommend Aiarty Image Enhancer. It's a fantastic program that generates more details to fix pixelation and removes blurriness from images. Overall, it's an image restoration program that actually works. Go through our guide to understand all the key features.
Gemma 2 vs Llama 3: Best Open-Source AI Model?
View quick summary
Google has released its latest open-source model called Gemma 2 27B. Until now, Meta ruled the open-source space with its Llama family of models. So in this article, I have compared Gemma 2 27B with Llama 3 70B to evaluate their prowess in a variety of tasks. Despite a much smaller size, Gemma 2 27B performs well, except in reasoning tests.
Gemini Now Supports File Uploads & Here’s How It Stacks Up Against ChatGPT
View quick summary
You can now upload documents to Gemini and ask questions or perform data analysis. Keep in mind, you need a Gemini Advanced subscription to be able to upload local files. In my testing, Gemini Advanced performed as good as ChatGPT. But ChatGPT offers file uploads for free and there is support for a wide variety of file formats as well.
Claude 3.5 Sonnet vs ChatGPT 4o vs Gemini 1.5 Pro: Anthropic is Back
View quick summary
Anthropic's new Claude 3.5 Sonnet model has performed remarkably well on benchmarks. So we have compared it with ChatGPT 4o and Gemini 1.5 Pro. In reasoning tests, Claude 3.5 Sonnet shows great intelligence and also does a great job at following user instructions. Even in coding, Claude 3.5 Sonnet generates bug-free code in comparison to ChatGPT 4o and Gemini 1.5 Pro.
What is Safe Superintelligence and What It Does
View quick summary
While we are currently using AI chatbots like ChatGPT and Gemini, more powerful AI systems are on the horizon. AI agents, AGI, and Superintelligence are the next frontier of highly-powerful AI systems. Superintelligence, in particular, is said to be even more powerful than humans and AGI. To learn more about Superintelligence, go through our explainer.
Here’s Why Elon Musk is Wrong About Apple Intelligence
View quick summary
At the WWDC 2024 event, the Cupertino giant introduced Apple Intelligence and announced ChatGPT integration into Apple devices. Elon Musk took to X and went on a tirade against Apple for integrating ChatGPT and threatened to ban all Apple devices at his companies. That said, it appears Musk's argument is misplaced as Apple has developed its own AI models. Only when Apple's AI model fails to come up with a proper response, it seeks the user's permission before routing the query to ChatGPT.
Apple Intelligence AI Models Compete with ChatGPT 3.5 Turbo
View quick summary
Apple has developed two capable AI models to power Apple Intelligence features on iPhone, iPad, and Mac. For on-device processing, there is a small 3B model, and for complex tasks, there is a large, server-class model. The smaller 3B model performs better than Microsoft's Phi-3-mini and Google's Gemma 2B and 7B models. And Apple's large server model does better than GPT-3.5 Turbo.
Apple Private Cloud Compute: What It Means for Your Privacy
View quick summary
To process AI requests on the cloud, Apple has built its own cloud server stack called Private Cloud Compute. Apple has built the cloud server using custom Apple silicon for quick AI inferencing. The Cupertino giant says that none of the personal user data is stored in the PCC system and not even Apple can access it.
Microsoft SwiftKey: AI Keyboard Revolutionizes the Way We Type
View quick summary
In this piece, I discuss and highlight some of the key features of the Microsoft SwiftKey keyboard app, including clipboard sync with Windows devices, customizable AI stickers, a variety of themes, a built-in language translator, and a synced clipboard with Windows PCs.
12 Best GPTs for ChatGPT to Use in 2024
View quick summary
ChatGPT offers you tons of GPTs to install and use from the GPT store. However, there are just way too many of them with similar ideas, and you might end up getting confused. So, we have handpicked the best ones to use, and the Canva GPT integration easily tops that list. Besides that, Scholar GPT takes care of your research needs. You also have super helpful educational GPTs like Khan Academy's Tutor Me tool and Code Copilot. In addition, you can also make use of the Mia AI GPT to get your very own therapist.
Data Science vs Artificial Intelligence: Understanding the Difference
View quick summary
Data Science and Artificial Intelligence are closely related disciplines, but their difference lies in their scope, methodologies, and objectives. While Data Science is limited to data interpretation, AI models are aimed at creating intelligent systems that can perform tasks similar to humans.
What is a Large Language Model (LLM): Explained
View quick summary
A large language model is essentially a deep-learning algorithm that is designed to understand, process, and generate human language. It predicts the next word in a sentence based on the principles of probability. LLMs are trained on an extensive dataset of textual data from the internet, books, archives, etc.
AI Image Detection: How to Detect AI-Generated Images
View quick summary
C2PA has developed a powerful tool called Content Credentials to detect AI-generated images. If the images have been modified or metadata has been removed, Content Credentials can still detect AI images and their source. Other than that, you can find inconsistencies in AI images and check for watermarks.
This AI Sound Effect Generator Is a Cheat Code Every Creator Needs
View quick summary
Elevenlabs has officially released their AI sound effects generator and I decided to give it a try. I used the free version, which gives you a 10,000 quota per month to use. Every generation takes up 200 of the provided quotas. Meanwhile, paid plans start at $5 per month. The generator works best with non-complex prompts and the free version does not provide the best quality either. I realized that the best way to put it to some good use is by generating sound effects separately and putting them together with audio editing.
Why was OpenAI’s Sam Altman Fired? These New Details Worry Me
View quick summary
Last year, OpenAI fired its CEO Sam Altman, only to appoint him back to the position a couple of days later. Now, some more details from ex-OpenAI board members, Helen Toner and Tasha McCauley, have revealed the actual reasons behind Altman's firing and that has understandably raised some big concerns in the community. From "psychological abuse" to secrecy and lies, Altman has been accused of it all by the ex-board members.
Meta Trains Its AI on Your Instagram and FB Photos; Here’s How to Opt Out
View quick summary
Meta is using images and other data from Instagram and Facebook to train its AI models. It's opt-in by default and users are being notified now. You can ask Meta to stop training on your personal data, but it will only apply to data gathered from third-party services. Meta has also made it harder to opt out by asking for evidence to further process the request.
Google’s AI is Losing It! Asks Users to Eat Rocks, Add Glue to Pizza & More
View quick summary
Google's new AI Overview experience in Search is rolling out to users in the US. People are complaining about the misinformation AI Overview is generating. We have collated some of the replies generated by Google's AI Overview. Essentially, Google has redefined the relationship of a search engine provider and taken the role of a publisher.
Gemini 1.5 Flash is an Underrated Gem You Need to Try Right Now: Here’s How
View quick summary
At the I/O 2024, Google unveiled many AI models, but Gemini 1.5 Flash remained under the radar. It's a lightweight AI model that delivers remarkable speed and efficiency with support for multimodal reasoning and a large context window of 1 million tokens. It's also very cheap to run. You can try the model on Google AI Studio for free and without any waitlist.
ChatGPT 4o vs ChatGPT 4: Premium Features for Free?
View quick summary
If you are wondering whether you should subscribe to ChatGPT Plus or keep using the free ChatGPT version, read our extensive comparison. We have done a thorough comparison of ChatGPT 4o and ChatGPT 4 models. In addition, we have laid out the differences between the free and the paid version of ChatGPT.
I Made a Game Using ChatGPT 4o in Seconds and You Can Too
View quick summary
If you want to create a simple game or a web app, ChatGPT 4o can be of immense help. It created an arcade game in Python and I could run it without any errors on the first try. ChatGPT 4o further improved the game with a replay functionality and added a score system as well.
ChatGPT 4o vs Gemini 1.5 Pro: It’s Not Even Close
View quick summary
We have thoroughly compared ChatGPT 4o with Gemini 1.5 Pro on a variety of tasks including reasoning, code generation, multimodal tests, and more. In our tests, ChatGPT 4o performed much better than the Gemini 1.5 Pro model. To understand the capabilities of both models, go through our entire comparison.