#AI - Page 6
429 Stories
AI Models in India to Require Govt Approval; What are the Implications?
The Indian IT Ministry (MeitY) has issued a new advisory for tech companies offering AI models and services in India. The advisory asks tech firms to get approval before deploying "untested" AI models in India. The government later clarified that the advisory applies to large companies like Google, OpenAI, and Microsoft, and not to startups. The advisory also asks companies to embed permanent metadata in generated data so that the first originator can be easily identified.
How to Sign Up for Gemini 1.5 Pro Waitlist to Get Early Access
You can sign up for the Gemini 1.5 Pro waitlist via Google AI Studio and get early access to the flagship model with a context window of 1 million tokens. The model is currently in preview, so Google is offering access for free to test and evaluate the model. That said, there is no API available for Gemini 1.5 Pro yet, just like Gemini 1.0 Ultra.
Elon Musk Sues OpenAI and Sam Altman Over AGI Fear
In a surprising turn of events, Elon Musk sued OpenAI and its CEO, Sam Altman, in the San Francisco Superior Court on Thursday. Musk claims that OpenAI has become a closed-source company that develops new technologies only to maximize profits. Further, Musk says that Microsoft is leveraging its power to influence OpenAI's operations. Musk is seeking an injunction to stop OpenAI and Microsoft from profiting from AGI technology.
We got our hands on the Gemini 1.5 Pro model via Google AI Studio, and after running it through a multitude of tests, we can say that Google has finally delivered an immensely powerful AI model. It's easily on par with OpenAI's GPT-4 model and surpasses Google's largest model, Gemini 1.0 Ultra. It's excellent at advanced reasoning, can process videos, handles a large corpus of data in a single context window, and does much more. Read our detailed comparison between Gemini 1.5 Pro, Gemini 1.0 Ultra, and GPT-4.
After Gemini generated some inaccurate and offensive images, Google has been accused of anti-white bias by critics from many quarters. In response, Google has temporarily turned off image generation of people in Gemini. Moreover, many accuse Google of aggressively tuning the model to represent diversity, which seems to have backfired. So what explains this debacle and Google's overall approach to AI? Read on to find out.
You Can Now Set Copilot As the Default Assistant on Android; Here’s How
Microsoft's Copilot AI can now be set as the default assistant on Android devices. The latest beta version allows users to replace Google Assistant with Copilot, which can be triggered from any screen. However, it currently lacks voice activation and screenshot capabilities.
Microsoft Eclipses OpenAI; Signs Multi-Year Deal with Mistral AI
Microsoft is looking to expand its AI investment beyond OpenAI. The software giant has partnered with a French AI startup called Mistral AI and invested close to $2 billion to accelerate their new LLMs on Azure's infrastructure. Mistral AI's new model, Mistral Large, is already available on Azure AI Studio for customers and developers. The Mistral Large model supports advanced reasoning, understands several international languages, and comes with native function calling support.
Qualcomm AI Hub Released at MWC 2024; Run AI Models on Your Device
Qualcomm has finally released its AI Hub and AI Stack to run AI models locally on Snapdragon platforms, be it your smartphone or PC. The chipmaker has optimized over 75 AI models to run locally on your device. You can generate text and images, enhance low-light images, perform image segmentation, and do much more locally. Qualcomm's AI Engine delivers 3x-4x better performance than Intel's latest Core Ultra processors.
Meet Groq, a Lightning Fast AI Accelerator that Beats ChatGPT and Gemini
Groq is a company formed by ex-Google TPU engineers who have developed an LPU (Language Processing Unit) that can generate outputs at a blistering speed. It can generate over 500 tokens per second with a 7B model and close to 250 tokens per second with a 70B model. By comparison, ChatGPT and Gemini generate responses at 50 to 60 tokens per second. The Groq LPUs are said to be highly performant, with much lower latency and minimal energy consumption. With the introduction of LPUs, expect instant interaction with AI models soon.
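To put those throughput figures in perspective, here is a rough back-of-the-envelope sketch in Python. It only divides a response length by the tokens-per-second rates quoted above. The 1,000-token response length and the function name `generation_seconds` are assumptions made for illustration, and the 50 tokens per second used for ChatGPT and Gemini is simply the lower end of the quoted range. Real-world latency also depends on factors not covered here, such as network delay and prompt processing.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Return the time needed to stream num_tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Rates taken from the summary above (tokens per second).
RATES = {
    "Groq LPU, 7B model": 500,
    "Groq LPU, 70B model": 250,
    "ChatGPT / Gemini (lower end of 50-60)": 50,
}

# Hypothetical response length, chosen only for illustration.
RESPONSE_TOKENS = 1000

for name, rate in RATES.items():
    seconds = generation_seconds(RESPONSE_TOKENS, rate)
    print(f"{name}: {seconds:.1f} s for {RESPONSE_TOKENS} tokens")
```

Under these assumptions, a response that takes about 20 seconds at 50 tokens per second would take about 2 seconds at 500 tokens per second.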
Google Launches Gemma, a Family of Open-source Models
After launching Gemini 1.0 Ultra and Gemini 1.5 Pro in the last few weeks, Google is back with new AI models that are actually open-source. Dubbed "Gemma", these are small open-source models which can run even on consumer laptops. They come in two sizes, one with 2B parameters and another with 7B parameters. The models are largely trained in the English language and are suited for text generation, summarization, reasoning, and Q&A. Google has given a commercial license to Gemma models subject to certain prohibited use policies.
OpenAI Launches Sora: A Groundbreaking Text-to-Video AI Model
OpenAI has unveiled its remarkable text-to-video model, Sora, which can generate AI videos up to 1 minute long. The generated videos are highly detailed and have a resolution of up to 1080p. Unlike existing solutions, Sora-generated videos don't have distortions in the scene, which has led many to speculate that it's trained on game engine simulations like Unreal Engine 5. OpenAI says it's a diffusion model built on the transformer architecture. The Sora model is currently going through safety checks and is not yet available to regular users.
Google Introduces Gemini 1.5 Pro with a Massive 1 Million Context Window
After releasing Gemini Advanced with the Ultra 1.0 model last week, Google has announced its next-generation model called Gemini 1.5 Pro. The new model is built on the Mixture-of-Experts (MoE) architecture and supports a massive context window of up to 1 million tokens. It performs nearly the same as the much larger Gemini 1.0 Ultra model. The AI model is currently in limited preview and developers can test it on AI Studio after joining the waiting list.
Nvidia Launches Chat with RTX, an AI Chatbot You Can Run Locally on Windows
Nvidia has released a new tool called Chat with RTX. It lets Windows PCs with an Nvidia RTX 30-series or RTX 40-series GPU run an AI chatbot locally. You can feed the AI custom data, including documents, PDFs, and even YouTube videos. The download includes the Mistral and Llama-2 open-source LLMs. The tool requires 16GB of RAM and a GPU with at least 8GB of VRAM.
What is Gemini Advanced and How to Get Subscription
Google has launched a subscription plan called Gemini Advanced that is powered by the Gemini Ultra 1.0 model. Unlike the free Gemini (formerly Bard), Gemini Advanced is much better at highly complex tasks and performs close to OpenAI's GPT-4 model. It costs $20 per month, but you get a two-month free trial. Gemini Advanced also bundles 2TB of storage and Google One benefits. And soon, Gemini Advanced users will be able to access Gemini AI in apps like Gmail, Docs, and more.
Amazon Launches Rufus, an AI Assistant in Its Shopping App
The e-commerce giant, Amazon, has added an AI shopping assistant called Rufus to its shopping app. It's an AI chatbot trained on Amazon's vast product catalogs, customer reviews, community Q&A, and information sourced from the web. It can answer all your questions based on your shopping needs, find personalized products, offer recommendations for various occasions, and much more. The Rufus AI assistant is currently being rolled out to a small subset of users in the US only.
How to Generate AI Images Using Google Bard
Google has finally added image generation to Bard, and you can create images for free. Simply start your prompt with "create an image of" or "generate an image of" and Bard will generate two images at once. To power Bard's image generation, Google is using its in-house Imagen 2 AI model, along with the ImageFX tool for added guardrails. Right now, it only supports prompts in English, and users in some regions, such as the UK, Switzerland, and the EEA, can't access the feature yet.
8 Best AI Photo Enhancers in 2024 (Free and Paid)
There are a number of AI photo enhancers that let you restore old photos, upscale images, add more clarity and detail, expand images, and more. We have included the eight best AI tools to improve images including popular ones like Remini, Lensa, Clipdrop, and several others. We have also included some tools that you can run locally on your computer if you don't want to upload your images to the cloud.
A recent MIT study eases job automation fears, revealing that only 23% of certain jobs can be cost-effectively replaced by AI. Economic hurdles and the high upfront cost of AI installation and maintenance contribute to a prolonged timeline. The study further says that human labor will remain a valued asset at least until 2046.
Great, AI Can Forge Your Handwriting Now and I’m Concerned!
A group of researchers from Mohamed Bin Zayed University of Artificial Intelligence in Abu Dhabi has developed a new AI tool to imitate human handwriting. Apparently, it needs just a couple of paragraphs to analyze the handwriting and copy it. Previously, a generative adversarial method was used to mimic someone's handwriting. Although it worked well, it could not capture the subtle elements of a person's handwriting. That's why the researchers adopted a vision transformer-based approach to handwritten text image generation. With this, the AI can be trained to accurately capture the essence of each person's handwriting.
OpenAI’s Sam Altman Is Raising Money to Set Up AI Chip Factories
According to a new Bloomberg report, OpenAI CEO Sam Altman is looking to raise billions to establish his own network of AI chip factories. He predicts that industry-leading foundries like TSMC, Intel, and Samsung will not be able to meet the needs of AI technologies in the future. As per the report, he is already in active discussions with investors like G42 and SoftBank Group, with the funding sought from G42 alone amounting to around $8 billion or more.
Microsoft Sets 16GB RAM & 40 TOPS of AI Computing Speed as Standard for New ‘AI PCs’
Microsoft is said to be putting new hardware requirements in place for future PCs. A minimum AI computing speed of 40 TOPS has been mentioned as the requirement for a PC to be branded as an AI PC. In addition, Microsoft is setting 16GB of RAM as the minimum for AI PCs. The newly released Intel Core Ultra processors do not meet the 40 TOPS requirement.
10 Best AI Apps for iPhone in 2024
A good AI app for iPhone can add more bells and whistles to your device's functionality and capabilities. Today, you can find a bunch of paid and free AI apps for iOS devices that are easy to use and loaded with mind-boggling features. We've listed the top 8 AI apps for iPhone in 2024, of which Otter, AI Chatbot, and Lensa AI secured the top 3 spots.
CES 2024: The Rabbit R1 Is a Walkie-Talkie AI Assistant That Listens
Rabbit R1 is a new AI assistant made by a startup. It was launched at CES 2024, and pre-orders have gone live with the device carrying a $199 price tag. It is meant to be a 'simple computer', and the AI model it uses can navigate user interfaces (UI) to do various tasks, such as booking you an Uber cab or navigating Discord to create an AI image on Midjourney, among other things.
Google Seeks Your Input on New Bard Features You Want in 2024
Google has been working hard to make Bard as feature-rich as ChatGPT. A PM at Google is now asking Reddit users to share their 2024 Bard wishlist for new features and extra capabilities. Reddit users point out that Bard needs a separate app for Android and iOS, just like ChatGPT. In addition, Google needs to quell the hallucination issue and integrate Bard with Google Assistant, among other things.
GitHub Copilot Chat is Now Generally Available to All Users
GitHub has made Copilot Chat generally available to all users. The company has even made it free for students, verified teachers, and maintainers of popular open-source projects. You can start using it in Visual Studio or Visual Studio Code. You get features such as code completion, contextual chat, in-line code generation, unit test generation, security vulnerability detection, and more.
Microsoft Is Making AI PCs with Its 2024 Surface Lineup
Microsoft is reportedly working on AI-centric laptops, and we may see them in action with the Surface Pro 10 and Surface Laptop 6 next year. They will be powered by Qualcomm's Snapdragon X Elite chipset and Intel's 14th-gen processors. With the dedicated NPU, Microsoft would be able to bring new AI features and experiences, hopefully with Windows 12's release. It's also being reported that Surface laptops will sport a Copilot button on the keyboard.
The New York Times Sues OpenAI and Microsoft Over Copyright Infringement
The New York Times has sued OpenAI and Microsoft for using millions of Times articles to train their AI models without paying any licensing fee. In the lawsuit, the NYT alleges that GPT-4 reproduces NYT articles verbatim, or with only minor changes, without any attribution. The NYT is seeking "billions" in damages from both OpenAI and Microsoft over the unlawful use of its journalistic work. The major American newspaper had tried to negotiate a deal back in April, but it didn't go anywhere.
How to Access and Use Google Gemini API Key (with Examples)
Recently, Google released API access to its Gemini Pro model. Currently, the company is offering text-only and text-and-vision models based on Gemini Pro. The best part is that Google is allowing users to test the API for free and without setting up Google Cloud billing, at least for now. To show you how to set up and use a Gemini API key, we have included three coding examples that showcase the API usage. You can also test Gemini Pro's multimodal capability through the API, which is not yet available in Bard.
8 Best AI Image Upscaler Tools in 2024 (Free and Paid)
There are several AI image upscalers available online, but we have selected the eight best tools that can boost image resolution by up to 16x free of cost. Beyond upscaling, these tools also fill in missing pixels and fix blurry images with just one click. We have included tools from Stability AI, Topaz Labs, Upscale Media, and others to offer a wide range of services.
Eiichiro Oda Asked AI to Compose a One Piece Song; Give It a Listen!
AI fever has been the biggest thing this year, and One Piece author Eiichiro Oda doesn't seem to be out of the loop. With the assistance of an AI music creation tool, he produced a song about the One Piece series titled "YO-HO-HO We Pirates", which the team released on their official X account.
Google Gemini AI: Multimodal, GPT-4 Competitor, and More
Google has finally released a true multimodal AI model called Gemini. While multimodal features are not live yet, you can use Bard to check out the Gemini Pro model. Pixel users can also experience on-device AI with the Gemini Nano update. The most powerful Gemini Ultra model is quite impressive and beats the GPT-4 model on several benchmark tests. In multimodal tests too, Gemini Ultra dethrones the GPT-4V model. However, the Ultra model has not been launched yet and is due to go live early next year.
Google Is Prepping an AI Model That Will Tell More About Your Life
Google could soon introduce a new AI model, Project Ellmann, which could get an insight into your life and answer previously 'impossible' questions.
Google Launches Gemini, Its Most Powerful AI Model Yet
Google's newly launched AI model is called Gemini and is going to be a fully multimodal model. This allows it to analyse text, images, audio, and more, and to produce seamless outputs. The model has already beaten GPT-4 on certain benchmarks. Gemini AI will come in three different variations. The model is scheduled to launch next year, but a version of it is already live on Google Bard.
Google Delays Its GPT 4-Rival ‘Gemini AI’ to Next Year
We won't be seeing Google's most powerful AI launch this year. Instead, Google's Gemini AI will launch in early 2024. It could arrive as early as January 2024, particularly for Google's Cloud enterprise customers. Gemini is the company's most innovative AI model and is expected to include features such as global language support. However, the latest report indicates it has issues with processing non-English queries. Now, planned Google events across the US (NYC, Washington, California) that would have unveiled the next-gen Gemini AI have reportedly been canceled.
Doritos Silent Lets You Munch, Crunch, and Game Without the Noise!
Doritos Silent is a new app made by the highly renowned tortilla chips brand. The app uses an AI model trained on over 5,000 sounds of 500+ people eating crunchy Doritos nacho chips. After enabling the Crunch Cancellation feature, you can keep gaming in the team voice chat and eat your chips without causing any annoying disturbances. It also supports any voice chat app, including Discord, Microsoft Teams, Zoom, and others.
Thousands of RTX 4090 GPUs Are Being Rebuilt for AI Computing in China
US sanctions imposing an AI chip ban on China, among other countries, have resulted in Nvidia's top-of-the-line RTX 4090 graphics cards being banned from sale there. But new factories in China are reportedly taking the board components (GPU chip, VRAM, etc.) to build their own redesigned GPUs, made specifically for AI compute!
WhatsApp Beta Adds Shortcut for AI-Powered Chats!
WhatsApp's AI-powered chat feature will roll out in a stable update in the coming months. But in the beta version, we are now seeing a new shortcut button for easily accessing the AI-powered chat feature in WhatsApp!
Sony Cameras to Fight AI with Birth Certificates for Images
AI-generated photos are becoming a problem for media reporting. To curb this, Sony is introducing a new digital signature feature in its cameras. The feature lets the authenticity of a photo be verified, addressing a big problem in the photojournalism industry: fake, manipulated photos.