AI models are being cranked out at a dizzying pace, by everyone from Big Tech companies like Google to startups like OpenAI and Anthropic. Keeping track of the latest ones can be overwhelming.
Adding to the confusion is that AI models are often promoted based on industry benchmarks. But these technical metrics often reveal little about how real people and companies actually use them.
To cut through the noise, TechCrunch has compiled an overview of the most advanced AI models released since 2024, with details on how to use them and what they're best for. We'll keep this list updated with the latest launches, too.
There are literally over a million AI models out there: Hugging Face, for example, hosts over 1.4 million. So this list might miss some models that perform better, in one way or another.
AI models released in 2025
Google Gemini 2.5
Gemini 2.5 Pro Experimental, a reasoning model, excels at building web apps and code agents according to Google. It underperforms on one popular coding benchmark compared to Claude Sonnet 3.7, however. The model requires a $20 monthly Gemini Advanced subscription.
ChatGPT-4o image generator
OpenAI has upgraded its existing GPT-4o model to generate images, not just text. The souped-up model soon went viral for transforming images into Studio Ghibli-style anime, despite obvious copyright concerns. Accessing GPT-4o requires, at minimum, a $20 per month ChatGPT Plus subscription.
Stability AI's Stable Virtual Camera
Image generation startup Stability AI has launched a model that the company says can generate 3D scenes and camera angles from a single 2D image. However, it still struggles with scenes featuring more complex elements like humans and moving water. The model is available for noncommercial research use on Hugging Face.
Cohere's Aya Vision
Cohere released a multimodal model called Aya Vision that it claims is best in class at doing things like captioning images and answering questions about photos. It also excels in languages other than English, unlike other models, Cohere claims. It is available for free on WhatsApp.
OpenAI's GPT 4.5 "Orion"
OpenAI calls Orion its largest model to date, touting its strong "world knowledge" and "emotional intelligence." However, it underperforms on certain benchmarks compared to newer reasoning models. Orion is available to subscribers of OpenAI's $200-per-month plan.
Claude Sonnet 3.7
Anthropic says this is the industry's first "hybrid" reasoning model, because it can both fire off quick answers and really think things through when needed. It also gives users control over how long the model can think for, per Anthropic. Sonnet 3.7 is available to all Claude users, but heavier users will need a $20-per-month Pro plan.
xAI's Grok 3
Grok 3 is the latest flagship model from Elon Musk-founded startup xAI. It's claimed to outperform other leading models on math, science, and coding. The model requires X Premium (which is $50 per month). After one study found Grok 2 leaned left, Musk pledged to shift Grok to be more "politically neutral," but it's not yet clear whether that's been achieved.
OpenAI o3-mini
This is OpenAI's latest reasoning model and is optimized for STEM-related tasks like coding, math, and science. It's not OpenAI's most powerful model but because it's smaller, the company says it's significantly lower cost. It is available for free but requires a subscription for heavy users.
OpenAI Deep Research
OpenAI's Deep Research is designed for doing in-depth research on a topic with clear citations. This service is only available with ChatGPT's $200-per-month Pro subscription. OpenAI recommends it for everything from science to shopping research, but beware that hallucinations remain a problem for AI.
Mistral Le Chat
Mistral has launched app versions of Le Chat, a multimodal AI personal assistant. Mistral claims Le Chat responds faster than any other chatbot. It also has a paid version with up-to-date journalism from the AFP. Tests from Le Monde found Le Chat's performance impressive, although it made more errors than ChatGPT.
OpenAI Operator
OpenAI's Operator is meant to be a personal intern that can do things independently, like help you buy groceries. It requires a $200-per-month ChatGPT Pro subscription. AI agents hold a lot of promise, but they're still experimental: A Washington Post reviewer says Operator decided on its own to order a dozen eggs for $31, paid with the reviewer's credit card.
Google Gemini 2.0 Pro Experimental
Google says its much-awaited flagship Gemini model excels at coding and understanding general knowledge. It also has a super-long context window of 2 million tokens, helping users who need to quickly process massive chunks of text. The service requires (at minimum) a Google One AI Premium subscription of $19.99 a month.
AI models released in 2024
DeepSeek R1
This Chinese AI model took Silicon Valley by storm. DeepSeek's R1 performs well on coding and math, while its open source nature means anyone can run it locally. Plus, it's free. However, R1 integrates Chinese government censorship and faces rising bans for potentially sending user data back to China.
Gemini Deep Research
Deep Research summarizes Google's search results in a simple and well-cited document. The service is helpful for students and anyone else who needs a quick research summary. However, its quality isn't nearly as good as an actual peer-reviewed paper. Deep Research requires a $19.99 Google One AI Premium subscription.
Meta Llama 3.3 70B
This is the newest and most advanced version of Meta's open source Llama AI models. Meta has touted this version as its cheapest and most efficient yet, especially for math, general knowledge, and instruction following. It is free and open source.
OpenAI Sora
Sora is a model that creates realistic videos based on text. While it can generate entire scenes rather than just clips, OpenAI admits that it often generates "unrealistic physics." It's currently only available on paid versions of ChatGPT, starting with Plus, which is $20 a month.
Alibaba Qwen QwQ-32B-Preview
This model is one of the few to rival OpenAI's o1 on certain industry benchmarks, excelling in math and coding. Ironically for a "reasoning model," it has "room for improvement in common sense reasoning," Alibaba says. It also incorporates Chinese government censorship, TechCrunch testing shows. It's free and open source.
Anthropic's Computer Use
Claude's Computer Use is meant to take control of your computer to complete tasks like coding or booking a plane ticket, making it a predecessor of OpenAI's Operator. Computer Use, however, remains in beta. Pricing is via API: $0.80 per million tokens of input and $4 per million tokens of output.
xAI's Grok 2
Elon Musk's AI company, xAI, has launched an enhanced version of its flagship Grok 2 chatbot it claims is "three times faster." Free users are limited to 10 questions every two hours on Grok, while subscribers to X's Premium and Premium+ plans enjoy higher usage limits. xAI also launched an image generator, Aurora, that produces highly photorealistic images, including some graphic or violent content.
OpenAI o1
OpenAI's o1 family is meant to produce better answers by "thinking" through responses using a hidden reasoning feature. The model excels at coding, math, and safety, OpenAI claims, but has issues with trying to deceive humans, too. Using o1 requires subscribing to ChatGPT Plus, which is $20 a month.
Anthropic's Claude Sonnet 3.5
Claude Sonnet 3.5 is a model Anthropic claims is best in class. It's become known for its coding capabilities and is considered a tech insider's chatbot of choice. The model can be accessed for free on Claude, although heavy users will need a $20 monthly Pro subscription. While it can understand images, it can't generate them.
OpenAI GPT 4o-mini
OpenAI has touted GPT 4o-mini as its most affordable and fastest model yet, thanks to its small size. It's meant to enable a broad range of tasks like powering customer service chatbots. The model is available on ChatGPT's free tier. It's better suited for high-volume simple tasks than for more complex ones.
Cohere Command R+
Cohere's Command R+ model excels at complex retrieval-augmented generation (or RAG) applications for enterprises. That means it can find and cite specific pieces of information really well. (The inventor of RAG actually works at Cohere.) Still, RAG doesn't fully solve AI's hallucination problem.


