⚠️ This post links to an external website. ⚠️
This post presents a comprehensive comparison of 429 AI models across intelligence, performance, and pricing metrics. Gemini 3.1 Pro Preview and GPT-5.4 (xhigh) lead in intelligence, while Mercury 2 and Granite 3.3 8B offer the fastest output speeds, at 794 and 435 tokens per second, respectively.
The analysis reveals significant price-quality variance across models, with Gemma 3n E4B and LFM2 24B A2B offering the lowest costs at $0.03 and $0.05 per million tokens. Latency performance is dominated by Gemini 2.5 Flash-Lite variants, while Llama 4 Scout provides the largest context window at 10 million tokens.
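To put the quoted figures in concrete terms, here is a minimal Python sketch that turns an output speed into a generation time and a per-million-token price into a cost. The workload size and the helper names are hypothetical, chosen only for illustration. The sketch also assumes a flat price for every token and ignores latency before the first token, which the linked comparison reports separately.

```python
# Figures quoted in the summary above; everything else is an assumption.
OUTPUT_SPEED_TOKENS_PER_S = {"Mercury 2": 794.0, "Granite 3.3 8B": 435.0}
PRICE_USD_PER_MILLION_TOKENS = {"Gemma 3n E4B": 0.03, "LFM2 24B A2B": 0.05}

# Hypothetical workload size, not taken from the source.
WORKLOAD_TOKENS = 100_000


def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Time to stream num_tokens at a steady output speed (no startup latency)."""
    return num_tokens / tokens_per_second


def cost_usd(num_tokens: int, usd_per_million_tokens: float) -> float:
    """Cost of num_tokens at a flat per-million-token price."""
    return num_tokens / 1_000_000 * usd_per_million_tokens


for model, speed in OUTPUT_SPEED_TOKENS_PER_S.items():
    seconds = seconds_to_generate(WORKLOAD_TOKENS, speed)
    print(f"{model}: {seconds:.1f} s for {WORKLOAD_TOKENS:,} tokens")

for model, price in PRICE_USD_PER_MILLION_TOKENS.items():
    dollars = cost_usd(WORKLOAD_TOKENS, price)
    print(f"{model}: ${dollars:.4f} for {WORKLOAD_TOKENS:,} tokens")
```

The speed and price figures belong to different models, so the two loops are kept separate rather than combined into a single cost-per-second estimate.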
continue reading on artificialanalysis.ai
If this post was enjoyable or useful for you, please share it! If you have comments, questions, or feedback, you can send them to my personal email. To get new posts, subscribe using the RSS feed.