
AI in ’25 was insane. Are you ready for ’26?
That time of the year again!
We've been covering AI as it happens all year.
Time to pause, reflect, and put it on a podium.
Starting with image generation.
It saw the most rapid improvements this year. We've been talking about it lately, so it made sense to close the loop.
January: AI couldn't spell words without turning them into gibberish. December: Production-quality outputs with perfect typography in seconds.
I tested 12 models this year and the winners are as follows.
🎁 A quick gift before we begin
We put together an AI Survival Hackbook for 2026: a practical guide to how AI changed work between 2019 and 2025, what will matter next year, and how professionals are quietly using AI for leverage (not hype).
It includes:
Clear breakdowns of what shifted in 2025
Ready-to-use prompts & workflows
Real automation ideas you can set up
It’s free for the next two weeks.
Best for Text Rendering
This category has a clear specialist.
Ideogram 3.0 launched in March specifically to solve typography with native text rendering.
As of today, several models can reproduce text in images accurately, so this one came down to a tiebreaker. I chose Ideogram because it goes beyond just getting the text right.
For logos, posters, infographics or anything requiring readable text: Ideogram is killing it.
Caveat: Text rendering is all Ideogram is good at. Photorealism and artistic work? Read further.
Honorable mention: GPT Image 1.5. This could have gone to either Nano Banana Pro or GPT Image 1.5, but OpenAI deserves credit here.
Best for Photorealism
Look at the skin texture. The lighting. The way shadows fall.
Nano Banana Pro is really good at generating believable images.
The technical reason: Nano Banana Pro is built on Gemini 3 Pro, which generates "thought images" before the final output. The result is better compositional logic.
Runner-up: FLUX.2 [pro]
Best for Artistic/Creative Work
I love that Midjourney keeps winning this.
They've continued to dominate this area.
Nothing else comes close to it for pure aesthetic quality. The color grading, the composition.
V7's Draft Mode is a great add-on. It is 10× faster and costs half as much.
The caveat (there is always one): Text rendering is still terrible.
Runner-up: Stable Diffusion
Best Value (Free/Open Source)
Fully open source. Runs locally. Zero cost per image once you have the hardware.
95,000+ models on Hugging Face, a massive community, and extensive tooling support (ComfyUI, A1111 WebUI).
Runner-up: FLUX.2 [dev]. Better quality, but the model weights carry a non-commercial license (outputs are commercially usable).
Most Improved

Google went from irrelevant to leader in 86 days, counting from the public launch to Nano Banana Pro.
August 12: Anonymous model "nano-banana" hits #1 on LMArena.
August 26: Public launch. 3D figurine trend explodes.
September: 23 million new users. 500 million images generated. Gemini app #1 on App Store.
November 20: Nano Banana Pro.
Runner-up: Black Forest Labs (FLUX 1.1 → FLUX 2, 12B → 32B parameters), but Google's consumer adoption was more impressive.
Bust of the Year
DALL-E 3 just became irrelevant.
Elo ~984. 51% win rate (basically ties against average competition).
The model that pioneered mainstream AI image generation now loses to Flux, Midjourney, Nano Banana, Imagen, and even Midjourney v6.1 from 2024.
OpenAI effectively admitted this.
March: GPT Image 1.
December: GPT Image 1.5.
They rebranded DALL-E and tried to rejuvenate it.
The company's focus shifted to Sora (video) and GPT-5 (reasoning). Image generation got deprioritized until last week.
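For context on the 51% win rate quoted above, here is a minimal sketch of how a win rate maps to a rating gap. It assumes the standard Elo logistic formula with a 400-point scale; the leaderboard's exact scoring method is not described here, so treat the function names and the result as illustrative only.

```python
import math


def expected_score(rating: float, opponent: float) -> float:
    """Expected score (wins, with ties as half) under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((opponent - rating) / 400.0))


def rating_gap_for(win_rate: float) -> float:
    """Rating advantage implied by a given expected score (inverse of the above)."""
    return 400.0 * math.log10(win_rate / (1.0 - win_rate))


# Assumption: the 51% figure is an expected score against the average opponent.
gap = rating_gap_for(0.51)
print(f"A 51% win rate implies a gap of about {gap:.1f} rating points")

# Sanity check: a model that far ahead of a 984-rated opponent wins ~51% of the time.
print(f"Check: {expected_score(984 + gap, 984):.2f}")
```

Under that assumption, a 51% win rate works out to an edge of only about 7 rating points, which is why it reads as a tie.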
Best Character Consistency
94.7% character consistency accuracy. Same face, same features, across dozens of images.
Released May 29. Three variants:
Kontext [max]: $0.08/image, maximum quality
Kontext [pro]: $0.04/image, optimized for editing
Kontext [dev]: Open-weight, 12B params, free (non-commercial)
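To put the per-image prices above in perspective, here is a rough cost sketch. The prices come from the list; the batch size of 500 images and the function name are hypothetical, and [dev] is left out because its real cost is your own hardware.

```python
# USD per image for the hosted variants, taken from the list above.
PRICE_PER_IMAGE = {"max": 0.08, "pro": 0.04}


def batch_cost(variant: str, n_images: int) -> float:
    """Total cost in USD for generating n_images with the given variant."""
    return round(PRICE_PER_IMAGE[variant] * n_images, 2)


# Hypothetical workload: 500 images of the same character.
n = 500
for variant in PRICE_PER_IMAGE:
    print(f"Kontext [{variant}]: ${batch_cost(variant, n):.2f} for {n} images")
```

At these prices, [max] always costs exactly twice what [pro] does, so the choice comes down to whether the quality difference matters for your use case.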
The technical breakthrough: Processes reference images alongside prompts. No separate fine-tuning needed. 8× faster than GPT-Image-1.
FLUX 2 extended this to 10 reference images at once with >95% consistency but didn't win for a reason.
Runner-up: FLUX.2
Best Overall Image Model (MVP)

Yeah, GPT Image 1.5 hit #1 on LMArena last week. Doesn't matter.
I ran both through my standard tests. Nano Banana Pro won every single time.
Nano Banana Pro gets faces and logos right. It gets the intent behind your prompt, not just the words.
Runner-up: GPT Image 1.5
My take

2025 solved the image generation problem.
Text rendering, photorealism and character consistency are now problems of the past.
Use:
Nano Banana Pro for photorealism and overall quality.
Midjourney for artistic work.
Ideogram for text.
But look at what we're still asking these models to do: Generate one image at a time. Manual iteration. No memory across sessions. Character consistency that still breaks the moment you switch to video.
2026's fight is going to be about:
Real-time generation (currently takes 8-30 seconds)
Multi-image workflows (storyboards, comic panels, presentations)
Video consistency (where every current model fails)
Google’s rapid rise should tell you how fast this space moves. The model that wins 2025 might be irrelevant by March.
Next edition: AI for education and where the big dogs stand.
Until next time,
Vaibhav 🤝🏻