I Gave 9 AI Image Models the Same "Impossible" Prompt. Here's What Happened.
I designed a single text-to-image prompt with 7 hidden stress tests baked in. Then I ran it through Gemini, GPT-4o, Midjourney, Qwen, Wan, Seedream, Flux, Z-Image, and Uni-1. The prompt asked for an elderly Japanese calligrapher painting the kanji character 勇 (courage) in a candlelit tatami room with very specific requirements: correct shadow direction from a defined light source, three wall scrolls with different kanji, a tortoiseshell cat sleeping on a red cushion, a snowy garden through an open window, and medium format film aesthetics. Every element was chosen to test something most AI models get wrong. What I was actually testing: 1. Specific kanji rendering - Can it produce a real Japanese character, not gibberish? 2. Shadow physics - Candle on the left means shadow falls right. Basic physics that most models ignore. 3. Object counting + variation - Three scrolls, each with a different character. 4. Breed-specific animal - Tortoiseshell pattern, sleeping pose, correct placement on the floor. 5. Reflections on curved surfaces - Candlelight on a ceramic tea cup. 6. Dual-temperature lighting - Warm tungsten interior vs. cool blue moonlit exterior in the same frame. 7. Film simulation - Shallow depth of field with medium format bokeh characteristics. The Results (ranked) 1. Gemini (Nano Banana Pro) - 9.0/10 The champion, and it won by doing something no other model managed: zero catastrophic failures. Three perfect scroll kanji (道, 夢, 志 meaning way, dream, ambition), correct shadow on the shoji screen, sleeping cat on the floor on a red cushion, visible candlelight reflection on the tea cup, and the most natural dual-temperature lighting of any model. The only misses were the main kanji on the paper (not quite 勇) and slightly ambiguous gender. 2 (tied). Uni-1 - 8.7/10 Uni-1 debuted in the shootout and immediately tied for second place. Three real distinct kanji on the scrolls (静, 誠, 道 meaning tranquility, sincerity, way), the best cat placement of any model (tabby/tortoiseshell, curled and sleeping on a red cushion on the floor), correct shadow direction on the shoji screen, and an authentic indigo kimono with patterned obi. The calligrapher clearly reads as female. Only real miss was the main kanji on the paper not being 勇.