Fable seems to be a great model in my testing. Currently I’m building a Mac desktop app, and I’m working hard to be faithful to the look that Apple has for their apps. Fable seems to be pretty good at it. But so does GPT 5.5 as long as I’m clear. Maybe my problems aren’t hard enough.
Fable has a tendency to use jargon a lot more in a dense, PhD student sort of way. It’s able to elegantly explain things though if asked. It shares the Opus tendency to jump in and get started even when something isn’t clear. I suppose that is good if you want to move fast but not great if you are a deliberate person.
Do I feel like Fable is a step change? Perhaps yes. But it still isn’t good enough for me to let it run on its own. Is the extra cost worth the extra intelligence, given that I still have to baby sit it for tasks? No, probably not.