Microsoft released a suite of fresh MAI models at Build 2026. They work fine, but they can't compete with Claude and Gemini.
I compared Claude Opus 4.8 with 4.7 in a 10-round honesty test - and a legal prompt broke it ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results