How much have AI applications learned, and how can one know their capabilities if they are being evaluated with an exam that is far too easy? In 2024, with the publication of the previous benchmark to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results