Smith, who tested Codex for a month and ended up rewriting a bunch of his apps and shipping versions for Windows and Android: ...
Android Bench ranks AI models based on their ability to complete real Android coding challenges.