
release = announced and accessible via API
comparable = massive multimodal model (text, image, audio in and out), over 100 tokens / s
Update May 14: Technically, OpenAI's GPT-4o is still not accessible as of its release date, since it lacks native audio and image support.
Update May 14: At Google I/O, Google showed new multimodal capabilities, but no new model updates beyond an extended context window and the smaller Gemini Flash model.
Update Oct 01: OpenAI releases the Realtime API for GPT-4o with audio support.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | | Ṁ1,134 |
2 | | Ṁ187 |
3 | | Ṁ163 |
4 | | Ṁ133 |
5 | | Ṁ96 |
@0xSMW Is native image output in the API required? GPT-4o, for instance, has only been released using DALL-E for image generation, I believe.
@FergusArgyll As of May 13, 2024, Gemini no longer produces image output and doesn't output voice. It's likely that Google will announce a new Gemini version on May 14, 2024.