Bookmarks / openai.com (1)

  • Hello GPT-4o

    GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversat

    nyankosenpai 2024/05/14
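    Above all, I think having read-aloud as a default feature is very convenient for visually impaired people. I hope it gets built into the OS itself so that everything can be done by voice control alone.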