OpenAI's new GPT-4o model offers promise of improved smartphone assistants
Briefly

On Monday, the gulf grew larger still, as OpenAI announced a new model called GPT-4o—the 'o' stands for Omni—which gives the chatbot new abilities to understand and create audio, video, and still images. The system is uncanny to behold. It can engage in prolonged conversations about the world seen through a camera lens, carry out live translation between two different languages, and even laugh at appropriate points.
The new system can operate directly in speech without needing to lean on other models to prop it up, speeding up responses and allowing it to acknowledge quirks such as tone of voice. But it still isn't quite an AI assistant. It can answer questions and perform knowledge work, but not yet act on requests.
Read at www.theguardian.com
[
|
]