Apple Open-Sources Multimodal AI Model 4M-21
Briefly

Researchers at Apple and EPFL have unveiled 4M-21, an open-source AI model that supports 21 input and output modalities and delivers strong performance across a range of vision benchmarks.
4M-21's out-of-the-box performance on numerous vision benchmarks shows that a single model can be trained on a wide range of modalities without sacrificing effectiveness.
By integrating text, pixel data, and various forms of metadata, 4M-21 enables new multimodal interactions, such as cross-modal retrieval and flexible generation, all handled by a single model.
Supporting 21 modalities compared to its predecessor's seven, 4M-21 marks a major leap in capability and underscores Apple's commitment to advancing multimodal AI.
Read at InfoQ