The Verge
1 month agoArtificial intelligence
Microsoft brings out a small language model that can look at pictures
Microsoft introduced Phi-3-vision, a multimodal language model focusing on both text and image comprehension, specifically for mobile devices. [ more ]