fromInfoQ1 month agoMistral AI Launches API for LLM-Based OCR of Multimodal DocumentsMistral OCR aims to digitize complex documents by interleaving text, images, and tables, suitable for scientific research and historical artifacts.Marketing tech
Artificial intelligencefromTechzine Global2 months agoMicrosoft launches Phi models optimized for multimodal processingMicrosoft expands its Phi language model line with Phi-4-mini and Phi-4-multimodal for improved multimodal processing and hardware efficiency.
Artificial intelligencefromInfoQ1 month agoGoogle Introduces Gemini 2.5 Pro with Improved Reasoning and Coding CapabilitiesGemini 2.5 Pro enhances AI reasoning and coding capabilities, achieving top scores in multiple benchmarks despite some integration issues.
Artificial intelligencefromTechzine Global2 months agoMicrosoft launches Phi models optimized for multimodal processingMicrosoft expands its Phi language model line with Phi-4-mini and Phi-4-multimodal for improved multimodal processing and hardware efficiency.
Artificial intelligencefromInfoQ1 month agoGoogle Introduces Gemini 2.5 Pro with Improved Reasoning and Coding CapabilitiesGemini 2.5 Pro enhances AI reasoning and coding capabilities, achieving top scores in multiple benchmarks despite some integration issues.