#model-compression

[ follow ]
fromInfoQ
6 days ago

GenAI at Scale: What It Enables, What It Costs, and How To Reduce the Pain

My name is Mark Kurtz. I was the CTO at a startup called Neural Magic. We were acquired by Red Hat end of last year, and now working under the CTO arm at Red Hat. I'm going to be talking about GenAI at scale. Essentially, what it enables, a quick overview on that, costs, and generally how to reduce the pain. Running through a little bit more of the structure, we'll go through the state of LLMs and real-world deployment trends.
Artificial intelligence
Artificial intelligence
fromTechCrunch
1 month ago

Buzzy AI startup Multiverse creates two of the smallest high-performing models ever | TechCrunch

Multiverse Computing has released the world's smallest AI models designed for high performance on personal devices and IoT applications.
[ Load more ]