#small-language-models

[ follow ]
#on-device-ai
fromInfoQ
1 month ago

New IBM Granite 4 Models to Reduce AI Costs with Inference-Efficient Hybrid Mamba-2 Architecture

IBM attributes those improved characteristics vs. larger models to its hybrid architecture that combines a small amount of standard transformer-style attention layers with a majority of Mamba layers-more specifically, Mamba-2. With 9 Mamba blocks per 1 Transformer block, Granite gets linear scaling vs. context length for the Mamba parts (vs. quadratic scaling in transformers), plus local contextual dependencies from transformer attention (important for in-context learning or few-shots prompting).
Artificial intelligence
fromTechzine Global
1 month ago

Cisco acquires NeuralFabric to strengthen the foundation of AI Canvas

The acquisition should help organizations build, train, and deploy specific AI models and Small Language Models (SLMs) within their own infrastructure. NeuralFabric's technology should primarily ensure that the new AI Canvas has an even more solid foundation. The future of AI models lies at least as much in small models as in large ones. To make AI truly interesting within organizations, we don't need another generic model, but rather more specialized models and SLMs.
Artificial intelligence
Artificial intelligence
fromZDNET
2 months ago

Claude's latest model is cheaper and faster than Sonnet 4 - and free

Anthropic launched Haiku 4.5, a smaller, faster, cost-effective model available on Claude.ai free plans offering strong coding and safety performance.
fromFortune
2 months ago

Want to build your own chatbot for $100? A glimpse into AI's small, cheap, DIY future | Fortune

Andrej Karpathy, a former OpenAI researcher and Tesla's former director of AI, calls his latest project the "best ChatGPT $100 can buy." Called "nanochat," the open-source project, released yesterday for his AI education startup EurekaAI, shows how anyone with a single GPU server and about $100 can build their own mini-ChatGPT that can answer simple questions and write stories and poems.
Tech industry
#large-language-models
Artificial intelligence
fromEntrepreneur
6 months ago

The AI Advantage Most Entrepreneurs Are Missing | Entrepreneur

The next wave of AI opportunities lies in small language models (SLMs) due to their accessibility and efficiency, not just larger, more powerful models.
[ Load more ]