IBM attributes those improved characteristics compared with larger models to its hybrid architecture, which combines a small number of standard transformer-style attention layers with a majority of Mamba layers (more specifically, Mamba-2). With nine Mamba blocks for every transformer block, Granite gets linear scaling with context length for the Mamba portion (versus quadratic scaling in transformers), plus the local contextual dependencies captured by transformer attention (important for in-context learning and few-shot prompting).
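To make the 9:1 interleaving concrete, here is a minimal PyTorch sketch of such a hybrid stack. This is an illustration only, not IBM's implementation: the class names, dimensions, and the simplified MambaBlock (a depthwise causal convolution plus gating standing in for the real Mamba-2 state-space recurrence) are assumptions; only the nine-to-one ratio comes from the description above.

```python
import torch
import torch.nn as nn

class AttentionBlock(nn.Module):
    """Standard self-attention block: cost grows quadratically with sequence length."""
    def __init__(self, d_model: int, n_heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x):
        h = self.norm(x)
        out, _ = self.attn(h, h, h, need_weights=False)
        return x + out

class MambaBlock(nn.Module):
    """Hypothetical stand-in for a Mamba-2 block: cost grows linearly with sequence
    length. A depthwise causal convolution plus gating replaces the actual selective
    state-space recurrence for brevity."""
    def __init__(self, d_model: int, kernel_size: int = 4):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.conv = nn.Conv1d(d_model, d_model, kernel_size,
                              padding=kernel_size - 1, groups=d_model)
        self.gate = nn.Linear(d_model, d_model)
        self.proj = nn.Linear(d_model, d_model)

    def forward(self, x):
        h = self.norm(x)
        # Causal depthwise conv: pad left, then trim back to the original length.
        c = self.conv(h.transpose(1, 2))[..., : x.size(1)].transpose(1, 2)
        return x + self.proj(c * torch.sigmoid(self.gate(h)))

class HybridStack(nn.Module):
    """Repeat groups of nine Mamba blocks followed by one attention block."""
    def __init__(self, d_model: int, n_groups: int):
        super().__init__()
        blocks = []
        for _ in range(n_groups):
            blocks += [MambaBlock(d_model) for _ in range(9)]
            blocks.append(AttentionBlock(d_model))
        self.blocks = nn.ModuleList(blocks)

    def forward(self, x):
        for block in self.blocks:
            x = block(x)
        return x

# Example: a 40-layer stack (4 groups of 9 Mamba blocks + 1 attention block).
model = HybridStack(d_model=512, n_groups=4)
x = torch.randn(1, 128, 512)   # (batch, sequence length, hidden size)
print(model(x).shape)          # torch.Size([1, 128, 512])
```

The point of the sketch is the layering pattern: most of the depth is linear-time Mamba blocks, with an occasional attention block inserted to recover precise, position-sensitive local context.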
The acquisition should help organizations build, train, and deploy specialized AI models and Small Language Models (SLMs) within their own infrastructure. Above all, NeuralFabric's technology should give the new AI Canvas an even more solid foundation. The future of AI models lies at least as much in small models as in large ones: to make AI truly useful within organizations, we don't need another generic model, but rather more specialized models and SLMs.
Andrej Karpathy, a former OpenAI researcher and former director of AI at Tesla, calls his latest project the "best ChatGPT $100 can buy." The open-source project, called "nanochat" and released yesterday through his AI education startup Eureka Labs, shows how anyone with a single GPU server and about $100 can build their own mini-ChatGPT that can answer simple questions and write stories and poems.