#prompt-caching

[ follow ]
Artificial intelligence
fromTechCrunch
1 week ago

Running AI models is turning into a memory game | TechCrunch

Rising DRAM prices and sophisticated prompt-caching orchestration make memory management a critical cost and performance factor for large-scale AI deployments.
Artificial intelligence
fromZDNET
3 months ago

Developers gain major speed and cost savings with new GPT-5.1 update

GPT-5.1 accelerates coding with adaptive and no-reasoning modes, reduces API costs via prompt caching, and enhances IDE agent capabilities.
fromTechzine Global
5 months ago

xAI launches agentic model

Its technical performance is particularly notable for its speed. Thanks to prompt caching, grok-code-fast-1 achieves cache hit rates above 90 percent and can handle multiple tool calls before the first output lines are visible. The model supports a wide range of programming languages, including TypeScript, Python, Java, Rust, C++, and Go. It can perform a variety of tasks, from setting up new projects to answering programming questions and targeted bug fixing.
Artificial intelligence
[ Load more ]