#large-context-llms

Artificial intelligence
from TechCrunch
2 days ago

Nvidia unveils new GPU designed for long-context inference | TechCrunch

Nvidia unveiled the Rubin CPX, a GPU designed for context windows exceeding one million tokens. It targets long-context workloads as part of a disaggregated inference infrastructure and is expected to arrive at the end of 2026.