Baseten raises $150 million to power the future of AI inference
Briefly

"Baseten just pulled in a massive $150 million Series D, vaulting the AI infrastructure startup to a $2.15 billion valuation and cementing its place as one of the most important players in the race to scale inference - the behind-the-scenes compute that makes AI apps actually run. If the last generation of great tech companies was built on the cloud, the next wave is being built on inference. Every time you ask a chatbot a question, generate an image, or tap into an AI-powered workflow, inference is happening under the hood."
"Baseten wants to be the go-to platform for that process - a kind of 'Stripe for AI'. The company's co-founder and CEO, Tuhin Srivastava, describes inference as the foundational layer of the modern AI economy. His pitch is simple: the better and cheaper the inference infrastructure, the more powerful the AI products that can be built on top."
"The platform is already serving high-volume workloads: Healthcare: powering billions of fine-tuned LLM calls each week for medical teams. Sales and productivity: helping companies like Clay and Writer roll out new AI capabilities faster and at scale. For customers, Baseten isn't just infrastructure - it's the difference between shipping new AI features in weeks instead of months."
Baseten raised $150 million in a Series D at a $2.15 billion valuation to expand its inference infrastructure business. The company positions inference as the foundational compute layer behind chatbots, image generation, and AI workflows, and aims to be a "Stripe for AI." Its platform already serves high-volume workloads: in healthcare, it powers billions of fine-tuned LLM calls each week for medical teams, and in sales and productivity, it helps companies such as Clay and Writer roll out AI capabilities faster. The startup has raised over $285 million in total and plans to use the new capital to expand developer tools, improve reliability, push model performance, and grow its customer success and support teams.
Read at Silicon Valley Journals