"We believe new model architectures are necessary to build truly useful AI models," Goel told TechCrunch. "The AI industry is a competitive space, both commercial and open source, and building the best model is crucial to success."
Goel eventually took a job at Snorkel AI, then Salesforce, while Gu became an assistant professor at Carnegie Mellon. But Gu and Goel went on studying SSMs, releasing several pivotal research papers on the architecture.
Cartesia, whose founding team also includes Ré, is behind many derivatives of Mamba, perhaps the most popular SSM today. Gu and Princeton professor Tri Dao started Mamba as an open research project last December, and continue to refine it through subsequent releases.