QCon SF 2024 - Scaling Large Language Model Serving Infrastructure at Meta
Scaling LLM serving infrastructure requires deep collaboration with model developers and optimal hardware utilization to manage compute demands effectively.

How TUI rapidly scaled its generative AI integration through Amazon Bedrock
Businesses need a strong data foundation to integrate generative AI and tackle infrastructure barriers for optimal performance.