Nvidia launches a set of microservices for optimized inferencing | TechCrunch
Briefly

NIM takes the software work Nvidia has done around inferencing and optimizing models and makes it easily accessible by combining a given model with an optimized inferencing engine and then packing this into a container, making that accessible as a microservice.
Nvidia is already working with Amazon, Google and Microsoft to make these NIM microservices available on SageMaker, Kubernetes Engine and Azure AI, respectively. They'll also be integrated into frameworks like Deepset, LangChain and LlamaIndex.
Read at TechCrunch
[
add
]
[
|
|
]