1 result found Sort:
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
Created
2023-10-21
276 commits to main branch, last one 3 days ago