1 result found

49
590
apache-2.0
12
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports LLMs, embeddings, and speech-to-text.
Created 2023-10-21
235 commits to main branch, last one 4 days ago