Statistics for topic inference
RepositoryStats tracks 579,129 GitHub repositories; of these, 298 are tagged with the inference topic. The most common primary language for repositories using this topic is Python (128). Other languages include C++ (58), Jupyter Notebook (27), Rust (11), and TypeScript (11).
Stargazers over time for topic inference
Most starred repositories for topic inference
Trending repositories for topic inference
A high-throughput and memory-efficient inference and serving engine for LLMs
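This tagline matches the vLLM project; assuming so, a minimal offline-generation sketch (the model name and sampling settings below are illustrative placeholders, and a GPU is assumed):

```python
# Sketch: batch generation with vLLM's offline Python API.
# Model name is a small placeholder chosen for demonstration.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["What is memory-efficient inference?"], params)
for out in outputs:
    print(out.outputs[0].text)
```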
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
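The "single line" refers to Xinference exposing an OpenAI-compatible endpoint, so only the client's base URL changes. A hedged sketch (the port and model UID are assumptions; check the project's docs for your deployment):

```python
# Sketch: point the standard OpenAI client at a locally running
# Xinference server instead of api.openai.com.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:9997/v1",  # assumed default port
                api_key="not-needed")  # local server ignores the key

resp = client.chat.completions.create(
    model="my-llama-model",  # hypothetical model UID registered in Xinference
    messages=[{"role": "user", "content": "Hello from a local LLM"}],
)
print(resp.choices[0].message.content)
```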
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
whisper-cpp-serve: Real-time speech recognition and serving of OpenAI's Whisper model in C/C++
A scalable Python-based framework for gene regulatory network inference using tree-based ensemble regressors.
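The tree-based approach can be sketched generically: regress each target gene's expression on transcription-factor expression and rank regulators by ensemble feature importance. A toy illustration with scikit-learn, not the framework's own API (all data and names are placeholders):

```python
# Toy sketch of tree-based gene regulatory network inference:
# fit a random forest per target gene, rank TFs by importance.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n_samples, tfs = 100, ["TF1", "TF2", "TF3"]
tf_expr = rng.normal(size=(n_samples, len(tfs)))  # TF expression matrix
target_expr = 2 * tf_expr[:, 0] + rng.normal(scale=0.1, size=n_samples)

model = RandomForestRegressor(n_estimators=200, random_state=0)
model.fit(tf_expr, target_expr)

# Candidate regulatory links, strongest first
for tf, score in sorted(zip(tfs, model.feature_importances_),
                        key=lambda p: -p[1]):
    print(f"{tf} -> target_gene: importance {score:.3f}")
```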
Minimal code and examples for running inference with Sapiens foundation human models in PyTorch
Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"
A high-throughput and memory-efficient inference and serving engine for LLMs
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models for inference.
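The underlying arithmetic is simple: weight memory is roughly parameter count times bits per weight divided by eight, plus overhead for the KV cache and activations. A back-of-envelope sketch (the 20% overhead factor is an assumption):

```python
# Rough estimate of RAM needed for a quantized model's weights.
def model_ram_gb(n_params_billion: float, bits_per_weight: int,
                 overhead: float = 1.2) -> float:
    bytes_for_weights = n_params_billion * 1e9 * bits_per_weight / 8
    return bytes_for_weights * overhead / 1e9

# e.g. a 7B-parameter model at 4-bit quantization:
print(f"{model_ram_gb(7, 4):.1f} GB")  # ~4.2 GB
```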
PocketGroq is a powerful Python library that simplifies integration with the Groq API, offering advanced features for natural language processing, web scraping, and autonomous agent capabilities. Key ...
SGLang is a fast serving framework for large language models and vision language models.
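A hedged sketch of SGLang's frontend DSL, which composes prompts and generation calls as Python functions (the endpoint URL assumes a separately launched SGLang server; consult the project for the current API):

```python
# Sketch: SGLang @function DSL; assumes a server is running locally.
import sglang as sgl

@sgl.function
def answer(s, question):
    s += "Q: " + question + "\n"
    s += "A: " + sgl.gen("reply", max_tokens=64)

sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
state = answer.run(question="What does a serving framework do?")
print(state["reply"])
```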
High-performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
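For the inference side, DeepSpeed wraps an existing model with optimized kernels via `deepspeed.init_inference`; a minimal sketch, assuming a CUDA GPU (model choice and settings are illustrative):

```python
# Sketch: wrap a Hugging Face model with DeepSpeed's inference engine.
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "gpt2"  # small model chosen for demonstration
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

engine = deepspeed.init_inference(model, dtype=torch.half,
                                  replace_with_kernel_inject=True)

inputs = tokenizer("Distributed inference is", return_tensors="pt").to("cuda")
print(tokenizer.decode(engine.module.generate(**inputs, max_new_tokens=20)[0]))
```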
PyTorch native quantization and sparsity for training and inference
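This tagline matches the torchao project; assuming so, a hedged sketch of weight-only quantization (API names as of recent torchao releases, treat as an assumption):

```python
# Sketch: quantize a model's linear layers to int8 weights in place
# with torchao, then run normal inference.
import torch
from torchao.quantization import quantize_, int8_weight_only

model = torch.nn.Sequential(
    torch.nn.Linear(512, 512), torch.nn.ReLU(), torch.nn.Linear(512, 10)
)

quantize_(model, int8_weight_only())  # in-place weight-only quantization

with torch.inference_mode():
    out = model(torch.randn(1, 512))
print(out.shape)
```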
A high-performance inference system for large language models, designed for production environments.