8 results found

558
9.3k
apache-2.0
111
Running large language models on a single GPU for throughput-oriented scenarios.
This repository has been archived
Created 2023-02-15
107 commits to main branch, last one 3 months ago
Run Mixtral-8x7B models in Colab or consumer desktops
Created 2023-12-15
86 commits to master branch, last one about a year ago
217
1.8k
bsd-3-clause
44
PyTorch native quantization and sparsity for training and inference
Created 2023-11-03
1,013 commits to main branch, last one 11 hours ago
27
131
mit
9
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating ...
Created 2023-07-31
191 commits to main branch, last one about a month ago
13
94
apache-2.0
3
LLM Inference on consumer devices
Created 2024-12-25
107 commits to v0.1.0 branch, last one 3 days ago
DPDK infrastructure for software acceleration. Currently working on RX and ACL pre-filter.
Created 2019-06-10
107 commits to master branch, last one 3 years ago
23
65
lgpl-2.1
6
DPU-Powered File System Virtualization over virtio-fs
Created 2022-07-20
371 commits to master branch, last one about a year ago
12
40
unknown
10
A collection of tests for the Open vSwitch HW offload.
Created 2020-05-27
5,649 commits to master branch, last one 3 months ago