36 results found Sort:

4.6k
30.5k
apache-2.0
248
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
3,511 commits to main branch, last one 13 hours ago
3.5k
15.6k
apache-2.0
469
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
This repository has been archived (exclude archived)
Created 2017-06-15
4,379 commits to master branch, last one about a year ago
513
6.8k
apache-2.0
70
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,135 commits to master branch, last one a day ago
527
3.5k
apache-2.0
172
Fast and flexible AutoML with learning guarantees.
Created 2018-06-28
440 commits to master branch, last one 3 years ago
Everything we actually know about the Apple Neural Engine (ANE)
Created 2020-04-15
77 commits to master branch, last one about a month ago
334
1.7k
apache-2.0
38
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created 2019-11-05
98 commits to master branch, last one 3 years ago
Large-scale LLM inference engine
Created 2023-06-23
825 commits to main branch, last one 21 hours ago
27
496
cc-by-4.0
8
Everything you want to know about Google Cloud TPU
Created 2022-02-28
66 commits to main branch, last one 4 months ago
Neural network-based chess engine capable of natural language commentary
Created 2020-03-14
505 commits to main branch, last one 2 years ago
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
Created 2021-06-26
24 commits to main branch, last one about a year ago
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
Created 2018-12-25
44 commits to master branch, last one about a year ago
31
235
apache-2.0
18
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Created 2024-03-01
123 commits to main branch, last one 2 days ago
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
Created 2020-09-07
317 commits to master branch, last one 17 days ago
16
155
apache-2.0
10
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Created 2022-12-11
1,420 commits to main branch, last one 4 months ago
🖼 Training StyleGAN2 on TPUs in JAX
Created 2022-07-15
23 commits to main branch, last one 2 years ago
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
Created 2020-09-12
50 commits to master branch, last one 10 months ago
6
124
bsd-2-clause
4
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created 2021-08-17
97 commits to master branch, last one 2 years ago
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Created 2023-05-06
14 commits to main branch, last one about a year ago
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Created 2021-07-24
19 commits to main branch, last one 3 years ago
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
Created 2020-10-29
198 commits to master branch, last one 3 years ago
24
81
apache-2.0
23
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Created 2023-06-12
186 commits to main branch, last one 21 hours ago
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
Created 2019-06-09
471 commits to master branch, last one 3 years ago
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Created 2020-03-04
1,044 commits to master branch, last one 2 months ago
6
58
apache-2.0
3
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
Created 2021-09-05
6 commits to main branch, last one 2 years ago
3
49
apache-2.0
5
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Created 2023-09-04
3 commits to main branch, last one about a year ago
6
46
bsd-2-clause
6
HomebrewNLP in JAX flavour for maintable TPU-Training
Created 2021-07-18
1,808 commits to main branch, last one about a year ago
20
45
unknown
2
Solana TpuClient Typescript Implementation
Created 2022-03-02
24 commits to main branch, last one 4 months ago