36 results found Sort:

4.5k
29.7k
apache-2.0
242
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
3,290 commits to main branch, last one 7 hours ago
3.5k
15.5k
apache-2.0
466
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
This repository has been archived (exclude archived)
Created 2017-06-15
4,379 commits to master branch, last one about a year ago
502
6.8k
apache-2.0
71
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,082 commits to master branch, last one 7 hours ago
527
3.5k
apache-2.0
172
Fast and flexible AutoML with learning guarantees.
Created 2018-06-28
440 commits to master branch, last one 3 years ago
Everything we actually know about the Apple Neural Engine (ANE)
Created 2020-04-15
77 commits to master branch, last one about a month ago
334
1.7k
apache-2.0
38
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created 2019-11-05
98 commits to master branch, last one 3 years ago
Large-scale LLM inference engine
Created 2023-06-23
801 commits to main branch, last one 2 days ago
27
491
cc-by-4.0
8
Everything you want to know about Google Cloud TPU
Created 2022-02-28
66 commits to main branch, last one 3 months ago
Neural network-based chess engine capable of natural language commentary
Created 2020-03-14
505 commits to main branch, last one about a year ago
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
Created 2021-06-26
24 commits to main branch, last one about a year ago
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
Created 2018-12-25
44 commits to master branch, last one about a year ago
31
227
apache-2.0
17
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Created 2024-03-01
121 commits to main branch, last one 2 days ago
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
Created 2020-09-07
317 commits to master branch, last one 2 days ago
16
154
apache-2.0
10
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Created 2022-12-11
1,420 commits to main branch, last one 3 months ago
🖼 Training StyleGAN2 on TPUs in JAX
Created 2022-07-15
23 commits to main branch, last one 2 years ago
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
Created 2020-09-12
50 commits to master branch, last one 10 months ago
6
124
bsd-2-clause
4
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created 2021-08-17
97 commits to master branch, last one 2 years ago
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Created 2023-05-06
14 commits to main branch, last one about a year ago
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Created 2021-07-24
19 commits to main branch, last one 3 years ago
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
Created 2020-10-29
198 commits to master branch, last one 3 years ago
23
80
apache-2.0
21
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Created 2023-06-12
162 commits to main branch, last one 2 days ago
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
Created 2019-06-09
471 commits to master branch, last one 3 years ago
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Created 2020-03-04
1,044 commits to master branch, last one 2 months ago
6
58
apache-2.0
3
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
Created 2021-09-05
6 commits to main branch, last one 2 years ago
3
47
apache-2.0
5
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Created 2023-09-04
3 commits to main branch, last one about a year ago
6
46
bsd-2-clause
6
HomebrewNLP in JAX flavour for maintable TPU-Training
Created 2021-07-18
1,808 commits to main branch, last one about a year ago
11
45
apache-2.0
3
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
Created 2024-03-02
224 commits to main branch, last one 8 months ago