36 results found Sort:

4.9k
32.3k
apache-2.0
263
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
3,879 commits to main branch, last one 9 hours ago
3.5k
15.7k
apache-2.0
469
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
This repository has been archived (exclude archived)
Created 2017-06-15
4,379 commits to master branch, last one about a year ago
531
6.9k
apache-2.0
70
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,187 commits to master branch, last one 14 hours ago
527
3.5k
apache-2.0
172
Fast and flexible AutoML with learning guarantees.
Created 2018-06-28
440 commits to master branch, last one 3 years ago
Everything we actually know about the Apple Neural Engine (ANE)
Created 2020-04-15
77 commits to master branch, last one 2 months ago
334
1.7k
apache-2.0
38
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created 2019-11-05
98 commits to master branch, last one 3 years ago
Large-scale LLM inference engine
Created 2023-06-23
906 commits to main branch, last one 17 hours ago
30
503
cc-by-4.0
8
Everything you want to know about Google Cloud TPU
Created 2022-02-28
66 commits to main branch, last one 5 months ago
Neural network-based chess engine capable of natural language commentary
Created 2020-03-14
505 commits to main branch, last one 2 years ago
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
Created 2021-06-26
24 commits to main branch, last one about a year ago
32
256
apache-2.0
19
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Created 2024-03-01
126 commits to main branch, last one 2 days ago
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
Created 2018-12-25
44 commits to master branch, last one about a year ago
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images into S...
Created 2020-09-07
317 commits to master branch, last one about a month ago
16
157
apache-2.0
11
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Created 2022-12-11
1,420 commits to main branch, last one 5 months ago
🖼 Training StyleGAN2 on TPUs in JAX
Created 2022-07-15
23 commits to main branch, last one 2 years ago
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
Created 2020-09-12
50 commits to master branch, last one 11 months ago
6
124
bsd-2-clause
4
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created 2021-08-17
97 commits to master branch, last one 2 years ago
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Created 2023-05-06
14 commits to main branch, last one about a year ago
28
89
apache-2.0
21
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Created 2023-06-12
210 commits to main branch, last one a day ago
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Created 2021-07-24
19 commits to main branch, last one 3 years ago
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
Created 2020-10-29
198 commits to master branch, last one 3 years ago
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
Created 2019-06-09
471 commits to master branch, last one 4 years ago
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Created 2020-03-04
1,044 commits to master branch, last one 3 months ago
6
58
apache-2.0
3
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
Created 2021-09-05
6 commits to main branch, last one 2 years ago
3
50
apache-2.0
5
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Created 2023-09-04
3 commits to main branch, last one about a year ago
22
48
unknown
2
Solana TpuClient Typescript Implementation
Created 2022-03-02
24 commits to main branch, last one 5 months ago
11
46
apache-2.0
3
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
Created 2024-03-02
224 commits to main branch, last one 9 months ago