36 results found Sort:
- Filter by Primary Language:
- Python (25)
- Jupyter Notebook (4)
- Jsonnet (1)
- C++ (1)
- Shell (1)
- TypeScript (1)
- V (1)
- +
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
3,290 commits to main branch, last one 7 hours ago
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
This repository has been archived
(exclude archived)
Created
2017-06-15
4,379 commits to master branch, last one about a year ago
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created
2021-08-11
2,082 commits to master branch, last one 7 hours ago
Fast and flexible AutoML with learning guarantees.
Created
2018-06-28
440 commits to master branch, last one 3 years ago
Everything we actually know about the Apple Neural Engine (ANE)
Created
2020-04-15
77 commits to master branch, last one about a month ago
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created
2019-11-05
98 commits to master branch, last one 3 years ago
Large-scale LLM inference engine
Created
2023-06-23
801 commits to main branch, last one 2 days ago
Everything you want to know about Google Cloud TPU
Created
2022-02-28
66 commits to main branch, last one 3 months ago
Neural network-based chess engine capable of natural language commentary
Created
2020-03-14
505 commits to main branch, last one about a year ago
Differentiable Fluid Dynamics Package
Created
2022-03-21
29 commits to main branch, last one about a month ago
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
Created
2021-06-26
24 commits to main branch, last one about a year ago
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
Created
2018-12-25
44 commits to master branch, last one about a year ago
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Created
2024-03-01
121 commits to main branch, last one 2 days ago
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
Created
2020-09-07
317 commits to master branch, last one 2 days ago
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Created
2022-12-11
1,420 commits to main branch, last one 3 months ago
🖼 Training StyleGAN2 on TPUs in JAX
Created
2022-07-15
23 commits to main branch, last one 2 years ago
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
Created
2020-09-12
50 commits to master branch, last one 10 months ago
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created
2021-08-17
97 commits to master branch, last one 2 years ago
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Created
2023-05-06
14 commits to main branch, last one about a year ago
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Created
2021-07-24
19 commits to main branch, last one 3 years ago
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
Created
2020-10-29
198 commits to master branch, last one 3 years ago
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Created
2023-06-12
162 commits to main branch, last one 2 days ago
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
Created
2019-06-09
471 commits to master branch, last one 3 years ago
<케라스 창시자에게 배우는 딥러닝 2판> 도서의 코드 저장소
Created
2022-04-11
106 commits to main branch, last one 8 months ago
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Created
2020-03-04
1,044 commits to master branch, last one 2 months ago
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
Created
2021-09-05
6 commits to main branch, last one 2 years ago
:dart: Accumulated Gradients for TensorFlow 2
Created
2022-05-31
698 commits to main branch, last one 9 months ago
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Created
2023-09-04
3 commits to main branch, last one about a year ago
HomebrewNLP in JAX flavour for maintable TPU-Training
Created
2021-07-18
1,808 commits to main branch, last one about a year ago
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
Created
2024-03-02
224 commits to main branch, last one 8 months ago