36 results found Sort:
- Filter by Primary Language:
- Python (25)
- Jupyter Notebook (4)
- Jsonnet (1)
- C++ (1)
- Shell (1)
- TypeScript (1)
- V (1)
- +
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
3,511 commits to main branch, last one 13 hours ago
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
This repository has been archived
(exclude archived)
Created
2017-06-15
4,379 commits to master branch, last one about a year ago
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created
2021-08-11
2,135 commits to master branch, last one a day ago
Fast and flexible AutoML with learning guarantees.
Created
2018-06-28
440 commits to master branch, last one 3 years ago
Everything we actually know about the Apple Neural Engine (ANE)
Created
2020-04-15
77 commits to master branch, last one about a month ago
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created
2019-11-05
98 commits to master branch, last one 3 years ago
Large-scale LLM inference engine
Created
2023-06-23
825 commits to main branch, last one 21 hours ago
Everything you want to know about Google Cloud TPU
Created
2022-02-28
66 commits to main branch, last one 4 months ago
Neural network-based chess engine capable of natural language commentary
Created
2020-03-14
505 commits to main branch, last one 2 years ago
Differentiable Fluid Dynamics Package
Created
2022-03-21
29 commits to main branch, last one 2 months ago
Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
Created
2021-06-26
24 commits to main branch, last one about a year ago
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
Created
2018-12-25
44 commits to master branch, last one about a year ago
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Created
2024-03-01
123 commits to main branch, last one 2 days ago
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
Created
2020-09-07
317 commits to master branch, last one 17 days ago
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Created
2022-12-11
1,420 commits to main branch, last one 4 months ago
🖼 Training StyleGAN2 on TPUs in JAX
Created
2022-07-15
23 commits to main branch, last one 2 years ago
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
Created
2020-09-12
50 commits to master branch, last one 10 months ago
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created
2021-08-17
97 commits to master branch, last one 2 years ago
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Created
2023-05-06
14 commits to main branch, last one about a year ago
TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Created
2021-07-24
19 commits to main branch, last one 3 years ago
EvoPose2D is a two-stage human pose estimation model that was designed using neuroevolution. It achieves state-of-the-art accuracy on COCO.
Created
2020-10-29
198 commits to master branch, last one 3 years ago
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
Created
2023-06-12
186 commits to main branch, last one 21 hours ago
Repository for Google Summer of Code 2019 https://summerofcode.withgoogle.com/projects/#4662790671826944
Created
2019-06-09
471 commits to master branch, last one 3 years ago
<케라스 창시자에게 배우는 딥러닝 2판> 도서의 코드 저장소
Created
2022-04-11
106 commits to main branch, last one 8 months ago
Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Created
2020-03-04
1,044 commits to master branch, last one 2 months ago
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
Created
2021-09-05
6 commits to main branch, last one 2 years ago
:dart: Accumulated Gradients for TensorFlow 2
Created
2022-05-31
698 commits to main branch, last one 10 months ago
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Created
2023-09-04
3 commits to main branch, last one about a year ago
HomebrewNLP in JAX flavour for maintable TPU-Training
Created
2021-07-18
1,808 commits to main branch, last one about a year ago
Solana TpuClient Typescript Implementation
Created
2022-03-02
24 commits to main branch, last one 4 months ago