69 results found
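This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and putting large-model applications into production)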
Created
2023-05-23
904 commits to main branch, last one 18 days ago
Low-code framework for building custom LLMs, neural networks, and other AI models
Created
2018-12-27
3,861 commits to master branch, last one 3 months ago
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created
2021-08-11
2,270 commits to master branch, last one 2 days ago
Efficient Triton Kernels for LLM Training
Created
2024-08-06
387 commits to main branch, last one 3 days ago
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created
2023-07-11
334 commits to main branch, last one 11 days ago
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created
2023-04-17
451 commits to main branch, last one 16 days ago
Code examples and resources for DBRX, a large language model developed by Databricks
Created
2024-03-26
9 commits to main branch, last one 9 months ago
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel acce...
Created
2022-01-04
2,438 commits to master branch, last one 2 days ago
DLRover: An Automatic Distributed Deep Learning System
Created
2022-06-24
2,909 commits to master branch, last one a day ago
NVIDIA GPU exporter for Prometheus, using the nvidia-smi binary
Created
2021-06-08
637 commits to master branch, last one 9 days ago
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created
2022-09-01
29 commits to main branch, last one 7 months ago
A PyTorch Native LLM Training Framework
Created
2024-02-26
37 commits to main branch, last one about a month ago
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created
2023-05-28
674 commits to main branch, last one 14 days ago
LLM (Large Language Model) FineTuning
Created
2023-10-22
165 commits to main branch, last one 8 months ago
irresponsible innovation. Try now at https://chat.dev/
Created
2023-05-19
503 commits to main branch, last one 8 months ago
Repo for fine-tuning Causal LLMs
Created
2021-08-07
166 commits to main branch, last one 10 months ago
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created
2023-09-26
152 commits to main branch, last one 3 months ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created
2024-03-27
232 commits to main branch, last one about a month ago
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created
2023-10-23
137 commits to main branch, last one about a year ago
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Created
2024-01-16
500 commits to develop branch, last one 12 days ago
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created
2024-01-11
496 commits to main branch, last one 4 days ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created
2023-09-30
1,113 commits to main branch, last one 2 days ago
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
Created
2023-04-28
79 commits to main branch, last one about a year ago
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Created
2024-06-18
1,078 commits to main branch, last one 20 days ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived
Created
2023-06-19
138 commits to main branch, last one 5 months ago
Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
Created
2024-06-19
91 commits to main branch, last one 6 months ago
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Created
2025-01-05
10 commits to master branch, last one 28 days ago
A collection of ICLR 2024 papers and open-source projects
Created
2024-05-14
20 commits to main branch, last one 8 months ago
A collection of text2cypher datasets, evaluations, and fine-tuning instructions
Created
2024-02-05
27 commits to main branch, last one 7 months ago
seqax = sequence modeling + JAX
Created
2024-04-26
6 commits to main branch, last one 6 months ago