64 results found Sort:
- Filter by Primary Language:
- Python (43)
- Jupyter Notebook (11)
- Go (2)
- JavaScript (1)
- HTML (1)
- Scala (1)
- Shell (1)
- TypeScript (1)
- +
Low-code framework for building custom LLMs, neural networks, and other AI models
Created
2018-12-27
3,861 commits to master branch, last one about a month ago
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Created
2023-05-23
898 commits to main branch, last one 17 days ago
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created
2021-08-11
2,135 commits to master branch, last one a day ago
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created
2023-04-17
450 commits to main branch, last one 29 days ago
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created
2023-07-11
330 commits to main branch, last one 13 days ago
Efficient Triton Kernels for LLM Training
Created
2024-08-06
299 commits to main branch, last one a day ago
Code examples and resources for DBRX, a large language model developed by Databricks
Created
2024-03-26
9 commits to main branch, last one 7 months ago
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
Created
2022-01-04
2,285 commits to master branch, last one 20 hours ago
DLRover: An Automatic Distributed Deep Learning System
Created
2022-06-24
2,824 commits to master branch, last one a day ago
Nvidia GPU exporter for prometheus using nvidia-smi binary
Created
2021-06-08
613 commits to master branch, last one a day ago
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created
2022-09-01
29 commits to main branch, last one 4 months ago
A PyTorch Native LLM Training Framework
Created
2024-02-26
36 commits to main branch, last one 2 months ago
112
631
mit
17
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created
2023-05-28
667 commits to main branch, last one 2 days ago
irresponsible innovation. Try now at https://chat.dev/
Created
2023-05-19
503 commits to main branch, last one 6 months ago
LLM (Large Language Model) FineTuning
Created
2023-10-22
165 commits to main branch, last one 6 months ago
Repo for fine-tuning Casual LLMs
Created
2021-08-07
166 commits to main branch, last one 7 months ago
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created
2023-09-26
152 commits to main branch, last one about a month ago
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created
2023-10-23
137 commits to main branch, last one 11 months ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created
2024-03-27
214 commits to main branch, last one 2 days ago
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
Created
2024-01-16
479 commits to develop branch, last one 3 days ago
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
Created
2023-04-28
79 commits to main branch, last one 10 months ago
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created
2024-01-11
459 commits to main branch, last one 19 hours ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created
2023-09-30
1,091 commits to main branch, last one a day ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived
(exclude archived)
Created
2023-06-19
138 commits to main branch, last one 2 months ago
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
Created
2024-06-19
91 commits to main branch, last one 3 months ago
collection of text2cypher datasets, evaluations, and finetuning instructions
Created
2024-02-05
27 commits to main branch, last one 5 months ago
seqax = sequence modeling + JAX
Created
2024-04-26
6 commits to main branch, last one 4 months ago
ICLR 2024 论文和开源项目合集
Created
2024-05-14
20 commits to main branch, last one 6 months ago
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
Created
2023-12-03
217 commits to main branch, last one 4 months ago
Generate ideal question-answers for testing RAG
Created
2023-07-04
59 commits to master branch, last one 4 months ago