Trending repositories for topic transformer
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Translate manga/image: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/
A framework for few-shot evaluation of language models.
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable), so it combines the best of RNN and transformer: great performance, fast inference, sa...
multispy is an LSP client library in Python, intended for building applications around language servers.
SGLang is a fast serving framework for large language models and vision language models.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Large Language Model Text Generation Inference
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
Code for paper: DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification [ECCV 2024]
TradeAI: Empowering Algorithmic Trading with Deep Learning for Cryptocurrency Data. Explore the potential of deep learning in cryptocurrency trading through our full-stack algorithmic trading system, ...
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is an LSP client library in Python intended to be used to bu...
LLM theoretical performance analysis tool supporting params, FLOPs, memory, and latency analysis.
[ECCV 2024] Official PyTorch implement of paper "ParCo: Part-Coordinating Text-to-Motion Synthesis": http://arxiv.org/abs/2403.18512
PyTorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...
"You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].
[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"
[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations
《李宏毅深度学习教程》(Hung-yi Lee's Deep Learning Tutorial; recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
This repository contains a reading list of papers on Time Series Forecasting/Prediction (TSF) and Spatio-Temporal Forecasting/Prediction (STF). These papers are mainly categorized according to the typ...
[TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers
Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)
[CVPR'24 Spotlight] The official implementation of "State Space Models for Event Cameras"
Curated list of methods that focuses on improving the efficiency of diffusion models
Official code, datasets and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models" (ICML 2024)
[AAAI-2024] HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors
Transformer: PyTorch Implementation of "Attention Is All You Need"
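The core operation such an implementation revolves around is the paper's scaled dot-product attention, softmax(QKᵀ/√d)·V. A stdlib-only sketch of that formula (illustrative only; not code from the listed repository, which uses PyTorch tensors):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(Q, K, V):
    """Scaled dot-product attention; Q, K, V are lists of d-dim vectors."""
    d = len(K[0])
    out = []
    for q in Q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out

# A query aligned with the first key attends mostly to the first value.
out = attention([[1.0, 0.0]], [[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
```

Because the attention weights come from a softmax, each output row here is a convex combination of the value vectors.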
CEO (ceo-py) is an intuitive and modular AI agent framework for task automation.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
[IEEE J-BHI-2024] A Convolutional Transformer to decode mental states from Electroencephalography (EEG) for Brain-Computer Interfaces (BCI)
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer (CVPR 2024)
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
[MICCAI 2024] Region Attention Transformer for Medical Image Restoration.
Lumina-T2X is a unified framework for Text to Any Modality Generation
🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its applications
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
[ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Create-tsi is a generative AI RAG toolkit that generates AI applications with low code.
Deep learning in motion deblurring: current status, benchmarks and future prospects. The Visual Computer, 2024.
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
PyTorch native quantization and sparsity for training and inference
"Structure-Aware Sparse-View X-ray 3D Reconstruction" (CVPR 2024)
A JAX-based library for designing and training transformer models from scratch.
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
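The general idea behind low-bit KV-cache schemes like this is asymmetric quantization: map each float onto one of 2^bits levels between the data's min and max, storing only the integer code plus a (scale, zero-point) pair. A stdlib-only sketch of that generic idea (not KIVI's actual code, which handles per-channel/per-token grouping):

```python
def quantize_asym(values, bits=2):
    """Quantize a list of floats to `bits`-bit integer codes (0 .. 2**bits - 1)."""
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1              # e.g. 3 intervals for 2-bit codes
    scale = (hi - lo) / levels or 1.0     # guard against constant input
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo               # lo acts as the zero point

def dequantize_asym(codes, scale, zero):
    """Reconstruct approximate floats from integer codes."""
    return [c * scale + zero for c in codes]

# A toy "KV cache" row: 2-bit codes take 16x less space than float32,
# at the cost of a reconstruction error of at most scale / 2 per element.
kv_row = [0.1, -0.4, 0.25, 0.9]
codes, scale, zero = quantize_asym(kv_row, bits=2)
approx = dequantize_asym(codes, scale, zero)
```

The asymmetric zero point lets the code range cover min..max exactly, which matters for the skewed distributions KV activations typically have.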
A collection of the best ML and AI news every week (research, news, resources)
llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...
Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis
Space Group Informed Transformer for Crystalline Materials Generation
Official code of "QKFormer: Hierarchical Spiking Transformer using Q-K Attention" (NeurIPS 2024, Spotlight 3%)
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes