3 results found

A curated list for Efficient Large Language Models
Created 2023-05-22
577 commits to main branch, last one 2 days ago
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
Created 2024-05-28
11 commits to main branch, last one 3 months ago
3
35
apache-2.0
1
D^2-MoE: Delta Decompression for MoE-based LLMs Compression
Created 2025-02-24
4 commits to main branch, last one 5 days ago