3 results found

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Created 2023-06-12
50 commits to main branch, last one 10 months ago
29
327
unknown
12
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Created 2024-01-31
12 commits to main branch, last one 6 months ago
46
289
other
20
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving
Created 2023-06-26
3,108 commits to main branch, last one 8 days ago