6 results found Sort:
- Filter by Primary Language:
- Python (5)
- Go (1)
- +
A Golang implemented Redis Server and Cluster. Go 语言实现的 Redis 服务器和分布式集群
Created
2019-06-01
258 commits to master branch, last one about a month ago
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Created
2023-06-12
38 commits to main branch, last one 2 months ago
Completion After Prompt Probability. Make your LLM make a choice
Created
2023-02-22
420 commits to main branch, last one 14 days ago
Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)
Created
2024-01-14
54 commits to main branch, last one 4 months ago
Notes about LLaMA 2 model
Created
2023-08-21
4 commits to main branch, last one 9 months ago
This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture ...
Created
2023-10-01
5 commits to main branch, last one 8 months ago