2 results found Sort:

fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.
Created 2023-03-21
64 commits to main branch, last one about a year ago
54
222
apache-2.0
3
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
Created 2023-06-28
190 commits to main branch, last one 17 days ago