21 results found Sort:
- Filter by Primary Language:
- Python (12)
- C (3)
- Go (2)
- Rust (1)
- JavaScript (1)
- Java (1)
- PHP (1)
- +
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Created
2024-11-01
458 commits to main branch, last one 14 hours ago
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Created
2017-12-06
127 commits to master branch, last one 4 years ago
Content-Addressable Data Synchronization Tool
Created
2017-01-13
684 commits to main branch, last one about a year ago
Alternative casync implementation
Created
2017-11-09
302 commits to master branch, last one 24 days ago
A package for parsing PDFs and analyzing their content using LLMs.
Created
2024-07-26
28 commits to main branch, last one 5 months ago
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
Created
2023-11-05
121 commits to main branch, last one 5 days ago
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Created
2023-09-25
629 commits to development branch, last one 3 months ago
Live TS segmenter and HLS manifest creation in Go
Created
2019-05-31
115 commits to master branch, last one 3 years ago
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Created
2023-05-09
88 commits to main branch, last one about a year ago
A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
Created
2024-08-04
11 commits to master branch, last one about a month ago
📑 Split Laravel jobs into multiple separate job chunks
Created
2022-09-18
50 commits to v1 branch, last one 7 months ago
a modular multimodal framework for ai applications
Created
2024-04-08
3,034 commits to master branch, last one 9 days ago
An asynchronous event-driven HTTP client based on netty.
Created
2020-12-08
203 commits to main branch, last one 2 years ago
Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
Created
2018-08-17
139 commits to master branch, last one about a year ago
Extract and align grammar patterns from English sentences.
Created
2018-06-07
13 commits to master branch, last one 4 years ago
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
Created
2024-02-27
119 commits to main branch, last one 20 days ago
Build document-native LLM applications
This repository has been archived
(exclude archived)
Created
2024-07-30
20 commits to main branch, last one 3 months ago
FastCDC implementation in Python https://pypi.org/project/fastcdc/
Created
2020-05-07
105 commits to master branch, last one 6 months ago
Incremental asset delivery library
Created
2019-07-04
603 commits to main branch, last one about a month ago
LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.
Created
2023-09-12
64 commits to main branch, last one about a year ago
Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines
Created
2024-09-06
82 commits to main branch, last one 2 months ago