20 results found Sort:

446
1.9k
apache-2.0
60
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Created 2017-12-06
127 commits to master branch, last one 4 years ago
65
1.8k
mit
13
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Created 2024-11-01
273 commits to main branch, last one 5 days ago
117
1.5k
unknown
81
Content-Addressable Data Synchronization Tool
Created 2017-01-13
684 commits to main branch, last one about a year ago
44
343
bsd-3-clause
15
Alternative casync implementation
Created 2017-11-09
300 commits to master branch, last one 3 days ago
A package for parsing PDFs and analyzing their content using LLMs.
Created 2024-07-26
28 commits to main branch, last one 3 months ago
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Created 2023-09-25
629 commits to development branch, last one 2 months ago
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
Created 2023-11-05
88 commits to main branch, last one 4 months ago
Live TS segmenter and HLS manifest creation in Go
Created 2019-05-31
115 commits to master branch, last one 3 years ago
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Created 2023-05-09
88 commits to main branch, last one 12 months ago
📑 Split Laravel jobs into multiple separate job chunks
Created 2022-09-18
50 commits to v1 branch, last one 6 months ago
22
81
apache-2.0
3
An asynchronous event-driven HTTP client based on netty.
Created 2020-12-08
203 commits to main branch, last one 2 years ago
42
73
apache-2.0
7
a modular multimodal framework for ai applications
Created 2024-04-08
2,956 commits to master branch, last one 12 days ago
12
62
other
8
Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
Created 2018-08-17
139 commits to master branch, last one about a year ago
Extract and align grammar patterns from English sentences.
Created 2018-06-07
13 commits to master branch, last one 4 years ago
Build document-native LLM applications
This repository has been archived (exclude archived)
Created 2024-07-30
20 commits to main branch, last one 2 months ago
Incremental asset delivery library
Created 2019-07-04
603 commits to main branch, last one 10 days ago
FastCDC implementation in Python https://pypi.org/project/fastcdc/
Created 2020-05-07
105 commits to master branch, last one 5 months ago
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
Created 2024-02-27
107 commits to main branch, last one 7 days ago
LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.
Created 2023-09-12
64 commits to main branch, last one about a year ago
Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines
Created 2024-09-06
82 commits to main branch, last one about a month ago