Statistics for topic information-retrieval
RepositoryStats tracks 633,544 Github repositories, of these 230 are tagged with the information-retrieval topic. The most common primary language for repositories using this topic is Python (124). Other languages include: Jupyter Notebook (21), Java (16)
Stargazers over time for topic information-retrieval
Most starred repositories for topic information-retrieval (view more)
Trending repositories for topic information-retrieval (view more)
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
π₯ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation π₯. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
π₯ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation π₯. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced info...
Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.
π SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
π₯ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation π₯. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a c...
π₯ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation π₯. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
Query Expension for Better Query Embedding using LLMs
Official code for "π Retrieval Models Arenβt Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Official code for "π Retrieval Models Arenβt Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"
A daily digest web app that scrapes and summarizes blogs, Reddit threads, GitHub trending, and Hacker-News-trending articles all in one place.
Query Expension for Better Query Embedding using LLMs
π₯ Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation π₯. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techni...
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
A list of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
π‘ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
AI-first Search & Answer Engine for work. Open-source alternative to Glean.
Coeus π is an OSINT ToolBox empowering users with tools for effective intelligence gathering from open sources. From social media monitoring π± to data analysis π, it offers a centralized platform f...
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)
A list of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy