63 results found
- Filter by Primary Language:
- Python (35)
- Jupyter Notebook (12)
- C++ (5)
- R (4)
- MATLAB (2)
- JavaScript (1)
- HTML (1)
- Pascal (1)
- Cython (1)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created
2023-12-12
1,951 commits to main branch, last one a day ago
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created
2022-09-26
1,656 commits to main branch, last one a day ago
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
Created
2020-03-13
543 commits to master branch, last one 9 days ago
A delightful machine learning tool that allows you to train, test, and use models without writing code
automl
sklearn
automation
data-science
scikit-learn
data-analysis
hacktoberfest
preprocessing
neural-network
machinelearning
neural-networks
machine-learning
hacktoberfest2021
automl-experiments
artificial-intelligence
machine-learning-library
machine-learning-algorithms
scikitlearn-machine-learning
Created
2020-08-27
429 commits to master branch, last one about a year ago
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
Created
2017-10-31
465 commits to master branch, last one 23 days ago
MLBox is a powerful Automated Machine Learning Python library.
Created
2017-06-01
1,121 commits to master branch, last one 4 years ago
Automated Time Series Forecasting
Created
2019-11-26
967 commits to master branch, last one 15 days ago
NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
Created
2020-04-03
1,075 commits to main branch, last one 4 months ago
Audio processing using PyTorch 1D convolution networks
Created
2019-09-02
305 commits to master branch, last one 10 months ago
A Deep Learning Python Toolkit for Healthcare Applications.
Created
2020-08-03
742 commits to master branch, last one 7 months ago
Collection of various algorithms implemented in R.
Created
2018-09-23
288 commits to master branch, last one 24 days ago
High performance model preprocessing library on PyTorch
This repository has been archived
Created
2021-09-27
439 commits to main branch, last one about a year ago
✔️Contextual word checker for better suggestions (not actively maintained)
Created
2020-04-10
184 commits to master branch, last one 15 days ago
A curated list of awesome CAE frameworks, libraries and software.
Created
2016-08-21
36 commits to master branch, last one 4 months ago
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Created
2018-10-05
30 commits to master branch, last one 3 years ago
A full pipeline AutoML tool for tabular data
Created
2020-10-22
763 commits to main branch, last one 6 months ago
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
Created
2016-04-02
132 commits to master branch, last one 4 months ago
Cylon is a fast, scalable, distributed-memory, parallel runtime with a Pandas-like DataFrame.
Created
2019-10-10
1,307 commits to main branch, last one about a year ago
Japanese text normalizer for mecab-neologd
Created
2015-07-21
103 commits to master branch, last one 7 months ago
Just some tool repackers like to use...
This repository has been archived
Created
2022-01-09
29 commits to main branch, last one about a year ago
[WIP] VoiceSmith makes training text to speech models easy.
Created
2022-05-17
238 commits to main branch, last one 2 years ago
Analysis-ready CMIP6 data in Python the easy way with Pangeo tools.
Created
2019-10-16
615 commits to main branch, last one 5 months ago
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.
This repository has been archived
Created
2020-07-24
128 commits to main branch, last one 3 years ago
This is the preprocessing step of the LIDC-IDRI dataset
Created
2020-04-24
30 commits to master branch, last one 4 years ago
Mambular is a Python package that simplifies tabular deep learning by providing a suite of models for regression, classification, and distributional regression tasks. It includes models such as Mambul...
Created
2024-05-03
588 commits to master branch, last one 16 days ago
An "R" package for automatic download and preprocessing of MODIS Land Products Time Series
Created
2014-07-09
1,619 commits to master branch, last one 5 months ago
The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.
Created
2018-01-25
34 commits to master branch, last one 3 years ago
Dataflow Programming for Machine Learning in R
Created
2017-10-10
3,343 commits to master branch, last one 3 days ago
PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data
Created
2018-04-12
392 commits to main branch, last one 15 days ago
Automated rejection and repair of bad trials/sensors in M/EEG
Created
2016-05-23
565 commits to main branch, last one about a month ago