24 results found

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Created 2016-11-13
266 commits to master branch, last one 3 years ago
97
1.2k
bsd-3-clause
20
Prepping tables for machine learning
Created 2018-03-12
1,686 commits to main branch, last one 18 hours ago
Machine Learning library for the web and Node.
Created 2018-04-29
805 commits to master branch, last one 5 years ago
54
500
mit
5
Easy to use Python library of customized functions for cleaning and analyzing data.
Created 2020-03-25
884 commits to main branch, last one 5 days ago
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Created 2020-04-09
1,290 commits to main branch, last one 3 days ago
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Created 2018-10-05
30 commits to master branch, last one 3 years ago
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery...
Created 2023-01-21
542 commits to main branch, last one about a year ago
123
261
apache-2.0
16
Open source project for data preparation of LLM application builders
Created 2024-04-08
3,787 commits to dev branch, last one 23 hours ago
37
132
gpl-3.0
12
Social Media Mining Toolkit (SMMT) main repository
Created 2020-02-05
106 commits to master branch, last one about a year ago
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
Created 2020-09-09
158 commits to main branch, last one 29 days ago
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Created 2019-07-20
174 commits to master branch, last one 2 years ago
A time series signal analysis and classification framework
Created 2019-03-25
79 commits to master branch, last one 3 years ago
A quantitative study of over 1.25 million tweets about ChatGPT, employing data scraping, data cleaning, EDA, topic modeling, and sentiment analysis.
Created 2023-03-02
41 commits to main branch, last one about a year ago
Resources of our survey paper "A Comprehensive Survey on AI Integration at the Edge: Techniques, Applications, and Challenges"
Created 2023-01-19
123 commits to main branch, last one 12 days ago
18
49
bsd-3-clause
2
Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning
Created 2019-03-29
96 commits to master branch, last one 3 years ago
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing for Machine Learning and Natural Language Processing applications.
Created 2021-03-14
71 commits to master branch, last one 2 years ago
This repository has no description...
Created 2023-11-13
144 commits to main branch, last one 10 months ago
Data stream analytics: Implement online learning methods to address concept drift and model drift in dynamic data streams. Code for the paper entitled "A Multi-Stage Automated Online Network Data Stre...
Created 2022-10-01
26 commits to main branch, last one about a year ago
The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained
Created 2023-01-13
3 commits to main branch, last one about a year ago