42 results found Sort:

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text ...
Created 2016-09-25
67 commits to master branch, last one 3 years ago
A fast, efficient universal vector embedding utility package.
Created 2018-02-24
350 commits to master branch, last one 4 years ago
240
1.6k
mit
50
🦆 Contextually-keyed word vectors
Created 2016-01-23
460 commits to master branch, last one about a year ago
793
1.2k
unknown
51
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre...
Created 2018-01-28
120 commits to master branch, last one 4 years ago
135
991
lgpl-2.1
39
Data repository for pretrained NLP models and NLP corpora.
Created 2017-10-13
75 commits to master branch, last one 6 years ago
中文詞向量訓練教學
Created 2016-08-26
26 commits to master branch, last one 2 years ago
Fast word vectors with little memory usage in Python
Created 2018-09-03
61 commits to master branch, last one 4 years ago
79
395
unknown
32
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models...
Created 2017-09-10
42 commits to master branch, last one 3 years ago
Log Anomaly Detection - Machine learning to detect abnormal events logs
This repository has been archived (exclude archived)
Created 2018-09-13
465 commits to master branch, last one 5 years ago
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Created 2018-01-23
157 commits to master branch, last one 2 years ago
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
Created 2022-03-13
107 commits to main branch, last one about a year ago
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
Created 2015-05-22
185 commits to master branch, last one 4 months ago
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Created 2019-03-17
126 commits to master branch, last one about a year ago
48
197
gpl-3.0
12
Web-ify your word2vec: framework to serve distributional semantic models online
Created 2015-10-24
409 commits to master branch, last one 7 months ago
A simple Python3 tool to detect similarities between files within a repository
Created 2018-11-11
69 commits to master branch, last one 6 months ago
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Created 2019-01-27
119 commits to master branch, last one 2 years ago
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Created 2019-04-17
182 commits to master branch, last one 2 years ago
Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.
Created 2021-02-25
44 commits to main branch, last one about a year ago
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Created 2017-10-13
131 commits to master branch, last one 2 years ago
:notebook: Long(er) text representation and classification using Doc2Vec embeddings
Created 2017-03-07
29 commits to master branch, last one 4 years ago
A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Created 2018-08-09
97 commits to master branch, last one about a year ago
36
92
lgpl-2.1
26
document embedding and machine learning script for beginners
Created 2016-12-14
48 commits to master branch, last one 2 years ago
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Created 2019-11-09
84 commits to master branch, last one 4 months ago
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
Created 2016-09-04
40 commits to master branch, last one 4 years ago
gensim 中文文档
Created 2018-09-28
36 commits to master branch, last one 3 years ago
15
79
mit
3
Free hands-on course with the implementation (in Python) and description of several Natural Language Processing (NLP) algorithms and techniques, on several modern platforms and libraries.
Created 2019-06-04
79 commits to main branch, last one about a year ago
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Created 2019-01-07
92 commits to master branch, last one about a year ago
Text Summarization for Research Papers
Created 2020-04-11
9 commits to master branch, last one 4 years ago