42 results found Sort:
- Filter by Primary Language:
- Python (28)
- Jupyter Notebook (12)
- HTML (1)
- JavaScript (1)
- +
Topic Modelling for Humans
Created
2011-02-10
4,534 commits to develop branch, last one 3 months ago
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text ...
Created
2016-09-25
67 commits to master branch, last one 3 years ago
A fast, efficient universal vector embedding utility package.
Created
2018-02-24
350 commits to master branch, last one 4 years ago
🦆 Contextually-keyed word vectors
Created
2016-01-23
460 commits to master branch, last one about a year ago
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre...
Created
2018-01-28
120 commits to master branch, last one 3 years ago
Data repository for pretrained NLP models and NLP corpora.
Created
2017-10-13
75 commits to master branch, last one 6 years ago
Compute Sentence Embeddings Fast!
Created
2019-06-06
168 commits to master branch, last one 2 years ago
中文詞向量訓練教學
Created
2016-08-26
26 commits to master branch, last one about a year ago
Fast word vectors with little memory usage in Python
Created
2018-09-03
61 commits to master branch, last one 4 years ago
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models...
Created
2017-09-10
42 commits to master branch, last one 3 years ago
Log Anomaly Detection - Machine learning to detect abnormal events logs
This repository has been archived
(exclude archived)
Created
2018-09-13
465 commits to master branch, last one 4 years ago
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Created
2018-01-23
157 commits to master branch, last one 2 years ago
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
Created
2022-03-13
107 commits to main branch, last one about a year ago
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
Created
2015-05-22
185 commits to master branch, last one 3 months ago
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
gensim
pytorch
deepwalk
node2vec
word2vec
clustering
word-vector
deep-learning
ego-splitting
factorization
node-embedding
graph-embedding
machine-learning
network-embedding
community-detection
deep-neural-network
graph-neural-network
implicit-factorization
graph-representation-learning
overlapping-community-detection
Created
2019-03-17
126 commits to master branch, last one about a year ago
Web-ify your word2vec: framework to serve distributional semantic models online
Created
2015-10-24
409 commits to master branch, last one 6 months ago
A simple Python3 tool to detect similarities between files within a repository
Created
2018-11-11
69 commits to master branch, last one 5 months ago
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Created
2019-01-27
119 commits to master branch, last one 2 years ago
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Created
2019-04-17
182 commits to master branch, last one 2 years ago
Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.
Created
2021-02-25
44 commits to main branch, last one 11 months ago
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Created
2017-10-13
131 commits to master branch, last one 2 years ago
A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Created
2018-08-09
97 commits to master branch, last one about a year ago
:notebook: Long(er) text representation and classification using Doc2Vec embeddings
Created
2017-03-07
29 commits to master branch, last one 4 years ago
document embedding and machine learning script for beginners
Created
2016-12-14
48 commits to master branch, last one 2 years ago
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Created
2019-11-09
84 commits to master branch, last one 3 months ago
Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.
Created
2016-09-04
40 commits to master branch, last one 4 years ago
gensim 中文文档
Created
2018-09-28
36 commits to master branch, last one 3 years ago
Free hands-on course with the implementation (in Python) and description of several Natural Language Processing (NLP) algorithms and techniques, on several modern platforms and libraries.
Created
2019-06-04
79 commits to main branch, last one about a year ago
Text Summarization for Research Papers
Created
2020-04-11
9 commits to master branch, last one 4 years ago
A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).
Created
2019-01-07
92 commits to master branch, last one about a year ago