3 results found
Normalize a URL
Created 2015-01-11
166 commits to main branch, last one 10 months ago
Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
Created 2019-01-22
78 commits to master branch, last one 4 days ago
Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters
Created 2015-07-07
318 commits to master branch, last one about a month ago