26 results found Sort:

3.6k
33.1k
agpl-3.0
188
A generative speech model for daily dialogue.
Created 2024-05-27
400 commits to main branch, last one 22 days ago
2.6k
11.0k
mit
307
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
Created 2018-02-05
15 commits to master branch, last one 5 years ago
249
3.6k
mit
66
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Created 2022-12-31
255 commits to main branch, last one 3 days ago
A linting tool for Chinese language.
Created 2019-06-23
515 commits to main branch, last one 3 months ago
81
883
apache-2.0
7
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Created 2023-04-18
19 commits to main branch, last one 9 months ago
61
557
cc-by-4.0
29
Rime Cantonese input schema | 粵語拼音輸入方案
Created 2019-09-05
888 commits to main branch, last one 15 hours ago
A framework for cleaning Chinese dialog data
Created 2021-02-28
77 commits to master branch, last one 3 years ago
🌏 简体中文 GeoJSON 世界地图,带有国家(地区)的 ISO 3166 代码、中文简称与全称。A simplified Chinese world map in GeoJSON format, including ISO 3166 codes, Chinese short names, and full names of countries (regions).
Created 2021-12-06
87 commits to main branch, last one about a month ago
收集非普通話漢語和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects
Created 2017-10-08
444 commits to master branch, last one 6 days ago
Learn, read, write and practice Mandarin by drawing strokes in Anki Desktop, AnkiDroid and AnkiMobile with audio of HSK 2.0 (HSK1-6) and HSK 3.0 (HSK 1-9) characters.
Created 2020-03-12
396 commits to main branch, last one 4 months ago
Discovering magic squares in Tang Dynasty poems
Created 2021-03-24
3 commits to main branch, last one 3 years ago
Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many mo...
Created 2020-10-31
50 commits to master branch, last one 6 days ago
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
Created 2021-01-07
1,312 commits to main branch, last one 8 months ago
10
132
cc-by-sa-4.0
3
CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조
Created 2021-01-15
115 commits to main branch, last one 3 months ago
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
Created 2019-12-04
10 commits to master branch, last one 3 years ago
solidity-by-example 教程中文翻译|@Web3-Club
Created 2023-03-19
267 commits to main branch, last one 3 months ago
15
72
cc0-1.0
8
Từ điển tiếng Việt dành cho máy đọc sách Kindle, Kobo, Pocketbook v.v.
Created 2015-10-06
635 commits to master branch, last one 3 days ago
3
68
cc0-1.0
3
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
Created 2022-04-24
13 commits to main branch, last one 2 years ago
文本去重
Created 2023-02-15
67 commits to main branch, last one 7 months ago
A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.
Created 2021-09-16
436 commits to main branch, last one 2 months ago
This codebase is a solution for making Chinese study, through Anki, more enjoyable by making the flashcards beautiful.
Created 2020-09-26
396 commits to main branch, last one about a year ago
9
44
other
2
開放粵語字典 - 現代粵語字音數據庫
Created 2020-10-22
6 commits to master branch, last one about a year ago
A demo of fine tune Stable Diffusion on Pokemon-Blip-Captions in English, Japanese and Chinese Corpus
Created 2022-10-31
14 commits to main branch, last one about a year ago
Complete, HSK 2.0/3.0 (汉语水平考试) Vocabulary Lists in Json
Created 2023-10-10
46 commits to main branch, last one 2 months ago
Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and add text post-processing
Created 2023-07-07
12 commits to master branch, last one 2 months ago