COLLECTED BY
Organization:
Internet Archive
The Internet Archive discovers and captures web pages through many different web crawls.
At any given time several distinct crawls are running, some for months, and some every day or longer.
View the web archive through the
Wayback Machine .
Content crawled via the
Wayback Machine Live Proxy mostly by the Save Page Now feature on web.archive.org.
Liveweb proxy is a component of Internet Archive’s wayback machine project. The liveweb proxy captures the content of a web page in real time, archives it into a ARC or WARC file and returns the ARC/WARC record back to the wayback machine to process. The recorded ARC/WARC file becomes part of the wayback machine in due course of time.
The Wayback Machine - https://web.archive.org/web/20190921172839/https://github.com/topics/chinese-nlp
Here are
95 public repositories
matching this topic...
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
Updated
Sep 21, 2019
15
commits
3
contributors
Python
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
Updated
Sep 21, 2019
138
commits
7
contributors
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Updated
Sep 21, 2019
49
commits
1
contributors
Language Technology Platform
Updated
Sep 21, 2019
703
commits
14
contributors
C++
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取
Updated
Sep 20, 2019
63
commits
4
contributors
Python
An Efficient Lexical Analyzer for Chinese
Updated
Sep 19, 2019
59
commits
6
contributors
Python
pyltp: the python extension for LTP
Updated
Sep 21, 2019
174
commits
4
contributors
C++
Baidu's open-source lexical analysis tool for Chinese, including word segmentation, part-of-speech tagging & named entity recognition.
Updated
Sep 20, 2019
52
commits
5
contributors
C++
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Updated
Sep 20, 2019
1
commits
22
contributors
Python
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and modules for the latest lucene,solr,elasticsearch
Updated
Sep 21, 2019
554
commits
8
contributors
Java
An Efficient Lexical Analyzer for Chinese
Updated
Sep 18, 2019
34
commits
4
contributors
C++
Datasets, SOTA results of every fields of Chinese NLP
Updated
Sep 21, 2019
177
commits
5
contributors
HTML
Some useful Chinese corpus datasets 中文语料小数据
Updated
Sep 12, 2019
19
commits
1
contributors
Updated
Sep 17, 2019
19
commits
1
contributors
:four_leaf_clover: Another Chinese chatbot implemented in PyTorch, which is the sub-module of intelligent work order processing robot. 👩🔧
Updated
Sep 21, 2019
8
commits
1
contributors
Python
SpaCy 中文模型 | Models for SpaCy that support Chinese
Updated
Sep 20, 2019
64
commits
1
contributors
Jupyter Notebook
zhparser is a PostgreSQL extension for full-text search of Chinese
Updated
Sep 20, 2019
87
commits
3
contributors
C
An Efficient Lexical Analyzer for Chinese
Updated
Sep 19, 2019
99
commits
3
contributors
Java
Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。
Updated
Sep 18, 2019
30
commits
1
contributors
Python
An Efficient Chinese Text Classifier
Updated
Sep 21, 2019
10
commits
2
contributors
Java
ltp4j: Language Technology Platform For Java
Updated
Sep 18, 2019
84
commits
8
contributors
C++
中文自然语言处理工具集【断句/分词/词性标注/组块/句法分析/语义分析/NER/N元语法/HMM/代词消解/情感分析/拼写检查】
Updated
Sep 20, 2019
998
commits
9
contributors
Java
一个基于 Rasa 的中文天气情况问询机器人(chatbot), 带 Web UI 界面
Updated
Sep 10, 2019
52
commits
1
contributors
一个微型&算法全面的中文分词引擎 | A micro tokenizer for Chinese
Updated
Sep 9, 2019
328
commits
1
contributors
Python
Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module)
Updated
Sep 14, 2019
4
commits
1
contributors
Python
任何 JS 环境可用的中文分词包,fork from leizongmin/node-segment
Updated
Sep 19, 2019
250
commits
5
contributors
JavaScript
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
Updated
Sep 10, 2019
15
commits
1
contributors
Python
THU Chinese Keyphrase Extraction Toolkit
Updated
Aug 17, 2019
7
commits
2
contributors
C++
使用 RASA NLU 来构建中文自然语言理解系统(NLU)| Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Updated
Aug 27, 2019
30
commits
1
contributors
Python
Collections of Chinese NLP corpus
Updated
Sep 21, 2019
11
commits
1
contributors
Python
Loading…
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.