The Wayback Machine - https://web.archive.org/web/20220820114123/https://github.com/topics/wikiextractor
Here are
4 public repositories
matching this topic...
对知识库Wikidata的爬虫以及数据处理脚本 将三元组关系对齐到语料库的脚本 获取知识图谱数据的脚本
-
Updated
Aug 11, 2021
-
JavaScript
Extracting useful metadata from Wikipedia dumps in any language.
-
Updated
Sep 20, 2019
-
Python
Java tool to Wikimedia dumps into Java Article pojos for test or fake data.
📚 A Kotlin project which extracts ngram counts from Wikipedia data dumps.
-
Updated
Jun 20, 2022
-
Kotlin
Improve this page
Add a description, image, and links to the
wikiextractor
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
wikiextractor
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.