The Wayback Machine - https://web.archive.org/web/20230320120338/https://github.com/huggingface/tokenizers/pulls
Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adding ByteFallback support for tokenizers.
#1183 opened Mar 17, 2023 by Narsil Loading…
Use LTO for release and benchmark builds
#1157 opened Jan 31, 2023 by csko Loading…
Allow to build without onig or fancy-regex
#1144 opened Jan 13, 2023 by llogiq Loading…
Bump dirs from 3.0 to 4.0
#1142 opened Jan 6, 2023 by hvaara Loading…
TEST Please Ignore
#1134 opened Dec 23, 2022 by hvaara Draft
Parallelize unigram trainer
#976 opened Apr 6, 2022 by mishig25 Loading…
Attempt to make unigram faster 2.
#921 opened Feb 24, 2022 by thomasw21 Loading…
Attempt to make Unigram trainer parallel.
#920 opened Feb 24, 2022 by Narsil Loading…
JVM - Add bindings with Java API
#842 opened Dec 7, 2021 by nguyenvietyen Loading…
Make AddedVocabulary more versatile
#720 opened May 28, 2021 by Backfighter Loading…
add Zenodo DOI Badge
#716 opened May 25, 2021 by SaulLu Loading…
C++ bindings
#559 opened Dec 9, 2020 by alexeyr Draft
2 of 7 tasks
ProTip! Adding no:label will show everything without a label.