National Library of the Netherlands / Research
- The Hague, The Netherlands
- https://twitter.com/KBNLresearch
- willemjan.faber@kb.nl
Repositories
-
geolocatedomains
Geolocate list of web domains
-
xs4all-resources
Scripts and documentation related to the xs4all homepage rescue efforts
-
Demosaurus
Demo for automatic author attribution (thesaureren)
-
Annif
Forked from NatLibFi/Annif
Annif is a multi-algorithm automated classification and subject indexing tool for libraries, archives and museums. This repository is used for developing a production version of the system, based on ideas from the initial prototype.
-
dbnl-scripts
Forked from evanmiltenburg/dbnl-scripts
Scripts to scrape DBNL and work with the texts.
-
dbnl
Forked from fonolog/dbnl
Scripts to work with the Public Domain files of DBNL: https://www.dbnl.org/letterkunde/pd/index.php
-
dac
Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia descriptions using either a binary SVM classifier or a neural net.
-
meresco-lucene
Forked from seecr/meresco-lucene
Meresco Lucene is a set of components and tools to integrate Lucene (based on PyLucene 4.3) into Meresco.
-
detectDamagedAudio
Tests on how to detect damaged WAV files
-
multiNER
Multiple NER tools combined in one output. Invokes multiple NER engines in parallel.
-
digger
DIGGER dataset code
-
SaveToWaybackMachine
Forked from ookgezellig/SaveToWaybackMachine
Saving URLs of Leesplein.nl to the Wayback Machine of the Internet Archive
-
EntangledHistories
Processing of Transkribus output using XSLT and running it through Annif
-
Brinkman-catalogus
Forked from Karen-GH/Master-Thesis
The data and code accompanying my research master thesis: Exploring text mining techniques to structure a digitised catalogue.
-
openjpeg-decoder-service
A Java-based JP2 decoder service.
-
Europeana-Full-Text-in-Python
Forked from ian-nai/Europeana-Full-Text-in-Python
Various Python scripts to assist with searching and downloading full text records via the Europeana APIs.
-
ochre
Toolbox for OCR post-correction
-
weightless-core
Forked from seecr/weightless-core
Weightless supports implementing complete Python programs as co-routines, including protocol stacks, such as the HTTP protocol.
-
Annif-corpora
Forked from NatLibFi/Annif-corpora
Document and subject corpus collection for use in testing the Annif subject indexing tool
-
xxLINK-resources
Documentation and scripts related to xxLINK web sites recovery efforts
-
meresco-uri-enumerate
Forked from seecr/meresco-uri-enumerate
Meresco Uri Enumerate maps URIs to integer numbers

