Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, with an ingestor to a Solr or Elasticsearch index and a linked data graph database
python
pdf
elasticsearch
enrichment
ocr
annotation
etl
solr
rdf
extractor
extract
extract-information
named-entity-recognition
documents
ingest
extract-text
enrichment-analysis
solr-dataimporter
ingests-documents
ingestion-pipeline
Updated Apr 12, 2020 - Python

