document-analysis

Looks like the function below returns bytes with value 1 instead of 255 which produces near black png. for all other type of filters it works fine.

Filter: FlateDecode
ColorSpace: DeviceGray
BitsPerComponent: 1

public static byte[] Convert(ColorSpaceDetails details, IReadOnlyList decoded, int bitsPerComponent, int imageWidth, int imageHeight);

While this sample was originally created for multi-page documents in PDF, other related use-cases (such as ID document or receipt extraction) may operate on single-page images/photographs/scans instead.

Today there's support for images in some aspects of the pipeline, but others assume PDF. It would be great to round out support for images as source documents - particularly for common JPEG+PNG

May	JUN	Jul
	10
2021	2022	2023

document-analysis

Here are 42 public repositories matching this topic...

UglyToad / PdfPig

Image with FlateDecode filter and 1 bit per component issue

measurement properties

Yuliang-Liu / Curve-Text-Detector

tstanislawek / awesome-document-understanding

masyagin1998 / robin

chriswolfvision / local_adaptive_binarization

pandora-analysis / pandora

anisha2102 / docvqa

monniert / docExtractor

jpWang / LiLT

ankanbhunia / AdverseBiNet

aws-samples / amazon-textract-transformer-pipeline

[Enhancement] End-to-end support for images (as well as PDFs)

swapnil-ahlawat / Document_Layout_Analysis-MonkAI

huyhoang17 / kuzushiji_recognition

ihdia / docvisor

bookalope / Bookalope

TUWien / ReadModules

bookalope / InDesign-CEP

therealexpertai / nlapi-java

TUWien / ReadFramework

ethanhezhao / MetaLDA

JPLeoRX / detectron2-publaynet

omni-us / research-ContentDistillation-HTR

qurator-spk / sbb_column_classifier

Schlafenhase / Document-Analyzer

MBAigner / GraphConverter

JuanCarlosMartinezSevilla / MuRET-UserTool

MILE-IISc / DegradedWordsKannada

fredrikwahlberg / das2018

sohaib023 / T-Truth

MILE-IISc / MergedSymbolsKannada

Improve this page

Add this topic to your repo