The Wayback Machine - https://web.archive.org/web/20220610195535/https://github.com/topics/document-analysis
Skip to content
#

document-analysis

Here are 42 public repositories matching this topic...

bunchofcoders
bunchofcoders commented Dec 28, 2021

Looks like the function below returns bytes with value 1 instead of 255 which produces near black png. for all other type of filters it works fine.

Filter: FlateDecode
ColorSpace: DeviceGray
BitsPerComponent: 1

public static byte[] Convert(ColorSpaceDetails details, IReadOnlyList decoded, int bitsPerComponent, int imageWidth, int imageHeight);

amazon-textract-transformer-pipeline
athewsey
athewsey commented Nov 17, 2021

While this sample was originally created for multi-page documents in PDF, other related use-cases (such as ID document or receipt extraction) may operate on single-page images/photographs/scans instead.

Today there's support for images in some aspects of the pipeline, but others assume PDF. It would be great to round out support for images as source documents - particularly for common JPEG+PNG

enhancement good first issue

Improve this page

Add a description, image, and links to the document-analysis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-analysis topic, visit your repo's landing page and select "manage topics."

Learn more