Questions tagged [pdf]
Portable Document Format (PDF) is an open standard for electronic document exchange maintained by the International Organization for Standardization (ISO). Questions can be about creating, reading, editing PDFs using different languages.
84 questions
4
votes
1
answer
83
views
IBM File Processing and PDF Conversion App Using PyQt6 and FPDF
I am trying to convert a 111MB TEXT file to PDF quickly. I'm currently using the FPDF library, and large files take about 40 minutes to process. The text file is an IBM carriage control (ANSI) file ...
-1
votes
1
answer
123
views
Send populated PDF membership forms to members via email
I have written a routine to send emails to members.
Basically the routine creates a table containing members who will receive an email.
The routine copies the 1st record to another table, exports that ...
6
votes
2
answers
752
views
Speed up search function for PDFs
This function takes a file path to a PDF file, and a search string(s). It spits out a count of the number of times the string(s) shows in the PDF. Any ideas how I can make it faster?
It can be tested ...
2
votes
1
answer
90
views
Re-write of PDF invoices from user input, decoupling of Invoice from FPDF class
Following on from this post and taking on board some of the suggestions, what follows is a re-write of the original python code. It's also just a rewrite of some of the functionality, this code only ...
1
vote
2
answers
244
views
PDF invoices from user input
Firstly the goal of this code is a little more complex than in this post and the other invoice questions are C#
Is there anything that just pops out as immeadiately unpythonic/wrong ?
I have a few ...
0
votes
0
answers
113
views
Walk a folder of files and extract creation date, countries mentioned and a summary based on their text for each
I made a script that scans a folder of 723 documents (pdf, pptx, docx) to extract the 'Name', 'Type', 'NbPages', 'Creation Date', 'Period Date', 'Countries', 'Summary', 'Indicateurs', 'Path'. I think ...
4
votes
2
answers
238
views
Unstructured to Structured TOC
The following code tries to convert an unstructured TOC with bounding box layout data given by the output of pdftotext -bbox-layout -f 11 -l 13 new_book.pdf toc.html...
7
votes
3
answers
937
views
Brute force search for pdf password in python
I forgot the password to my pdf. I remembered a few characters, so I made a simple password-cracker program in Python. Is the document lost now given the number of combinations? Out of curiosity, how ...
3
votes
1
answer
104
views
PDFcomp: A PDF comprimer easy script
I developed this script in dead time to be able to compress the PDFs that they send me to university and also to be able to send scans to the various institutions that require formats that do not ...
4
votes
2
answers
390
views
Attempting an OOP approach for PDF Paranoia - Automate the Boring Stuff CH 15
This exercise is called PDF Paranoia and it comes from CH 15 of the book Automate the Boring Stuff With Python - second edition. There are two parts to the exercise, which I've separated into two ...
4
votes
1
answer
2k
views
Adding a PDF file using iText7
Can I improve this code and make it more beautiful? For example, I am using a switch statement and for each page but I only add edits to page 1 and page 2?
...
3
votes
1
answer
413
views
Resume Builder using Jinja Templates and HTML
I have attempted to hack together a CV/Resume builder and I would like a review. The aim is to output a styled PDF containing dynamic fields. I have taken a simple HTML file, an arguably bloated CSS ...
0
votes
1
answer
209
views
How to optimize the code of my pdf generation from DGV code using iText5 & c#?
I have 2 DGV's one containing bills info & another containing credit notes info. When at least one item from both the DGV's are selected then it should generate a pdf (result1) taking data from ...
4
votes
1
answer
178
views
Shell script to download Project Euler problems and combine to PDF
This is a script I created that downloads Project Euler webpages and combines them to PDF. The script also downloads animated files.
...
3
votes
1
answer
259
views
Using fluent builder pattern for generating pdf
I'm testing to use a fluent builder pattern for generating a pdf file using AbcPDF.
I've looked at several methods of accomplishing this and I'd be very glad to get some input on what I've written so ...