Looks interesting, I will check it out ... thanks Gianni
-----Original Message-----
From: ProfoxTech profoxtech-bounces@leafe.com On Behalf Of Gianni Turri
Sent: Saturday, January 13, 2024 12:07 PM
To: profoxtech@leafe.com
Subject: Re: PDF Scraping
Another option is the Balabolka Text Extract Utility; I have used it with success in the past.
https://www.cross-plus-a.com/btext.htm
This is the command line version, so you can run it from VFP.
Example usage:
blb2txt -f "My file.pdf" -out "My file.txt"
The program has many options, for example you can process many files at once.
Gianni
On Fri, 12 Jan 2024 12:46:50 +0000, Chris Davis chrisd@actongate.co.uk wrote:
Forgot Ghostscript could do that, thank you Alan ... works a treat.
-----Original Message-----
From: ProfoxTech profoxtech-bounces@leafe.com On Behalf Of Alan Bourke
Sent: Friday, January 12, 2024 11:27 AM
To: profoxtech@leafe.com
Subject: Re: PDF Scraping
Chris
This is not easy in general and probably not possible without going outside of VFP. You're probably looking at leveraging Ghostscript somehow to parse the PDF files and dump the text out.
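The message does not say which Ghostscript options were meant. One common way to dump text is the txtwrite output device, so here is a minimal sketch on that assumption. The function name pdf_to_text and the GS_BIN override are made up for illustration, and the file names are placeholders; on Windows the console executable is usually gswin64c rather than gs.

```shell
# Sketch only: extract the text of a PDF using Ghostscript's txtwrite device.
# GS_BIN is a hypothetical override for the Ghostscript executable name
# (for example gswin64c on Windows); it defaults to gs.
GS_BIN="${GS_BIN:-gs}"

pdf_to_text() {
    # $1 = input PDF, $2 = output text file (placeholder names)
    # -q        suppress startup messages
    # -dNOPAUSE do not pause between pages
    # -dBATCH   exit when the input has been processed
    "$GS_BIN" -q -dNOPAUSE -dBATCH -sDEVICE=txtwrite "-sOutputFile=$2" "$1"
}

# Usage:
#   pdf_to_text "My file.pdf" "My file.txt"
```

From VFP the same command line could be shelled out to and the resulting text file read back in. How well the text comes out depends on how the PDF was produced, so scanned documents without a text layer will not yield anything useful this way.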
-- Alan Bourke alanpbourke (at) fastmail (dot) fm