Darren,
I already feel better knowing I'm not the only one thinking about this and that it is not a stupid question! :)
Paul H. Tarver Tarver Program Consultants, Inc. Email: paul@tpcqpc.com
-----Original Message-----
From: Darren [mailto:foxdev@ozemail.com.au]
Sent: Friday, April 28, 2017 10:41 AM
To: profoxtech@leafe.com
Subject: RE: Reading & Extracting Data From PDF Files
I too would be keen to know of a "Reliable" way to get text etc. out of a PDF file. I'd have to think it stands to reason: the thing appears formatted, so I have to wonder how hard it can be. To date my experience has been poor at best, so I'm with you, Paul. I have had some limited luck with a couple of the PDF to Text offerings out there. Another that I found produced pretty good results is the utility that comes with Beyond Compare - can't think of the name of it ATM but it hides in one of the BC directories.
If there was a good reliable solution I'd be happy to part with a good sum for it as it would be of great use.
A reasonably priced tool to convert RTF would be handy as well.
-----Original Message-----
From: ProfoxTech [mailto:profoxtech-bounces@leafe.com] On Behalf Of Paul H. Tarver
Sent: Saturday, 29 April 2017 1:19 AM
To: profoxtech@leafe.com
Subject: Reading & Extracting Data From PDF Files
Original Thread: Getting count of rows in a text file -- best approach?
A couple of times I've heard people mention reading in PDF files using FileToStr, and I want to know more about reading and extracting data from PDF files. I do a lot of data conversion and interface work with lots of file formats, but I've not been very successful at importing and extracting data from PDF reports. Obviously a scanned image saved as a PDF would have to be OCR'd first, but is there a reliable way to extract data from PDF reports and if so, how? I'm sure I don't know all the ins and outs of the PDF format, but when I try, I seem to get a strange mix of formatting details and data combined in a random way.
Am I being thick here or is there really a way that I can get any PDF file from any client and then successfully extract the data elements from that format?
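One likely reason a raw read of a PDF looks like a random mix of formatting and data: page text usually sits inside content streams that are often Flate-compressed, and once inflated it appears as string operands to text-showing operators such as Tj, interleaved with positioning commands. The following is a minimal Python sketch of that idea, not a general extractor. The sample bytes and the function names (build_sample_pdf, extract_text_strings) are hypothetical, made up for illustration. It assumes Flate-only streams and simple literal strings, and it ignores fonts with custom encodings, TJ arrays, hex strings, and object streams.

```python
import re
import zlib


def build_sample_pdf() -> bytes:
    """Build a tiny PDF-like byte string with one Flate-compressed stream.

    Hypothetical test data for illustration; not a complete, valid PDF.
    """
    content = (
        b"BT /F1 12 Tf 72 720 Td (Invoice 1001) Tj "
        b"0 -14 Td (Total: 42.50) Tj ET"
    )
    compressed = zlib.compress(content)
    header = b"<< /Length %d /Filter /FlateDecode >>" % len(compressed)
    return (
        b"%PDF-1.4\n1 0 obj\n" + header + b"\nstream\n"
        + compressed + b"\nendstream\nendobj\n%%EOF\n"
    )


# Everything between the "stream" and "endstream" keywords.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)
# A literal string "(...)" followed by the Tj (show text) operator.
TEXT_RE = re.compile(rb"\(((?:\\.|[^\\()])*)\)\s*Tj")


def extract_text_strings(pdf_bytes: bytes) -> list:
    """Inflate each stream and collect the literal strings shown with Tj."""
    found = []
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the trailing newline before "endstream".
            data = zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate data (image, other filter, uncompressed)
        for literal in TEXT_RE.findall(data):
            found.append(literal.decode("latin-1"))
    return found


if __name__ == "__main__":
    print(extract_text_strings(build_sample_pdf()))
```

Even when this works, the strings come back in drawing order with no notion of rows or columns, so turning them into report fields still needs layout logic. A dedicated PDF library or a converter such as pdftotext is usually the sturdier route.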
I'm prepared to be thought of as stupid but be gentle! :)
Paul H. Tarver Tarver Program Consultants, Inc. Email: paul@tpcqpc.com
-----Original Message-----
From: Brant E. Layton [mailto:dcci@futureone.com]
Sent: Wednesday, April 26, 2017 3:17 PM
To: profoxtech@leafe.com
Subject: RE: Getting count of rows in a text file -- best approach?
My experience was moving PDF files in and out of SQL Server tables - found an abrupt truncation at the 16,777,184 mark...
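For what it's worth, 16,777,184 is 2^24 - 32, which matches the documented maximum length of a Visual FoxPro character string. Assuming that limit (rather than anything on the SQL Server side) caused the truncation, a size check before reading a whole file into one string might look like the Python sketch below. The constant and function names are hypothetical, chosen only for illustration.

```python
# Assumed cause of the truncation: the Visual FoxPro string size limit.
VFP_STRING_LIMIT = 2**24 - 32  # 16,777,184


def fits_in_one_string(size_in_bytes: int) -> bool:
    """True if a file of this size fits in a single string under the limit."""
    return size_in_bytes <= VFP_STRING_LIMIT


def chunk_ranges(size_in_bytes: int, chunk_size: int = VFP_STRING_LIMIT) -> list:
    """Return (offset, length) pairs that cover the file in limit-sized pieces."""
    return [
        (offset, min(chunk_size, size_in_bytes - offset))
        for offset in range(0, size_in_bytes, chunk_size)
    ]


if __name__ == "__main__":
    print(fits_in_one_string(20_000_000))
    print(chunk_ranges(20_000_000))
```

A file over the limit would need to be read and written in pieces, for example with low-level file functions, instead of in a single FileToStr call.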
Brant Layton | 480.964.1316
On 4/26/2017 12:57 PM, profoxtech-request@leafe.com wrote:
RE: Getting count of rows in a text file -- best approach?