+ Reply to Thread
Results 1 to 5 of 5

Search and Extract from PDF

  1. #1
    Registered User
    Join Date
    09-09-2009
    Location
    Chatham, ON
    MS-Off Ver
    Excel 2003
    Posts
    22

    Search and Extract from PDF

    Hello
    I have a PDF file that i want to get data from.
    I need to search through the pdf until i match what i am looking for and then collect the data that follows, is there a way to do this?
    Somehow save the entrie pdf to a string or something?

    Thanks in advance

  2. #2
    Forum Expert royUK's Avatar
    Join Date
    11-18-2003
    Location
    Derbyshire,UK
    MS-Off Ver
    Xp; 2007; 2010
    Posts
    26,200

    Re: Search and Extract from PDF

    I doubt very much if you can do this. A pdf is a format that basically prevents altering, copying etc. Yo can get software to convert to Excel, but I'm not sure how successful it would be
    Hope that helps.

    RoyUK
    --------
    For Excel Tips & Solutions, free examples and tutorials why not check out my web site

    Free DataBaseForm example

  3. #3
    Registered User
    Join Date
    09-09-2009
    Location
    Chatham, ON
    MS-Off Ver
    Excel 2003
    Posts
    22

    Re: Search and Extract from PDF

    ok thanks, i wasn't sure if there was a way to read it into a string or not, thanks

  4. #4
    Registered User
    Join Date
    08-19-2009
    Location
    Chicago, Illinois
    MS-Off Ver
    Excel 2007
    Posts
    10

    Re: Search and Extract from PDF

    The only possible way might be to OCR it to a text file (that's Optical Character Recognition, I think). Microsoft Office has a tool called Document Imaging which can do some OCR, and since it's an MS program, it can be controlled through VBA similarly to Excel or Access. I think you'd have to convert the PDF to a picture format like TIF before the document imaging program will read it though too.

  5. #5
    Registered User
    Join Date
    06-30-2009
    Location
    Southern California, Earth
    MS-Off Ver
    Excel 2010
    Posts
    72

    Re: Search and Extract from PDF

    Chinnick,

    This is very much possible, albeit with a good amount of work ahead of you. I currently use a freeware program called pdftotext to accomplish said task.

    Here is a snippet of code from how I use it, but essentially you want to install the program in C:\PDF and go from there. Note there are many options with this program, and I use default conversion with a delimited import into Excel. You may or may not need to change this depending on what you are looking for.

    Please Login or Register  to view this content.
    You can download the program here: http://www.foolabs.com/xpdf/download.html
    Last edited by hoffey; 09-22-2009 at 04:57 PM.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1