+ Reply to Thread
Results 1 to 11 of 11

Excel reading PDF file - it Fails

  1. #1
    Forum Contributor
    Join Date
    10-05-2021
    Location
    Bronx, NY USA
    MS-Off Ver
    2021/365
    Posts
    126

    Excel reading PDF file - it Fails

    It seems that Excel ain't doing that good a job of importing an adding machine tape PDF.

    And I'm worried about this 'cause accountants use adding machine tapes all the time.

    The bottom line was that it couldn't be imported into Excel (Get data, From file, From PDF).

    Why is this?

    It seems like a straightforward PDF.
    Attached Files Attached Files

  2. #2
    Administrator 6StringJazzer's Avatar
    Join Date
    01-27-2010
    Location
    Tysons Corner, VA, USA
    MS-Off Ver
    MS365 Family 64-bit
    Posts
    24,721

    Re: Excel reading PDF file - it Fails

    Your PDF file is just one image. I'm not sure what you expect to happen if you import that to Excel.
    Jeff
    | | |會 |會 |會 |會 | |:| | |會 |會
    Read the rules
    Use code tags to [code]enclose your code![/code]

  3. #3
    Forum Contributor
    Join Date
    10-05-2021
    Location
    Bronx, NY USA
    MS-Off Ver
    2021/365
    Posts
    126

    Re: Excel reading PDF file - it Fails

    I was expecting Excel to convert it to an Excel spreadsheet.

    Should I have OCR'd the PDF first?

    In all the videos that I've seen on Excel getting data from PDFs it never mentioned that the PDF should be OCR'd.

  4. #4
    Forum Contributor
    Join Date
    10-05-2021
    Location
    Bronx, NY USA
    MS-Off Ver
    2021/365
    Posts
    126

    Re: Excel reading PDF file - it Fails

    Even after OCR'ing the PDF, it still can't be imported into Excel
    Attached Files Attached Files

  5. #5
    Administrator 6StringJazzer's Avatar
    Join Date
    01-27-2010
    Location
    Tysons Corner, VA, USA
    MS-Off Ver
    MS365 Family 64-bit
    Posts
    24,721

    Re: Excel reading PDF file - it Fails

    Your second file is still just an image. It looks just like the first file.

    Excel doesn't automatically OCR images when you import a PDF. It only deals with data.

    Excel 365 claims to be able to import an image file (not a PDF file containing pictures) and convert it to data. I converted your image to a JPG and tried to import to Excel but got junk as a result.

    If you have the tools to OCR your image first, that is going to give you the most reliable results.

    Values as displayed
    A
    B
    C
    D
    E
    F
    1
    ?
    20
    2024
    5
    20
    2
    10 . ” I, . }?C*
    3
    ? .
    Jun-63
    4
    ? ,
    5
    73 , 366 . ? 2
    6
    0-00
    7
    . •???.
    ?
    8
    9 , '86 . 8 ?
    9
    1 8 - 6 6 5 - 3 ?
    10
    555. 59
    11
    ??
    12
    1 0 , 593 .
    13
    78
    14
    3 - 990 - 85
    15
    5 . 897 . 58
    16
    72
    17
    ? ?
    18
    72
    19
    5 ,987• 5 ?
    20
    99
    21
    ? . 350
    22
    1 86 . 436.? 9
    Page001

  6. #6
    Forum Guru TMS's Avatar
    Join Date
    07-15-2010
    Location
    The Great City of Manchester, NW England ;-)
    MS-Off Ver
    MSO 2007,2010,365
    Posts
    44,463

    Re: Excel reading PDF file - it Fails

    You can do it with Google Translate | Images. You need to get a jpg from the content of the PDF. One way to do that is open the PDF in Acrobat. Then use Shift-Windows Key-S to get a screen print of the page. Then in Google Translate, select Images and paste the screen print into the translate window. You might have to select the numbers again and drag them to the Text window. From there, you can copy and paste the text.

    See attached.
    Attached Files Attached Files
    Last edited by TMS; 03-20-2024 at 11:01 AM.
    Trevor Shuttleworth - Retired Excel/VBA Consultant

    I dream of a better world where chickens can cross the road without having their motives questioned

    'Being unapologetic means never having to say you're sorry' John Cooper Clarke


  7. #7
    Administrator 6StringJazzer's Avatar
    Join Date
    01-27-2010
    Location
    Tysons Corner, VA, USA
    MS-Off Ver
    MS365 Family 64-bit
    Posts
    24,721

    Re: Excel reading PDF file - it Fails

    I tried this with my own OCR tool. The resolution of the tape isn't great. I could get no meaningful results from it. It read the decimal points as midline dots or hyphens because they are printed midline on the tape instead of at the baseline, and read the commas as various other symbols. I think you would need a really good OCR tool to extract data out of this.

  8. #8
    Administrator 6StringJazzer's Avatar
    Join Date
    01-27-2010
    Location
    Tysons Corner, VA, USA
    MS-Off Ver
    MS365 Family 64-bit
    Posts
    24,721

    Re: Excel reading PDF file - it Fails

    Quote Originally Posted by TMS View Post
    See attached.
    Missing attachment

  9. #9
    Forum Guru TMS's Avatar
    Join Date
    07-15-2010
    Location
    The Great City of Manchester, NW England ;-)
    MS-Off Ver
    MSO 2007,2010,365
    Posts
    44,463

    Re: Excel reading PDF file - it Fails

    Oops. There now. Missed the fact that it complained the size was too large.

    It's a bit of a messy process. Also uses Google Lens. Think I need to practice before I can fully document the steps.

  10. #10
    Forum Contributor
    Join Date
    10-05-2021
    Location
    Bronx, NY USA
    MS-Off Ver
    2021/365
    Posts
    126

    Re: Excel reading PDF file - it Fails

    Thanks for the responses, but I think going to chalk this one up to it just can't be done.

    I simply thought that when Excel could read or import PDF files, something as simple as an adding machine tape that was scan to PDF would be one of the ideal PDF documents scan.

    But alas, it just wasn't meant to be

  11. #11
    Administrator 6StringJazzer's Avatar
    Join Date
    01-27-2010
    Location
    Tysons Corner, VA, USA
    MS-Off Ver
    MS365 Family 64-bit
    Posts
    24,721

    Re: Excel reading PDF file - it Fails

    The fact that it's in a PDF file is beside the point. It's an image, and Excel's ability to "read" an image is limited.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Reading a txt file into excel
    By dshilan in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 03-25-2013, 12:43 PM
  2. Reading Data from one excel file to another?
    By Fergs in forum Excel General
    Replies: 3
    Last Post: 04-29-2009, 01:49 PM
  3. reading excel file from zip
    By walid66 in forum Excel Programming / VBA / Macros
    Replies: 5
    Last Post: 06-21-2008, 10:40 AM
  4. Excel File Fails/Closes When I Save
    By buddhajb in forum Excel General
    Replies: 1
    Last Post: 03-07-2008, 06:26 AM
  5. [SOLVED] reading txt file and copying the lines in new excel file
    By [email protected] in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 08-11-2006, 02:25 PM
  6. reading from text file to excel file
    By dgoel in forum Excel Programming / VBA / Macros
    Replies: 0
    Last Post: 04-18-2005, 03:06 PM
  7. [SOLVED] Reading CSV file from Excel (VBA)
    By Alex in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 04-06-2005, 11:07 PM
  8. Excel fails to lock file across VPN?
    By LongJudson in forum Excel General
    Replies: 0
    Last Post: 02-07-2005, 01:06 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1