
Extract text from PDF

  1. #1
    Registered User
    Join Date
    10-27-2022
    Location
    Northbridge, MA
    MS-Off Ver
    365
    Posts
    10

    Extract text from PDF

    My ERP system at work produces reports as PDFs. I used an online file converter to turn the PDF into a CSV. However, a key piece of data I need from the PDF is not in table format, so it is left out when I convert from PDF to CSV. Please see the attachments for further explanation.

    The sections highlighted in yellow in the PDF snippet become the values in Excel, replacing the X's.

    The only thing I am missing is the number next to the red arrow (I only need the number, not "Summary of"). Is there a way to have Excel reference a PDF file and extract this number? There are hundreds of pages in my PDF, with each page representing a different part number.

    I'd like to populate the cell highlighted in yellow in my Excel sheet with the referenced number in the PDF.

    The spreadsheet only shows data from page 1 of the PDF. I'd like to repeat this for every page without having to go through it manually.

    EDIT: Added a PDF file as well as the PNG
    Last edited by ShcreeminEagle; 12-02-2022 at 06:32 PM.

  2. #2
    Forum Expert
    Join Date
    11-24-2013
    Location
    Paris, France
    MS-Off Ver
    Excel 2003 / 2010
    Posts
    9,831

    Re: Extract text from PDF

    I've never seen an ERP that can export a PDF file but can't create a simpler text file. So weird!

  3. #3
    Forum Moderator alansidman
    Join Date
    02-02-2010
    Location
    Steamboat Springs, CO
    MS-Off Ver
    MS Office 365 Version 2405 Win 11 Home 64 Bit
    Posts
    23,892

    Re: Extract text from PDF

    Look at the attached file, which I imported using Power Query.

    Excel 2016 (Windows) 64 bit
    Columns A to N, rows 3 to 15
    Row  Contents
    3    Column1 Column2 Column3 Column4 Column5 Column6 Column7 Column8 Column9 Column10 Column11 Column12 Column13 Column14
    4    FROM: 11/30/2021 TO: 11/30/2022 Summary Report
    5    Summary of 01110-3150
    6    Mold Size PX Density 1.4 Target Wt 1836 Parts Produced 2,268 Target Cycle 140 SRT 77.36
    7    Units/Shot 2 Final Density 1.5 Avg Wt 1725.08 Parts Packed 2,124 Avg Cycle 150.51 ART 45.3
    8    Material EPS - Modified Matl Used 9,027.75 Wt Var -6.04% Pack Rate 93.65% Cycle Var 6.98% Run Var -41.44%
    9    Bead EPS Bead Cost $1.28
    10   Material Used $11,470.14 Price/Part 13.49 Total Value $0.00 Income/SRT $-306.91
    11   Packaging Matl $1,112.17 Contribution $-23,742.52 Income/ART $-524.12
    12   Machine Overhead $9,006.23 Gross Operating Margin
    13   Packaging Labor $2,153.98
    14   Freight Cost $0.00
    15   Commission $0.00
    Sheet: Sheet1

    Power Query is a free add-in for Excel 2010 and 2013, and is built into Excel 2016 onwards (where it is referred to as "Get & Transform Data").

    It is a powerful yet simple way of getting, changing and using data from a broad variety of sources, creating steps which may be easily repeated and refreshed. I strongly recommend learning how to use Power Query - it's among the most powerful functionalities of Excel.

    - Follow this link to learn how to install Power Query in Excel 2010 / 2013.

    - Follow this link for an introduction to Power Query functionality.

    - Follow this link for a video which demonstrates how to use Power Query code provided.
    Alan עַם יִשְׂרָאֵל חַי


    Change an Ugly Report with Power Query
    Database Normalization
    Complete Guide to Power Query
    Man's Mind Stretched to New Dimensions Never Returns to Its Original Form
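
In the Power Query import shown in post #3, the part number sits on the row that starts with "Summary of" (for example "Summary of 01110-3150"). As a rough sketch of how that value could be pulled out for every page, the Python snippet below assumes the text of each PDF page has already been extracted into a string by some other tool; it does not read the PDF itself. The function name `extract_part_numbers` and the pattern are hypothetical and based only on the single sample in this thread, so the pattern may need adjusting for other part number formats.

```python
import re

# Assumed heading format, taken from the one sample in the thread:
# "Summary of 01110-3150". Letters, digits and hyphens are accepted.
SUMMARY_RE = re.compile(r"Summary of\s+([0-9A-Za-z][0-9A-Za-z-]*)")


def extract_part_numbers(page_texts):
    """Return one part number (or None) per page of already-extracted text."""
    results = []
    for text in page_texts:
        match = SUMMARY_RE.search(text)
        # Keep only the number itself, dropping the "Summary of" prefix.
        results.append(match.group(1) if match else None)
    return results


if __name__ == "__main__":
    # Sample page text copied from the import in post #3, plus a page
    # without a heading to show the fallback.
    pages = [
        "FROM: 11/30/2021 TO: 11/30/2022 Summary Report\n"
        "Summary of 01110-3150\n"
        "Mold Size PX Density 1.4 Target Wt 1836",
        "A page without the heading",
    ]
    print(extract_part_numbers(pages))
```

Returning `None` for a page with no heading keeps the result list aligned with the page order, so each value can be written next to the matching page's data.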
