I need help with a Power Query code that can loop through multiple PDFs in a folder and extract the tables in red font as shown in the sample PDF into Excel format as shown. Each PDF has a date on the top. I want to grab the date shown and fill it in a column as shown in the sample result. Different PDFs have different dates but the table formats are pretty similar.
In the PDFS, I only care about the tables in the red font (I used red here to differentiate the table that I need from other tables). I may have multiple tables before it as shown in the sample PDF, but I want to discard those tables. My main table of interest always appears as the last table in the document and always starts with ?Type? and ends with ?Comment?
The main issue I have is that the table in the red font have breaks in them but I still want to grab all the data in red and the headers do not span across the page. There is no specific pattern on how these breaks occur.
Thank you for your anticipated help.
Bookmarks