Sign up for our Excel webinar, times added weekly:
This video teaches you how to import tables from a PDF file into Excel with Power Query. Table data that spans multiple pages in the PDF can be appended together into one cohesive Excel Table.
If you’d like to view the accompanying blog post on my website, you can access it here:
Upload your own example PDF file for a future video tutorial:
From PDF is a new feature in Power Query that is currently available of the Beta Channel for Microsoft/Office 365 subscribers. It will be rolling out to other channels in the coming months.
Importing PDF files into Excel has always been a challenge. This new feature of Power Query detects structured data tables within pages, making it easy to cleanup and prepare the data in the Power Query Editor.
We can also append (combine or stack) multiple tables from multiple pages in the PDF file. Power Query allows us to automate this entire process, making it fast and easy to import multiple PDF files into Excel.
Related Videos:
How to Combine Excel Tables or Worksheets with Power Query:
Power Query Overview – Automate Data Tasks in Excel & Power BI:
How to Install Power Query in Excel 2010 or 2013 for Windows:
#MsExcel #ExcelCampus
00:00 Introduction
01:11 Import from PDF
02:48 Append Data…(read more)
Convert Word 2 PDF: Word to PDF Converter
Convert PowerPoint 2 PDF: PPT to PDF Converter
Convert Excel 2 PDF: Excel to PDF Converter
Convert an Image 2 PDF: Image to PDF Converter
Convert HTML 2 PDF: HTML to PDF Converter
More Tools: PDF Converter
I'm trying to convert bank statements from pdf to Excel then use power query to organize all the bank deposits and label the sources of income, like if it was a cash deposit. Has anybody had success with this ?
Thank you
Excellent
What if I want to select all the tables?
hello Jon, how to import a folder with mutiple FDF files which each PDF file with at least two tables into Excel?
Thanks
Hi Jon, great video. How do we do the same thing for multiple pdfs. Im not able to replicate this when extracting from a file with many pdfs and each pdf has different number of pages with same table columns. Do share if this is possible
How can I import a monthly pdf file with a dynamic table? For example, in month1 the table is on page 100, and in month2 is on page 500. BTW this pdf file has over 1,000 tables. Many thanks, love your video.
You are awesome – Thank you!!!!
Hi
I'm having issues with bulk converting into excel using pdf from folder. I'm bringing in my bank statements for analysis each month. I have jan-aug ready in my folder to bulk install initially and then want to update sept nov dec etc on month by month basis from there on.
Unfortunately some of the pdf files have differing numbers of column headers, depending on the file month. They range from 3 columns to 8 columns? extra columns occur randomly between promoted named headers. There should only be 6 promoted columns I want. I cant just remove the other columns because they contain my data.
Its proving almost impossible to do a single query to automate the process. Any tips on this?
I’m curious to know if we can count how many tables there are in a pdf file? Currently working on automation project which needs a power query. Source file is a pdf. Please share if this is possible.
How to select all the pages of the pdf in powerquery?
Thank you very much ! This is, so far that i saw showing upto Append. My excel don't give composite table Option for all table together. This help a lot.
Actually I have been a PDF file and I need to import into excel but not working properly with me could you suport me
well demonstrated! Thank you!
I have a complex example that would be very useful. I have 4 years ov invoices from vendor in PDF's. I want to merge them into single XLS, then have a combined data set to see queries on the data. Like item searchable charges, taxes, etc.
My excel not show FROM PDF, how can I get it dear
I have hundreds of PDF files from BIM360 and I'm looking to search all the files in a folder for a specific text Serial Number and bring all the number into one chart with the file name and document number… not easy as although the PDF's are from one source and share similarities the form differ slightly depending on test assets
Hi Jon, Is it possible to fetch only limited fields from a PDF file like Vendor name, amount and address?
Thank you for the video, but I really need help for this problem: I have multiple pdf files in a folder and each file has multiple pages containing tables I need, but every month , depending on the lenght of the data in the tables, the tables might be on a different page (pdf page number) than last month. Every table in each pdf is always formatted the same every month but each table in the pdf is different from the other. I need to create a separate query for each table (example if I have a total of 20 pdf tables, I need 20 queries to elaborate and import in 20 separate data models). How can I achieve that, making sure that if next month the pdf has more pages, it incluedes every table I need. Please help me.
Hi,
Below option is not showing in my excel :
1.Select Data > Get Data > From File > From PDF.
Can anyone help me out here, how to get this option?
Excel-lent. ugh horrible. A question. How would you approach specific fields: dates: and : serial number: across multiple .pdf Microsoft Outlook attachments?
In Acrobat DC Pro would you first combine the reports as a portfolio and then query? Standardized reports. same data fields. Obviously I've tried your approach but unsuccessfully.