Skip to content
How to extract line...
 
Notifications
Clear all

How to extract line items from a PDF?

7 Posts
4 Users
0 Reactions
6 Views
fergotz
(@fergotz)
Posts: 3
Active Member
Topic starter
 

I'm encountering difficulties in establishing a dependable scenario to extract each line item from a Purchase Order Document. My current approach involves using @PDF.co with the Parse Doc module and linking it to a custom template. However, this scenario lacks reliability because the PDF format isn't consistent, and the instructions for the parse PDF template need to be precise.

Here's a sample of the first, middle, and last pages of a PDF document. This PDF can contain multiple pages between the first and last, and the number of line items can alter the formatting of the first and last pages.

First page

Middle page

Last page

How can I instruct the PDF.co module to capture all line items as a single table?

Perhaps the PDF.co module isn't the most suitable tool for this task, but I'm unaware of any other options that can achieve this.

 
Posted : 27/09/2024 1:41 am
XenoMax
(@xenomax)
Posts: 17
Active Member
 

Have you experimented with the AI Invoice Parser? It could potentially resolve the problem.

image

 
Posted : 27/09/2024 1:44 am
fergotz
(@fergotz)
Posts: 3
Active Member
Topic starter
 

I'm unable to find that module as an option.

 
Posted : 27/09/2024 1:46 am
samliew
(@samliew)
Posts: 293
Reputable Member
 

I personally use DumplingAI’s “Extract data from PDF with AI” module. You can also use “Convert PDF to Text” or “Extract Data from Image(s)”.

Hope this helps! Let me know if there are any further questions or issues.

P.S.: Investing some effort into the callin.io Academy will save you lots of time and frustration using callin.io.

 
Posted : 27/09/2024 1:48 am
XenoMax
(@xenomax)
Posts: 17
Active Member
 

That's odd, it ought to be visible within the PDF section of the modules, appearing as the sixth option from the top.

 
Posted : 27/09/2024 1:51 am
fergotz
(@fergotz)
Posts: 3
Active Member
Topic starter
 

Thanks! DumplingAI worked perfectly. Although, I think it will get very expensive after 4 or 5 PDFs because even for this first test using the free 100 credits of my account, it only extracted 57 complete line items, and this one sample has 533 :skull:. Any idea of another tool that could be cheaper?

Again… DumplingAI worked perfectly! It did extract the line items very accurately. Thanks for suggesting that tool.

 
Posted : 27/09/2024 2:04 am
PDF.co
(@pdf-co)
Posts: 1
New Member
 

Hello,

You can locate the AI Invoice Parser functionality within the 'Parse a Document' section. Please refer to the sample screenshot provided below for assistance.

Should you still have trouble locating it, you can also utilize the callin.io 'Make an API Call' feature to directly interact with the AI Invoice Parser API endpoint. A guiding screenshot is included below.

If you have any further questions or require additional support, please feel free to reach out.

 
Posted : 02/10/2024 7:58 pm
Share: