Skip to main content

Hi,

 

I’m trying to extract system generated PO’s using Document Automation. I have encountered one pdf where one table field is span into 2 pages.

Item: Required field and Primary column

Material: Required field

Description: Required field

Page 1 in DA:

Example: 

Item     Material     Description 

1           AHTD2      Item is shipped

                               from Hyderabad

                               Item no: 21342

2           AJUG5      Item is shipped

                              from Mumbai

                              Item No: 57560

3          AKIU4        Item is shipped

Page 2 In DA:

Item      Material     Description

                               from Delhi

                               Item No: 24633

3rd row is continued in 2nd page. Thats where it is going to validation since other fields are empty and required fields.

Validation Page:

Item     Material     Description 

1           AHTD2      Item is shipped

                               from Hyderabad

                               Item no: 21342

2           AJUG5      Item is shipped

                              from Mumbai

                              Item No: 57560

3          AKIU4        Item is shipped

Error    Error           from Delhi

Error    Error           Item No: 24633

 

Is there any way to extract table fields, only if primary column is having a value.

 

Thank you in advance.

 

Hemantha Pindra.

That’s an interesting question, HemanthaPindra !

I think the following community members may be able to help 

@ChanduMohammad 

@Zaid Chougle 

@Tamil Arasu10 

@Paul Hawkins 

@Padmakumar 


Hi ​@HemanthaPindra ,

 

Have you checked the below documentation?

 

Improve table data extraction


Hi ​@Padmakumar,

 

We had faced this issue earlier, and at that time, we did try the steps mentioned in the documentation - such as choosing the primary column and setting the end-of-table indicator but it didn’t help.

 

Thank you.


Hi ​@Padmakumar,

 

We had faced this issue earlier, and at that time, we did try the steps mentioned in the documentation - such as choosing the primary column and setting the end-of-table indicator but it didn’t help.

 

Thank you.

 

If your CR is v.31 or higher, you can utilize the GenAI's vision powered data extraction for this. Using prompt tag to extract a particular column data is possible through this which works better in case of table fields spanning across multiple pages.

 

 


Hello ​@Padmakumar ​@HemanthaPindra 

 

I am also facing same issue. I am using CR v.36. I used GenAI prompt tags still this problem not solved. What to do?


Hello ​@Padmakumar ​@HemanthaPindra 

 

I am also facing same issue. I am using CR v.36. I used GenAI prompt tags still this problem not solved. What to do?

 

I would recommend raising a support ticket for this.


@HemanthaPindra Try using our new vision-based document extraction technology. I had good luck extracting a table that spanned a page.

https://docs.automationanywhere.com/bundle/enterprise-v2019/page/data-extraction-vision-models.html


Reply