Skip to main content
Question

Extract from only the first page

  • May 9, 2026
  • 1 reply
  • 29 views

I am using AI to extract text from a handwritten form, I have build the workflow: 

  1. AI call to extract data
  2. Generate a new pdf with the data extracted from AI
  3. Merge the original (handwritten) PDF with the newly generated pdf
  4. pass the merged document into a learning instance to allow for field / document rules to validate the AI data

My problem is that the learning instance keeps wanting to get data from the 2nd page. Is there a way force either the learning instance or extraction task to only process the first page of a file? 

1 reply

Aaron.Gleason
Automation Anywhere Team
Forum|alt.badge.img+6
  • Automation Anywhere Team
  • May 9, 2026

@cgillies Here is what I would do:

  1. Build a Task Bot that uses PDF: Split document to separate each page of the PDF into separate PDFs. 
  2. Get only the first file (page 1) and process that file.
  3. Remove all the split files.

You could also export page 1 as a different file type, such as a PNG or TIFF, but then you’re losing the direct-extraction capabilities of a document that was printed to file. You would have to use OCR which isn’t a perfect technology, even in 2026.