Hi Team,

I have a Thai-language PDF that I need to extract data from. But when I try to extract the data, I get boxes instead of Thai characters. I have tried multiple ways to read the PDF, such as the PDF extract text activity, and converting the PDF to an image and extracting the data from that. In every case the extracted data shows up as boxes.

Please suggest possible ways to extract/identify Thai characters using Automation Anywhere.

Thanks in advance

 

I tried by using this file:

https://www.thaivivat.co.th/garage/pdf/48_file_3959.pdf

When I used PDF: Extract Text, I got this output:

You may be running into an issue with the specific PDF. Is it something you can send to my work email? aaron.gleason@automationanywhere.com
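In the meantime, one way to narrow this down is to check whether the extracted string actually contains Thai code points (in which case the boxes may only be a font/display problem in whatever is showing the text) or whether it contains replacement or private-use characters (in which case the extraction itself is likely failing for that PDF). This is only a rough sketch in plain Python, outside of Automation Anywhere; the function name `classify_extracted_text` and the three labels it returns are made up for illustration, and the "garbled" heuristic is an assumption, not a definitive test.

```python
import unicodedata

# Thai block in Unicode: U+0E00 to U+0E7F.
THAI_START, THAI_END = "\u0e00", "\u0e7f"

# Characters that commonly show up as "boxes": the replacement character
# and the white square. This list is an assumption, not exhaustive.
BOX_CHARS = {"\ufffd", "\u25a1"}


def classify_extracted_text(text: str) -> str:
    """Hypothetical helper: roughly classify text extracted from a PDF."""
    thai = sum(1 for ch in text if THAI_START <= ch <= THAI_END)
    broken = sum(
        1
        for ch in text
        # "Co" = private use, "Cn" = unassigned code point
        if ch in BOX_CHARS or unicodedata.category(ch) in ("Co", "Cn")
    )
    if thai > 0 and thai >= broken:
        return "thai_present"  # data looks fine; suspect the display font
    if broken > 0:
        return "garbled"  # extraction likely lost the character mapping
    return "no_thai"  # nothing Thai and nothing obviously broken


if __name__ == "__main__":
    # Paste the text that the extract step returned in place of this sample.
    sample = "\ufffd\ufffd\ufffd"
    print(classify_extracted_text(sample))
```

If the result is "thai_present", the extracted data is probably fine and the boxes come from viewing it with a font that has no Thai glyphs. If it is "garbled", the PDF's embedded fonts may not map back to Unicode, which would point to an issue with that specific file.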

