Question

Getting an error in document automation

Forum|Forum|1 year ago
July 2, 2025

Gireesh B P 3262
Cadet | Tier 2

I am trying to extract a few form fields from an unstructured document with Document Automation, and I am facing the issue below. Has anyone else run into the same issue?

EXTRACT_FAILED - [Native] : 500 : DOCUMENT_FAILED : HTTPSConnectionPool(host='openaipublic.blob.core.windows.net', port=443): Max retries exceeded with url: /encodings/cl100k_base.tiktoken (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection

Padmakumar
Premier Pathfinder | Tier 7
July 3, 2025

Hi @Gireesh B P 3262,

Kindly refer to the article below and see whether it helps.

DA | EXTRACT_FAILED - [Native] : 500 : DOCUMENT_FAILED : HTTPSConnectionPool(host='openaipublic.blob.core.windows.net', port=443):

Padmakumar

Shreya.Kumar
July 7, 2025

@Gireesh B P 3262, have you been able to try the suggestion from @Padmakumar?

Gireesh B P 3262
Author
Cadet | Tier 2
July 9, 2025

Hi Shreya, the issue is with the proxy, and I am working with the internal team on it. Once I have a complete solution I will post it here. @Shreya.Kumar
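
For anyone who wants to confirm the proxy theory on their own machine, here is a minimal sketch using only the Python standard library. It rebuilds the URL from the error text and tries to fetch it, optionally through a proxy. The function names `url_from_error` and `is_reachable` are made up for illustration, and this is a diagnostic under those assumptions, not part of Document Automation itself.

```python
import re
import urllib.error
import urllib.request

# Error text as posted in this thread (the truncated tail is left out).
ERROR = (
    "HTTPSConnectionPool(host='openaipublic.blob.core.windows.net', port=443): "
    "Max retries exceeded with url: /encodings/cl100k_base.tiktoken"
)


def url_from_error(message):
    """Rebuild the URL that the failed request was trying to reach."""
    host = re.search(r"host='([^']+)'", message)
    port = re.search(r"port=(\d+)", message)
    path = re.search(r"with url: (\S+)", message)
    if not (host and port and path):
        return None
    scheme = "https" if port.group(1) == "443" else "http"
    return f"{scheme}://{host.group(1)}{path.group(1)}"


def is_reachable(url, proxy=None, timeout=10):
    """Return True if `url` answers with HTTP 200, optionally via `proxy`.

    With proxy=None, urllib falls back to any proxy settings found in the
    environment (for example the HTTPS_PROXY variable).
    """
    handlers = []
    if proxy:
        handlers.append(urllib.request.ProxyHandler({"http": proxy, "https": proxy}))
    opener = urllib.request.build_opener(*handlers)
    try:
        with opener.open(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Covers connection timeouts, DNS failures, and HTTP error responses.
        return False
```

Running `is_reachable(url_from_error(ERROR))` on the machine where the extraction runs, and then again with `proxy=` set to your organisation's proxy address, should show whether the host is blocked directly but reachable through the proxy. If both calls return `False`, the host most likely needs to be allowed by the network team.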

poonamindure
Navigator | Tier 3
July 14, 2025

Please try the points below:

1. Ensure scanned PDFs are at least 300 DPI.
2. Avoid blurry, marked-up, or dot-matrix printed documents.
3. Supported formats include PDF, JPG, PNG, and TIFF.
4. Check that the OCR engine (such as Tesseract or Google Vision) is properly installed and configured.
5. Go to Control Room > Bots > Learning Instances to verify the OCR settings.
6. Make sure the Bot Agent has admin permissions and access to the document folder.
7. Try recreating the learning instance and re-uploading the document.
