Solved

how to extract text from captcha image

  • 14 December 2022
  • 6 replies
  • 4014 views

Badge

In a website if we find a text captcha. Then how to get the text from it and then submit...?

icon

Best answer by Zaibi 14 December 2022, 22:49

View original

6 replies

Userlevel 5
Badge +9

Hi @srinurockz96 ,

The intention of captcha is to avoid bots. Probably it’s necessary  use another kind of tool, for example, script python, AI vision, or even a service to resolve captchas (https://2captcha.com/, and so)

Team, let us know any feedback.

Regards.

Userlevel 3
Badge +6

Dear @srinurockz96,

 

If you are using Automation Anywhere to automate a website that has a text-based captcha, you can use the OCR (optical character recognition) feature of Automation Anywhere to extract the text from the captcha image. Once the text has been extracted, you can use a "Type into" action to enter the text into the appropriate field on the website, and then use a "Click" action to submit the form.

Here is an example of how you might accomplish this:

  1. Use the "Capture Text" action to capture the text from the captcha image.

  2. Use the "Extract Data" action to extract the text from the captcha image and store it in a variable.

  3. Use the "Type into" action to enter the text from the variable into the appropriate field on the website.

  4. Use the "Click" action to submit the form.

It's important to note that text-based captchas can be difficult for OCR systems to accurately recognize, so you may need to experiment with different settings and techniques to get the best results. Additionally, some websites may use different types of captchas, such as images with distorted text or mathematical equations, which will require different approaches to solve.

 

Regards,

Userlevel 7
Badge +13

Hi @srinurockz96 ,

 

I've seen some ways of solving CAPTCHAs with Python + ML and training it against images etc and for certain ones it can work very well (usually ones with obfuscated text)

 

Another alternative I've seen is an API which uses human interaction to solve them for you

 

https://2captcha.com/

 

Never used it and it comes at a cost, but an interesting idea!. 

 

Apart from all, 

Captchas can be automated. but it is not at all recommended. Because Captchas are the way to avoid automation. Here you can go for attended automation. let the user enter the captcha.

 

There are legal issues as well when we try to automate captchas. Be aware of it. 

 

Attended automation will be a good option for you.

Userlevel 5
Badge +9

Hi @srinurockz96 ,

 

I've seen some ways of solving CAPTCHAs with Python + ML and training it against images etc and for certain ones it can work very well (usually ones with obfuscated text)

 

Another alternative I've seen is an API which uses human interaction to solve them for you

 

https://2captcha.com/

 

Never used it and it comes at a cost, but an interesting idea!. 

 

Apart from all, 

Captchas can be automated. but it is not at all recommended. Because Captchas are the way to avoid automation. Here you can go for attended automation. let the user enter the captcha.

 

There are legal issues as well when we try to automate captchas. be aware of it. 

 

Attended automation will be a good option for you.

 

Right!  Should be a good reason to try solving captchas. 

Userlevel 1
Badge +2

I don’t think automating Captchas are a good practice. As what @Padmakumar has mentioned, captchas are deployed to avoid automations.

 

Also, OCR is not a good option to automate captchas as it lacks accuracy especially because captchas can be in different format or style.

Badge +2

Dear @srinurockz96,

 

If you are using Automation Anywhere to automate a website that has a text-based captcha, you can use the OCR (optical character recognition) feature of Automation Anywhere to extract the text from the captcha image. Once the text has been extracted, you can use a "Type into" action to enter the text into the appropriate field on the website, and then use a "Click" action to submit the form.

Here is an example of how you might accomplish this:

  1. Use the "Capture Text" action to capture the text from the captcha image.

  2. Use the "Extract Data" action to extract the text from the captcha image and store it in a variable.

  3. Use the "Type into" action to enter the text from the variable into the appropriate field on the website.

  4. Use the "Click" action to submit the form.

It's important to note that text-based captchas can be difficult for OCR systems to accurately recognize, so you may need to experiment with different settings and techniques to get the best results. Additionally, some websites may use different types of captchas, such as images with distorted text or mathematical equations, which will require different approaches to solve.

 

Regards,

Hi Zaibi, 

"Capture Text" is “Capture text of window”? I can’t find "Capture Text" in action list of RPA 

Reply