Skip to main content
Solved

how to extract text from captcha image

  • December 14, 2022
  • 6 replies
  • 5452 views

Forum|alt.badge.img

In a website if we find a text captcha. Then how to get the text from it and then submit...?

Best answer by Zaibi

Dear @srinurockz96,

 

If you are using Automation Anywhere to automate a website that has a text-based captcha, you can use the OCR (optical character recognition) feature of Automation Anywhere to extract the text from the captcha image. Once the text has been extracted, you can use a "Type into" action to enter the text into the appropriate field on the website, and then use a "Click" action to submit the form.

Here is an example of how you might accomplish this:

  1. Use the "Capture Text" action to capture the text from the captcha image.

  2. Use the "Extract Data" action to extract the text from the captcha image and store it in a variable.

  3. Use the "Type into" action to enter the text from the variable into the appropriate field on the website.

  4. Use the "Click" action to submit the form.

It's important to note that text-based captchas can be difficult for OCR systems to accurately recognize, so you may need to experiment with different settings and techniques to get the best results. Additionally, some websites may use different types of captchas, such as images with distorted text or mathematical equations, which will require different approaches to solve.

 

Regards,

View original
Did this topic help answer your question?

6 replies

Raul Jaimes
Forum|alt.badge.img+9
  • Navigator | Tier 3
  • 494 replies
  • December 14, 2022

Hi @srinurockz96 ,

The intention of captcha is to avoid bots. Probably it’s necessary  use another kind of tool, for example, script python, AI vision, or even a service to resolve captchas (https://2captcha.com/, and so)

Team, let us know any feedback.

Regards.


Zaibi
Forum|alt.badge.img+6
  • Navigator | Tier 3
  • 128 replies
  • Answer
  • December 14, 2022

Dear @srinurockz96,

 

If you are using Automation Anywhere to automate a website that has a text-based captcha, you can use the OCR (optical character recognition) feature of Automation Anywhere to extract the text from the captcha image. Once the text has been extracted, you can use a "Type into" action to enter the text into the appropriate field on the website, and then use a "Click" action to submit the form.

Here is an example of how you might accomplish this:

  1. Use the "Capture Text" action to capture the text from the captcha image.

  2. Use the "Extract Data" action to extract the text from the captcha image and store it in a variable.

  3. Use the "Type into" action to enter the text from the variable into the appropriate field on the website.

  4. Use the "Click" action to submit the form.

It's important to note that text-based captchas can be difficult for OCR systems to accurately recognize, so you may need to experiment with different settings and techniques to get the best results. Additionally, some websites may use different types of captchas, such as images with distorted text or mathematical equations, which will require different approaches to solve.

 

Regards,


Padmakumar
Forum|alt.badge.img+13
  • Navigator | Tier 3
  • 785 replies
  • December 15, 2022

Hi @srinurockz96 ,

 

I've seen some ways of solving CAPTCHAs with Python + ML and training it against images etc and for certain ones it can work very well (usually ones with obfuscated text)

 

Another alternative I've seen is an API which uses human interaction to solve them for you

 

https://2captcha.com/

 

Never used it and it comes at a cost, but an interesting idea!. 

 

Apart from all, 

Captchas can be automated. but it is not at all recommended. Because Captchas are the way to avoid automation. Here you can go for attended automation. let the user enter the captcha.

 

There are legal issues as well when we try to automate captchas. Be aware of it. 

 

Attended automation will be a good option for you.


Raul Jaimes
Forum|alt.badge.img+9
  • Navigator | Tier 3
  • 494 replies
  • December 15, 2022
Padmakumar wrote:

Hi @srinurockz96 ,

 

I've seen some ways of solving CAPTCHAs with Python + ML and training it against images etc and for certain ones it can work very well (usually ones with obfuscated text)

 

Another alternative I've seen is an API which uses human interaction to solve them for you

 

https://2captcha.com/

 

Never used it and it comes at a cost, but an interesting idea!. 

 

Apart from all, 

Captchas can be automated. but it is not at all recommended. Because Captchas are the way to avoid automation. Here you can go for attended automation. let the user enter the captcha.

 

There are legal issues as well when we try to automate captchas. be aware of it. 

 

Attended automation will be a good option for you.

 

Right!  Should be a good reason to try solving captchas. 


Forum|alt.badge.img+2

I don’t think automating Captchas are a good practice. As what @Padmakumar has mentioned, captchas are deployed to avoid automations.

 

Also, OCR is not a good option to automate captchas as it lacks accuracy especially because captchas can be in different format or style.


Forum|alt.badge.img+2
  • Cadet | Tier 2
  • 5 replies
  • June 26, 2023
Zaibi wrote:

Dear @srinurockz96,

 

If you are using Automation Anywhere to automate a website that has a text-based captcha, you can use the OCR (optical character recognition) feature of Automation Anywhere to extract the text from the captcha image. Once the text has been extracted, you can use a "Type into" action to enter the text into the appropriate field on the website, and then use a "Click" action to submit the form.

Here is an example of how you might accomplish this:

  1. Use the "Capture Text" action to capture the text from the captcha image.

  2. Use the "Extract Data" action to extract the text from the captcha image and store it in a variable.

  3. Use the "Type into" action to enter the text from the variable into the appropriate field on the website.

  4. Use the "Click" action to submit the form.

It's important to note that text-based captchas can be difficult for OCR systems to accurately recognize, so you may need to experiment with different settings and techniques to get the best results. Additionally, some websites may use different types of captchas, such as images with distorted text or mathematical equations, which will require different approaches to solve.

 

Regards,

Hi Zaibi, 

"Capture Text" is “Capture text of window”? I can’t find "Capture Text" in action list of RPA 


Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings