I wanted to build a desktop program so that users can run the automatic submission easily. I chose the .NET Framework as the platform.
As you can see from the image above, a new captcha image is generated every time the page is accessed. The natural solution is to use an OCR library. In my experience, Tesseract is one of the best options available. However, it is difficult to use directly from .NET because it is written in C/C++.
So is there a convenient wrapper that provides more direct access to the library? Yes, there are several. I am not going to go through all of them, only the one that I tested and found working:
The GitHub project lists these steps:
1. Add the Tesseract NuGet Package by running Install-Package Tesseract from the Package Manager Console.
2. Ensure you have Visual Studio 2012 x86 & x64 runtimes installed (see note above).
3. Download language data files for tesseract 3.02 from tesseract-ocr and add them to your project, ensure 'Copy to output directory' is set to Always.
4. Check out the Samples solution ~/Samples/Tesseract.Samples.sln for a working example
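Once the steps above are done, reading a captcha comes down to a few calls. The following is only a minimal sketch, assuming the NuGet package exposes the `TesseractEngine`, `Pix` and `Page` types as in the wrapper's samples, that the English language data (`eng.traineddata`) sits in the folder passed as `tessdataPath`, and that the captcha has already been saved to disk. The `CaptchaOcr` class and its `CleanText` helper are hypothetical names made up for this illustration, not part of the library.

```csharp
using System;
using System.Linq;
using Tesseract;

public static class CaptchaOcr
{
    // Hypothetical helper: OCR output usually carries whitespace and stray
    // punctuation, so keep only letters and digits.
    public static string CleanText(string raw)
    {
        return new string((raw ?? string.Empty)
            .Where(c => char.IsLetterOrDigit(c))
            .ToArray());
    }

    // Runs OCR on a single image file and returns the cleaned text.
    // Assumption: tessdataPath contains the "eng" language data file.
    public static string Recognize(string imagePath, string tessdataPath)
    {
        using (var engine = new TesseractEngine(tessdataPath, "eng", EngineMode.Default))
        using (var image = Pix.LoadFromFile(imagePath))
        using (var page = engine.Process(image))
        {
            // Mean confidence is a value between 0 and 1; a low value
            // suggests the captcha should be retried.
            Console.WriteLine("Mean confidence: {0:P0}", page.GetMeanConfidence());
            return CleanText(page.GetText());
        }
    }
}
```

A call such as `CaptchaOcr.Recognize("captcha.png", "./tessdata")` would then return the recognized characters, where both paths are placeholders. Captcha images are deliberately noisy, so the result may well be wrong; checking the confidence value before submitting is probably worthwhile.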