OCR with a neural network: data extraction
Posted by Sebastian Hoitz on Stack Overflow, 2010-03-20.
I'm using the AForge.NET framework and its neural network classes.
At the moment, when I train my network, I create lots of images (one image per letter per font) at a large size (30 pt), cut out the actual letter, scale it down to a smaller size (10x10 px) and save it to my hard disk. I can then read all of those images back in and build my double[] arrays from them. At the moment I do this on a per-pixel basis, i.e. one input value per pixel of the scaled-down letter.
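To make the per-pixel part concrete, this is roughly what my extraction step looks like (the 10x10 size and the brightness mapping are just my current choices, not anything AForge requires):

    using System.Drawing;

    static class FeatureExtractor
    {
        // Turn a pre-scaled 10x10 glyph into a flat input vector, one value
        // per pixel. Centering the values around 0 (ink ~ 0.5, paper ~ -0.5)
        // tends to play nicer with sigmoid activations than raw 0..1.
        public static double[] ToInputVector(Bitmap glyph)
        {
            double[] input = new double[glyph.Width * glyph.Height];
            for (int y = 0; y < glyph.Height; y++)
            {
                for (int x = 0; x < glyph.Width; x++)
                {
                    Color c = glyph.GetPixel(x, y);
                    double brightness = (c.R + c.G + c.B) / (3.0 * 255.0);
                    input[y * glyph.Width + x] = 0.5 - brightness;
                }
            }
            return input;
        }
    }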
Once I have successfully trained my network, I test it by letting it run on a sample image containing the alphabet (uppercase and lowercase) at different sizes.
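Per segmented letter, the test loop then does essentially this (assuming one output neuron per character, with alphabet mapping output indices back to characters):

    using AForge.Neuro;

    static class Recognizer
    {
        // Feed one scaled-down glyph through the trained network and pick
        // the output neuron with the highest activation as the winner.
        public static char Classify(ActivationNetwork network, double[] input, string alphabet)
        {
            double[] output = network.Compute(input);
            int best = 0;
            for (int i = 1; i < output.Length; i++)
                if (output[i] > output[best])
                    best = i;
            return alphabet[best];
        }
    }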
But the results are not really promising. I trained the network until RunEpoch reported an error of about 1.5 (so almost no error), but there are still letters in my test image that do not get identified correctly.
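For reference, my training setup is essentially the following (the layer sizes, the learning rate and the 1.5 threshold are my values, not recommendations):

    using AForge.Neuro;
    using AForge.Neuro.Learning;

    static class Trainer
    {
        // inputs:  one 100-element pixel vector per sample (see above)
        // outputs: one-hot target vectors, one element per letter class
        public static ActivationNetwork Train(double[][] inputs, double[][] outputs, int classCount)
        {
            // 100 pixel inputs -> 50 hidden neurons -> one output per letter
            var network = new ActivationNetwork(new SigmoidFunction(2), 100, 50, classCount);
            var teacher = new BackPropagationLearning(network)
            {
                LearningRate = 0.1,
                Momentum = 0.0
            };

            double error;
            do
            {
                error = teacher.RunEpoch(inputs, outputs); // summary error for the epoch
            } while (error > 1.5);

            return network;
        }
    }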
Now my question is: is this caused by a faulty learning method? Pixel-based input is what I use now, whereas this article suggests using receptors instead: http://www.codeproject.com/KB/cs/neural_network_ocr.aspx. Are there other methods I can use to extract the data for the network? Or can this happen because my segmentation algorithm, which extracts the letters from the image, is bad? (My understanding of the receptor idea is sketched below.)
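For what it's worth, my reading of the receptor idea from that article is something like this (a simplified sketch, not the article's actual code): each receptor is a line segment laid over the glyph, its feature value says whether the letter's ink crosses it, and the network input becomes one value per receptor instead of one per pixel.

    using System;
    using System.Drawing;

    class Receptor
    {
        public int X1, Y1, X2, Y2; // segment endpoints, in glyph coordinates

        // Walk the segment pixel by pixel and report 1.0 as soon as it
        // crosses dark ink, 0.0 otherwise.
        public double Sense(Bitmap glyph)
        {
            int steps = Math.Max(Math.Abs(X2 - X1), Math.Abs(Y2 - Y1));
            for (int i = 0; i <= steps; i++)
            {
                int x = X1 + (X2 - X1) * i / Math.Max(steps, 1);
                int y = Y1 + (Y2 - Y1) * i / Math.Max(steps, 1);
                Color c = glyph.GetPixel(x, y);
                if ((c.R + c.G + c.B) / 3 < 128)
                    return 1.0;
            }
            return 0.0;
        }
    }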
Does anyone have ideas on how to improve it?