Paper Topic:

Optical Character Recognition software

The Optical Character Reader (OCR) has traditionally been well known for scanning handwritten documents (preprinted forms such as utility bills filled in with meter readings by human readers) and converting the scanned numbers or text into computer-readable formats through software. OCR is one of the best methods to use when neat handwritten documents need to be captured. SAT tests, electronic bill calculation, and MCQ quizzes are among its applications. This paper, however, examines OCR software by comparing it with other available methods and devices for data capture and evaluating its usefulness against them.

THE RESEARCH

GOCR: Historically, GOCR has not been one of the top performers in this field. With high error rates in character recognition (98 for version 0.4), it is worth a trial run at most. Although subsequent versions fixed those bugs, GOCR has always been less efficient than other OCR software. GOCR works in two modes: reading black text on a white background and reading white text on a black background. The latter was more difficult for the developers to program and still has a high margin of error. Its ability to recognize handwritten characters that deviate a lot from standard shapes is poor. Although much work has gone into later versions to improve this, recognition accuracy is still one of GOCR's biggest issues. GOCR has the highest number of incorrectly recognized characters; for example, I's are wrongly recognized as l's, and v's are recognized as u's. GOCR is useful where the handwriting is exceptionally neat or where the document error rate is not a matter of concern (which, of course, is a rare case).

It should also be noted that GOCR is open-source software, so its code is readily available free of cost. The different versions in circulation are therefore revisions made by different programmers on the basis of their own knowledge. GOCR thus offers a few features that are unique, such as the ability to work with a wide variety of image formats (which is also found in others, but with one or two omissions).
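To make the idea of a recognition error rate concrete, the following is a minimal sketch of how character-level errors, including confusions such as I read as l or v read as u, might be counted. It is an illustration only and not how GOCR itself measures accuracy; the function names and the sample strings are assumptions made up for this example.

```python
from collections import Counter
from difflib import SequenceMatcher


def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: fewest insertions, deletions, substitutions."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(previous[j] + 1,          # drop a reference char
                               current[j - 1] + 1,       # extra char in OCR output
                               previous[j - 1] + cost))  # match or substitution
        previous = current
    return previous[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance divided by the length of the correct (reference) text."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def confusion_pairs(reference: str, hypothesis: str) -> Counter:
    """Count (expected, recognized) pairs for same-length substitutions."""
    counts = Counter()
    matcher = SequenceMatcher(None, reference, hypothesis, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "replace" and (i2 - i1) == (j2 - j1):
            counts.update(zip(reference[i1:i2], hypothesis[j1:j2]))
    return counts


# Hypothetical sample: the true text and what an OCR engine might return.
truth = "Invoice value"
ocr_output = "lnvoice ualue"
rate = character_error_rate(truth, ocr_output)
pairs = confusion_pairs(truth, ocr_output)
```

Here `rate` is the share of characters that had to be corrected, and `pairs` tallies which character was mistaken for which, so a reviewer could see whether I/l and v/u confusions dominate the errors.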

Tesseract OCR: One review of this software said that Tesseract OCR sounds unusable at the current moment, but that the developments made by Google in the subsequent versions leave a promising note for the future.

In short, Tesseract is open-source optical character recognition software that is not considered one of the most efficient software suites. In fact, one of its drawbacks is its command-line user interface, which seems absurd for software that deals with pictures and graphics. However, the software is configured to accept a picture or graphic from hardware and then automatically read it and transform it into text. This OCR software has yet to come to...
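As an illustration of the command-line interface mentioned above, here is a minimal sketch of how a script might call Tesseract on a scanned image. It assumes the `tesseract` program is installed and on the PATH; the helper names and the file names are hypothetical and chosen only for this example.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image_path: str, output_base: str,
                            lang: str = "eng") -> list:
    """Build the argument list: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", image_path, output_base, "-l", lang]


def run_ocr(image_path: str, output_base: str, lang: str = "eng") -> str:
    """Run Tesseract and return the text it wrote to <output_base>.txt.

    Assumes the tesseract executable is installed; raises
    CalledProcessError if the program exits with an error.
    """
    command = build_tesseract_command(image_path, output_base, lang)
    subprocess.run(command, check=True)
    # Tesseract appends ".txt" to the output base name by default.
    return Path(output_base + ".txt").read_text(encoding="utf-8")


# Hypothetical file names, shown without running the program.
command = build_tesseract_command("scanned_bill.png", "scanned_bill")
```

Wrapping the call in a small helper like this is one way a graphical front end or a batch job could sit on top of the command-line tool.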
