danbopes/tesseract

A .NET wrapper for tesseract-ocr.

Warning - Prerelease software (alpha)

This is currently prerelease software; as such, the public API is subject to change.

Dependencies

Visual Studio 2008 SP1 x86 Runtime

Since the tesseract and leptonica binaries are compiled with Visual Studio 2008 SP1, you'll need to ensure that the Visual Studio 2008 SP1 Runtime is installed. This can be found here.

Getting started quickly

Note: Compiling the project requires at least MS Visual Studio 11 Express for Desktop or SharpDevelop 4.4.

  1. Fork this project (see: https://help.github.com/articles/fork-a-repo)
  2. Ensure you have Visual Studio 2008 SP1 x86 runtime installed (see note above).
  3. Download language data files for tesseract 3.02 from http://code.google.com/p/tesseract-ocr/
  4. Build BaseApiTester project
  5. Copy language files into BaseApiTester\bin\[config]\tessdata
  6. Run BaseApiTester to see an example (work in progress)
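
Step 5 can be scripted if you switch build configurations often. The following is only a minimal sketch, not part of this project: the helper name `copy_language_data` is hypothetical, and it assumes that `[config]` is a build configuration name such as `Debug` or `Release` and that the downloaded language data has already been extracted into a single folder.

```python
import shutil
from pathlib import Path


def copy_language_data(src_dir, project_dir, config="Debug"):
    """Copy every file in src_dir into <project_dir>/bin/<config>/tessdata.

    src_dir     -- folder holding the extracted language data files (assumed layout)
    project_dir -- path to the BaseApiTester project folder
    config      -- build configuration name; "Debug" is an assumption
    Returns the sorted list of file names that were copied.
    """
    src = Path(src_dir)
    dest = Path(project_dir) / "bin" / config / "tessdata"
    # Create the tessdata folder if the build has not created it already.
    dest.mkdir(parents=True, exist_ok=True)

    copied = []
    for item in sorted(src.iterdir()):
        if item.is_file():  # subfolders are skipped
            shutil.copy2(item, dest / item.name)
            copied.append(item.name)
    return copied
```

For example, `copy_language_data("downloads/tessdata", "BaseApiTester", "Release")` would fill `BaseApiTester/bin/Release/tessdata`; both paths here are placeholders.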

To-do

Please help yourselves to one of the following tasks (or create a new one). Please leave a comment in the corresponding task to avoid duplication of work.

| Task Num | Task | Status |
|---|---|---|
| | Tesseract - Core APIs | Active |
| | Tesseract - Regression test infrastructure | Pending |
| | Leptonica - Scanned image preparation functions | Pending |
| | Tesseract - API Cleanup and improvements | Pending |
| | Tesseract - Create sample app | Pending |
| | Tesseract - Dictionary lookup [not sure about this] | Pending |

License

Copyright 2012 Charles Weld.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License. You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
