tesseract

Tesseract Open Source OCR Engine (main repository)

APACHE-2.0 License

Stars
61.3K
Committers
212

Bot releases are hidden (Show)

tesseract - 5.0.0-beta-20210815

Published by stweil about 3 years ago

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Modernize more code
  • More options for binarization
  • Improved support for ARM NEON
  • No longer depends on Abseil for unit tests
  • Support float for model training and text recognition (faster, requires less RAM)

See also list of all changes.

tesseract - 5.0.0-alpha-20210401

Published by stweil over 3 years ago

This is a new pre-release of Tesseract 5.0.0.

  • Replaced all remaining STRING by std::string
  • Replaced lots of GenericVector by std::vector
  • Replaced all malloc / free by C++ code
  • Modernized and formatted code

See also list of all changes.

tesseract - 5.0.0-alpha-20201231

Published by stweil almost 4 years ago

This is a new pre-release of Tesseract 5.0.0.

It has massive changes in the public API which is a great step towards a final 5.0.0.
All unit tests pass, but because of those changes more practical experience is needed.

  • the public API no longer uses proprietary data types GenericVector, STRING
  • pdf.ttf is no longer needed because it is now integrated into the code

See also list of all changes.

tesseract - 5.0.0-alpha-20201224

Published by stweil almost 4 years ago

This is a new pre-release of Tesseract 5.0.0.

It is considered to be production ready for end users, but nevertheless not stable because more incompatible API changes are planned.

  • improved performance (also on ARM / ARM64)
  • improved unit tests
  • many fixes
  • faster flat build with automake
  • support for latest macOS (including new M1 processor)

See also list of all changes.

tesseract - 4.1.1 Release

Published by zdenop almost 5 years ago

  • Implemented sw build (cppan is deprecated)
  • Improved cmake build
  • Code cleanup and optimization
  • A lot of bug fixes...
tesseract - 4.1.0 Release

Published by zdenop over 5 years ago

  • Added new renderers Alto, LSTMBox, WordStrBox.
  • Added character boxes in hOCR output.
  • Added python training scripts (experimental) as alternative shell scripts.
  • Better support AVX / AVX2 / SSE.
  • Disable OpenMP support by default (see e.g. #1171, #1081).
  • Fix for bounding box problem.
  • Implemented support for whitelist/blacklist in LSTM engine.
  • Improved cmake configuration.
  • Code modernization and improvements.
  • A lot of bug fixes...

Detailed changelog is on wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

tesseract - 4.0.0 Release

Published by zdenop almost 6 years ago

Detailed Release notes, Changelog and documentation can be found in project wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

tesseract - 3.05.02 Release

Published by zdenop over 6 years ago

Bug fix release

tesseract - 3.05.01 Release

Published by zdenop over 7 years ago

Bug fix release

tesseract - 3.05.00 Release

Published by zdenop over 7 years ago

  • Made some fine tuning to the hOCR output.
    • Added TSV as another optional output format.
    • Fixed ABI break introduced in 3.04.00 with the AnalyseLayout() method.
    • text2image tool - Enable all OpenType ligatures available in a font. This feature requires Pango 1.38 or newer.
    • Training tools - Replaced asserts with tprintf() and exit(1).
    • Fixed Cygwin compatibility.
    • Improved multipage tiff processing.
    • Improved the embedded pdf font (pdf.ttf).
    • Enable selection of OCR engine mode from command line.
    • Changed tesseract command line parameter '-psm' to '--psm'.
    • Added new C API for orientation and script detection, removed the old one.
    • Increased minimum autoconf version to 2.59.
    • Removed dead code.
    • Fixed many compiler warning.
    • Fixed memory and resource leaks.
    • Fixed some issues with the 'Cube' OCR engine.
    • Fixed some openCL issues.
    • Added option to build Tesseract with CMake build system.
    • Implemented CPPAN support for easy Windows building.
tesseract - 3.04.01 release

Published by zdenop over 8 years ago

bug-fix release of 3.04 version

tesseract - 3.04.00 release

Published by zdenop about 9 years ago

  • Added OpenCL support (experimental)
  • Many bug fixes
Package Rankings
Top 0.19% on Alpine-v3.17
Top 3.73% on Alpine-edge
Top 0.07% on Alpine-v3.7
Top 0.06% on Alpine-v3.18
Top 0.09% on Alpine-v3.14
Top 6.42% on Alpine-v3.15
Top 4.9% on Alpine-v3.13
Top 5.39% on Alpine-v3.12
Top 23.32% on Anaconda.org
Top 6.86% on Alpine-v3.16
Top 5.81% on Alpine-v3.9
Top 0.18% on Alpine-v3.8
Top 7.41% on Alpine-v3.10
Top 0.1% on Alpine-v3.11
Badges
Extracted from project README
Build status Build status Coverity Scan Build Status CodeQL OSS-Fuzz GitHub license Downloads