tesseract

Tesseract Open Source OCR Engine (main repository)

APACHE-2.0 License

Stars
61.3K
Committers
212

Bot releases are hidden (Show)

tesseract - 5.4.1 Latest Release

Published by stweil 4 months ago

What's Changed

This release fixes a regression with legacy or mixed models (issue #4257).

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.4.0...5.4.1

tesseract - 5.4.0

Published by stweil 4 months ago

What's Changed

This releases provides an improved PDF renderer, adds a new PAGE XML renderer, extends the API to retrieve the text angle/gradient and has lots of smaller updates for code and documentation:

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.4...5.4.0

tesseract - 5.4.0-rc1

Published by stweil 5 months ago

What's Changed

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.4.0-rc1...5.4.0-rc2

tesseract - 5.4.0-rc1

Published by stweil 5 months ago

What's Changed

This releases provides an improved PDF renderer, adds a new PAGE XML renderer, extends the API to retrieve the text angle/gradient and has lots of smaller updates for code and documentation:

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.4...5.4.0-rc1

tesseract - 5.3.4

Published by stweil 9 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.3...5.3.4

tesseract - 5.3.3

Published by stweil about 1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.2...5.3.3

tesseract - 5.3.2

Published by stweil over 1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.1...5.3.2

tesseract - 5.3.1

Published by stweil over 1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.0...5.3.1

tesseract - 5.3.0

Published by stweil almost 2 years ago

This is a new minor version of Tesseract 5.

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.2.0...5.3.0

tesseract - 5.3.0-rc1

Published by stweil almost 2 years ago

What's Changed

New Contributors

Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.2.0...5.3.0-rc1

tesseract - 5.2.0

Published by stweil over 2 years ago

This is a new minor version of Tesseract 5.

  • Improvements and fixes for continuous integration,
    autoconf and cmake builds.
  • Set /Os for some 32 bit MS compilers (fixes #3769).
  • Improve comments and other documentation.
  • Add initial support for Intel AVX512F.
  • Fix for very large PDF files on 32 bit hosts (fixes #3805).
  • Fix NEON detection on FreeBSD.
  • Fix regression with UZN files (fixes #3837).
  • Fix calling delete[] for memory allocated by malloc in C API.
  • Add an API function to init tesseract with traineddata from memory
    (fixes #3691).
  • Replace direct access to Leptonica internal data structures by
    function calls and support latest releases of Leptonica.
  • Replace std::regex by std::string functions (fixes issue #3830).
  • Use compiled-in TESSDATA_PREFIX also on Windows (fixes #3767).
  • Add new parameter 'invert_threshold', change the default threshold
    from 0.5 to 0.7 and mark parameter 'tessedit_do_invert' as deprecated.

See also list of all changes.

tesseract - 5.1.0

Published by stweil over 2 years ago

This is a new minor version of Tesseract 5.

  • Handle image and line regions in output formats ALTO, hOCR and text.
  • New parameter curl_timeout for curl_easy_setop.
  • Build fixes and improvements.
  • Catch nullptr in PageIterator::Orientation to improve robustness.
  • Remove unused code.

See also list of all changes.

tesseract - 5.0.1

Published by stweil almost 3 years ago

This is a bug fix release of Tesseract 5.0.

  • Add SPDX-License-Identifier to public include files.
  • Support redirections when running OCR on a URL.
  • Lots of fixes and improvements for cmake builds.
    Distributions should use the autoconf build.
  • Fix broken msys2 build with gcc 11.
  • Fix parameter certainty_scale (was duplicated).
  • Fix some compiler warnings and clean code.
  • Correctly detect amd64 and i386 on FreeBSD.
  • Add libarchive and libcurl in continuous integration actions.
  • Update submodule googletest to release v1.11.0.

See also list of all changes.

tesseract - 5.0.0

Published by stweil almost 3 years ago

This is the final stable release of Tesseract 5.0.0.

  • Limit BCER to interval [0,1]
  • Improved build process
  • Cleaned code

See also list of all changes.

tesseract - 5.0.0-rc3

Published by stweil almost 3 years ago

This is the third release candidate of Tesseract 5.0.0.

  • Improve training messages
  • Add RowAttributes getter to PageIterator

See also list of all changes.

tesseract - 4.1.3

Published by stweil almost 3 years ago

This is a new stable release of Tesseract 4.1.

  • Fix broken autoconf build (issue #3642)

See also list of all changes.

tesseract - 4.1.2

Published by stweil almost 3 years ago

This is a new stable release of Tesseract 4.1.

Note: The autoconf build is broken (see issue #3642), so please use 4.1.3.

  • Allow line images with larger width for training
  • Bug fixes
  • Build updates and fixes

See also list of all changes.

tesseract - 5.0.0-rc2

Published by stweil almost 3 years ago

This is the second release candidate of Tesseract 5.0.0.

  • Fix regression for OCR with more than one model file
  • Bug fixes
  • Optimizations

See also list of all changes.

tesseract - 5.0.0-rc1

Published by stweil almost 3 years ago

This is the first release candidate of Tesseract 5.0.0.

  • Enable fast float32 LSTM by default
  • Switch to NFC normalisation everywhere
  • Remove banner message
  • Disable music staff detection and removal
  • Add new command line option --loglevel
  • Bug fixes

See also list of all changes.

tesseract - 5.0.0-beta-20210916

Published by stweil about 3 years ago

This is a new pre-release of Tesseract 5.0.0.

  • Bug fixes
  • Extend URI support for Tesseract with libcurl
  • Rename processed TIFF output file and add page number if needed

See also list of all changes.

Package Rankings
Top 0.19% on Alpine-v3.17
Top 3.73% on Alpine-edge
Top 0.07% on Alpine-v3.7
Top 0.06% on Alpine-v3.18
Top 0.09% on Alpine-v3.14
Top 6.42% on Alpine-v3.15
Top 4.9% on Alpine-v3.13
Top 5.39% on Alpine-v3.12
Top 23.32% on Anaconda.org
Top 6.86% on Alpine-v3.16
Top 5.81% on Alpine-v3.9
Top 0.18% on Alpine-v3.8
Top 7.41% on Alpine-v3.10
Top 0.1% on Alpine-v3.11
Badges
Extracted from project README
Build status Build status Coverity Scan Build Status CodeQL OSS-Fuzz GitHub license Downloads