files_fulltextsearch_tesseract

OCR your documents before index

AGPL-3.0 License

Stars

30

View Code on GitHub View on X

Ecosystems: Nextcloud

files_fulltextsearch_tesseract

OCR your documents before index

Installation / Setup

install Tesseract
download language files from: https://github.com/tesseract-ocr/tessdata
copy language files into /usr/share/tessdata/ (or /usr/share/tesseract-ocr/tessdata/, depends on our distribution)
configure this app in the Full text search Admin panel
report bugs

more

devblog about PDF and OCR: https://daita.github.io/files-fulltextsearch-tesseract-ocr-pdf/

Related Projects

files_fulltextsearch

🔍 Index the content of your files

assistant

✨ Nextcloud Assistant

documentserver_community

Document server for onlyoffice

30 Aug 2019 126

files_pdfviewer

A PDF viewer for Nextcloud

fulltextsearch

🔍 Core of the full-text search framework for Nextcloud

26 Aug 2016 213

collectives

Collectives is a Nextcloud App for activist and community projects to organize together.

translate

A Machine translation provider using Opus models by University of Helsinki running locally on CPU

fulltextsearch_elasticsearch

🔍 Use Elasticsearch to index the content of your Nextcloud

integration_openproject

Integration of OpenProject project manager in Nextcloud

integration_deepl

news

RSS/Atom feed reader

02 Jun 2016 860

mail

💌 Mail app for Nextcloud

28 Aug 2016 830

recognize

👁 👂 Smart media tagging for Nextcloud: recognizes faces, objects, landscapes, music genres

22 Apr 2021 543

bookmarks_fulltextsearch

🔍 Indexing bookmarks

text2image_stablediffusion