docconv

Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text

MIT License

Stars
1.6K
Committers
22

Bot releases are visible (Hide)

docconv - v2.0.0-pre.3 Latest Release

Published by jonathaningram 11 months ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v2.0.0-pre.2...v2.0.0-pre.3

docconv - v2.0.0-pre.2

Published by jonathaningram 11 months ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v2.0.0-pre.1...v2.0.0-pre.2

docconv - v2.0.0-pre.1

Published by jonathaningram 11 months ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.8...v2.0.0-pre.1

docconv - v1.3.8

Published by jonathaningram 12 months ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.7...v1.3.8

docconv - v1.3.7

Published by jonathaningram about 1 year ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.6...v1.3.7

docconv - v1.3.6

Published by jonathaningram about 1 year ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.5...v1.3.6

docconv - v1.3.5

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.4...v1.3.5

docconv - v1.3.4

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.3...v1.3.4

docconv - v1.3.3

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.2...v1.3.3

docconv - v1.3.2

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.1...v1.3.2

docconv - v1.3.1

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.3.0...v1.3.1

docconv - v1.3.0

Published by jonathaningram about 2 years ago

What's Changed

Full Changelog: https://github.com/sajari/docconv/compare/v1.2.1...v1.3.0

docconv - v1.2.1

Published by jkaho over 2 years ago

Vulnerabilities fixed:

  • deprecated protobuf dependency (#107)
  • msoleps stack overflow bug (dependency updated) (#108)
  • remote code execution vulnerability in the PDF OCR converter (#110)
  • unbounded memory consumption when reading files from ZIP archive (#111)

Other:

  • improvements to client error messages (#113)
docconv - v1.2.0

Published by jsok about 3 years ago

Bug fixes and improvements:

  • docd: refactor Dockerfile and publish to DockerHub (#101)
  • Updated dependency for poppler and removed bash arg check (#100)
  • doc: improve metadata parsing so that titles can be reliably extracted (#99)
  • add test for TestConvertHTML (#93)
  • actions: stop building for Go 1.13
  • rtf: don't ignore lines less than 5 characters long (#91)
  • pptx_test: check returned error before deferring f.Close()
  • add note about ignored error check
  • docd: remove unused convertPath function
  • remove path separator from ioutil.TempFile prefix
  • add Sourcegraph badge to README
  • support PowerPoint files in Convert functions
  • ocr: update gosseract to v2
  • pptx: add support for MS PowerPoint files (#71)
  • avoid double copy on tidy
  • Update tidy pkg to use NewLocalFile func
  • support windows temp directories
  • add go mod support
  • pdf: add extra time layout for pdfs
  • try to avoid loading file data into slices
  • Fixes
  • Get docx contents reading [Content_Types].xml to get correct file names
docconv - Bug fixes and improvements

Published by mish15 over 5 years ago