pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
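
For readers new to the algorithm, here is a minimal pure-Python sketch of the general TextRank idea: link words that co-occur within a small window, then score them with a PageRank-style iteration. It is an illustration only, not pytextrank's implementation (which runs as a spaCy extension). The function name `rank_words` and its parameters are assumptions made for this example.

```python
from collections import defaultdict


def rank_words(tokens, window=2, damping=0.85, iterations=50):
    """Score tokens with a PageRank-style iteration over a co-occurrence graph."""
    # Link each token to the next (window - 1) tokens; edges are undirected.
    neighbors = defaultdict(set)
    for i, token in enumerate(tokens):
        for other in tokens[i + 1 : i + window]:
            if other != token:
                neighbors[token].add(other)
                neighbors[other].add(token)

    # Start from a uniform distribution, then repeatedly spread each node's
    # score evenly across its neighbors.
    n = len(neighbors)
    ranks = {token: 1.0 / n for token in neighbors}
    for _ in range(iterations):
        ranks = {
            token: (1.0 - damping) / n
            + damping * sum(ranks[nb] / len(neighbors[nb]) for nb in neighbors[token])
            for token in neighbors
        }

    # Highest-scoring tokens first.
    return sorted(ranks.items(), key=lambda item: item[1], reverse=True)


words = "graph rank graph node graph edge".split()
ranked = rank_words(words)
```

In this toy input, `graph` ranks first because it co-occurs with every other word.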

MIT License

Downloads: 62.3K
Stars: 2.1K
Committers: 19

pytextrank - v3.3.0 on PyPi (latest release)

Published by ceteri 8 months ago

  • updated build (for PEP 621, etc.)
  • using nx.DiGraph instead
  • new release procedures

Full Changelog: https://github.com/DerwenAI/pytextrank/compare/v3.2.5...v3.3.0

pytextrank - v3.2.5 release on PyPi

Published by ceteri about 1 year ago

2023-08-07

  • improved scrubber; kudos @0dB @Ankush-Chander
  • updated dependencies, responding to vuln scanner

Full Changelog: https://github.com/DerwenAI/pytextrank/compare/v3.2.4...v3.2.5

pytextrank - v3.2.4 release on PyPi

Published by ceteri about 2 years ago

2022-07-27

  • better support for "ru" and other languages without noun_chunks support in spaCy
  • updated example notebook to illustrate TopicRank algorithm
  • made the node bias setting case-independent for Biased Textrank algorithm; kudos @Ankush-Chander
  • updated summarization tests; kudos @tomaarsen
  • reworked some unit tests to be less brittle, less dependent on specific spaCy point releases
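
The case-independent node bias noted above can be pictured with a small sketch. Assuming the bias is expressed as a per-node weight for a personalized PageRank, matching focus terms in lowercase gives `Graph` and `graph` the same weight. The helper `bias_weights` and its parameters are hypothetical and are not pytextrank's API.

```python
def bias_weights(nodes, focus, bias=1.0, default_bias=0.1):
    """Return a weight per node, boosting nodes that match a focus term."""
    # Compare in lowercase so that matching ignores case.
    focus_terms = {term.lower() for term in focus.split()}
    return {
        node: bias if node.lower() in focus_terms else default_bias
        for node in nodes
    }


weights = bias_weights(["Graph", "node", "edge"], focus="graph EDGE")
```

Both `Graph` and `edge` receive the boosted weight here even though their casing differs from the focus string.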

Full Changelog: https://github.com/DerwenAI/pytextrank/compare/v3.2.3...v3.2.4

pytextrank - v3.2.3 release on PyPi

Published by ceteri over 2 years ago

2022-03-06

  • handles missing noun_chunks in some language models (e.g., "ru") #204
  • add TopicRank algorithm; kudos @tomaarsen
  • improved test suite; fixed tests for newer spacy releases; kudos @tomaarsen

Full Changelog: https://github.com/DerwenAI/pytextrank/compare/v3.2.2...v3.2.3

pytextrank - v3.2.2 release on PyPi

Published by ceteri about 3 years ago

Full Changelog: https://github.com/DerwenAI/pytextrank/compare/v3.2.1...v3.2.2

pytextrank - v3.2.1 release on PyPi

Published by ceteri about 3 years ago

2021-07-24

  • add "paragraph" option into summary() function; kudos @CaptXiong

pytextrank - v3.2.0 release on PyPi

Published by ceteri over 3 years ago

2021-07-17

Various updates to support spaCy 3.1.x, which changes some interfaces.

  • NB: THE SCRUBBER UPDATE WILL BREAK PREVIOUS RELEASES
  • allow Span as scrubber argument, to align with spaCy 3.1.x; kudos @Ankush-Chander
  • add lgtm code reviews (slow, not integrating into GitHub PRs directly)
  • evaluating grayskull to generate a conda-forge recipe
  • add use of pipdeptree to analyze dependencies
  • use KG from biblio.ttl to generate bibliography
  • fixed overlooked comment from earlier code; kudos @debraj135
  • add visualisation using altair; kudos @louisguitton
  • add scrubber usage in sample notebook; kudos @Ankush-Chander
  • integrating use of MkRefs to generate semantic reference pages in docs

pytextrank - v3.1.1 release on PyPi

Published by ceteri over 3 years ago

2021-03-25

  • fix the span length calculation in explanation notebook; kudos @Ankush-Chander
  • add BiasedTextRank by @Ankush-Chander (many thanks!)
  • add conda environment.yml plus instructions
  • use bandit to check for security issues
  • use codespell to check for spelling errors
  • add pre-commit checks in general
  • update doc._.phrases in the call to change_focus() so the summarization will sync with the latest focus

pytextrank - v3.1.0 release on PyPi

Published by ceteri over 3 years ago

2021-03-12

  • rename master branch to main
  • add a factory class that assigns each doc its own Textrank object; kudos @Ankush-Chander
  • refactor the stopwords feature as a constructor argument
  • add get_unit_vector() method to expose the characteristic unit vector
  • add calc_sent_dist() method to expose the sentence distance measures (for summarization)
  • include a unit test for summarization
  • updated contributor instructions
  • pylint coverage for code checking
  • linking definitions and citations in source code apidocs to our online docs
  • updated links on PyPi

pytextrank - v3.0.1 release on PyPi

Published by ceteri over 3 years ago

2021-02-27

pytextrank - v3.0.0 release on PyPi

Published by ceteri over 3 years ago

2021-02-14

  • THIS WILL BREAK THINGS!!!
  • support for spaCy 3.0.x; kudos @Lord-V15
  • full integration of PositionRank
  • migrated all unit tests to pytest
  • removed use of logger for debugging, introducing icecream instead

pytextrank - v2.1.0 on PyPi

Published by ceteri over 3 years ago

2021-01-31

  • add PositionRank by @louisguitton (many thanks!)
  • fixes chunk in explain_summ.ipynb by @anna-droid-beep
  • add option preserve_order in TextRank.summary by @kavorite
  • tested with spaCy 2.3.5

pytextrank - v2.0.3 on PyPi

Published by ceteri about 4 years ago

2020-09-15

  • try-catch ZeroDivisionError in summary method -- kudos @shyamcody
  • tested with updated dependencies: spaCy 2.3.x and NetworkX 2.5

pytextrank - v2.0.2 release on PyPi

Published by ceteri over 4 years ago

2020-05-20

  • fixed default value of ._.phrases to allow for disabling PTR in a pipeline

pytextrank - v2.0.1 release on PyPi

Published by ceteri over 4 years ago

2020-03-02

  • fix KeyError issue for pre Python 3.6
  • integrated codecov.io
  • added PyTextRank to the spaCy uniVerse
  • fixed README.md instructions to download en_core_web_sm

pytextrank - v2.0.0 release on PyPi

Published by ceteri almost 5 years ago

  • refactored library to run as a spaCy extension
  • supports multiple languages
  • significantly faster, with less memory required
  • better extraction of top-ranked phrases
  • changed license to MIT
  • uses lemma-based stopwords for more precise control
  • WIP toward integration with knowledge graph use cases

pytextrank - v1.2.1 release on PyPi

Published by ceteri almost 5 years ago

  • fixed error in installation instructions

pytextrank - v1.2.0

Published by ceteri almost 5 years ago

  • updated for current versions of spaCy and NetworkX -- kudos @dimmu
  • removed deprecated argument -- kudos @laxatives

pytextrank - v1.1.1 update

Published by ceteri about 7 years ago

Patch disables use of NER in spaCy until an intermittent bug is resolved.
Will probably replace named tuples with spaCy spans instead.

pytextrank - v1.1.0 release on PyPi

Published by ceteri over 7 years ago

Replaced TextBlob usage with spaCy for improved parsing results. Updated the other Python dependencies.
Also added better handling for UTF-8.

Package Rankings
Top 2.13% on Pypi.org