A module for creating stopword lists for any language, based on a set of documents.
MIT License
Bot releases are visible (Hide)
Published by eklem over 2 years ago
Published by eklem over 4 years ago
Security fix
Published by eklem almost 5 years ago
Published by eklem over 5 years ago
Published by eklem over 5 years ago
commander and lodash updated because of security vulnerabilities.
Published by eklem over 5 years ago
Published by eklem about 6 years ago
Updating commander to newest version where some vulnerability bugs were fixed.
Published by eklem about 6 years ago
Will now work with more than just English characters. Tested with Norwegian. Also now adds most used numbers.
Published by eklem over 6 years ago
Misc. Greenkeeper dependencies updates and now testing on version 6,8 and 9 of Node.js
Published by eklem about 7 years ago
Now got the ability to define which fields to calculate stopwords from. This to not add noise comming from fields where words in general have a lower stopwordiness. Client and tests updated too.
Published by eklem about 7 years ago
index.js is now the library part that can be used from any script. Created a small commander-based console client that will take some input.
Published by eklem about 7 years ago
Both stopwords.json
and stopwords-calculation.json
are now written. The first is a subset of the last, only containing the words in an array. Cut-of needs to be done manually, since it contains all found words, not just the top [n] words.
Published by eklem about 7 years ago
Rudimentary calculation in place. Still need to have a stopword-generator to fit the module stopword
.