A Wikipedia text crawler to create stopword lists for any language in the world.
MIT License
A module for creating stopword lists for any language, based on a set of documents.
Tokenizes strings of text. Uses regular expressions to extract arrays of words and, optionally, numbers, emojis, tags...
Sami stopword lists for natural language processing. Example uses include search engines, mac...
An open-source Spelling Bee game for the web.
A module for Node.js and the browser that takes in text and strips it of stopwords.
Simple document and query processor that makes search running in the browser and Node.js a little...
modest natural-language processing
Crawler for NRK Sapmi news bulletins that will be the basis for Sami stopword lists and an exampl...
Fast and extensible Node.js/JavaScript full-text search engine.
Determines the most relevant keywords from an article headline combined with some article text. F...
A collection of tools, APIs and other resources to use in creative coding web projects.
A CJK text tokenizer