wink-nlp

Developer friendly Natural Language Processing ✨

MIT License

Downloads
123K
Stars
1.2K
Committers
7

Bot releases are hidden (Show)

wink-nlp - Enabled more special space characters handling Latest Release

Published by sanjayaksaxena 5 months ago

Version 2.3.0 May 19, 2024

✨ Features

  • Detokenization now restores em/en, third/quarter, thin/hair, medium math space characters & narrow non breaking space characters besides the regular nbsp. 👏 🙌 🛰️
wink-nlp - Improved error handling in contextual vectors

Published by sanjayaksaxena 5 months ago

Version 2.2.2 May 08, 2024

✨ Features

  • .contextualVectors() now throws error if (a) word vectors are not loaded and (b) with lemma: true, "pos" is missing in the NLP pipe. 🤓

🐛 Fixes

  • Refined typescript definitions further. ✅
wink-nlp - Added missing typescript definitions

Published by sanjayaksaxena 6 months ago

Version 2.2.1 May 06, 2024

🐛 Fixes

  • Added missing typescript definitions for word embeddings besides few other typescript fixes. ✅
wink-nlp - Added non-breaking space handling capabilities

Published by sanjayaksaxena 7 months ago

Version 2.2.0 April 03, 2024

✨ Features

  • Detokenization restores both regular and non-breaking spaces to their original positions. 🤓
wink-nlp - Introducing cosine similarity for word vectors

Published by sanjayaksaxena 7 months ago

Version 2.1.0 March 24, 2024

✨ Features

  • You can now use similarity.vector.cosine( vectorA, vectorB ) to compute similarity between two vectors on a scale of 0 to 1. 🤓
wink-nlp - Word embeddings have arrived!

Published by sanjayaksaxena 7 months ago

Version 2.0.0 March 24, 2024

✨ Features

  • Seamless word embedding integration enhances winkNLP's semantic capabilities. 🎉 👏 🙌
  • Pre-trained 100-dimensional word embeddings for over 350,000 English words released: wink-embeddings-sg-100d. 💯
  • API remains unchanged — no code updates needed for existing projects. The new APIs include: 🤩
    • Obtain vector for a token: Use the .vectorOf( token ) API.
    • Compute sentence/document embeddings: Employ the as.vector helper: use .out( its.lemma, as.vector ) on tokens of a sentence or document. You can also use its.value or its.normal. Tokens can be pre-processed to remove stop words etc using the .filter() API. Note, the as.vector helper uses averaging technique.
    • Generate contextual vectors: Leverage the .contextualVectors() method on a document. Useful for pure browser-side applications! Generate custom vectors contextually relevant to your corpus and use them in place of larger pre-trained wink embeddings.
  • Comprehensive documentation along with interesting examples is coming up shortly. Stay tuned for updates! 😎
wink-nlp - Added Deno example

Published by sanjayaksaxena about 1 year ago

Version 1.14.3 July 21, 2023

✨ Features

  • Added a live example for how to run winkNLP on Deno. 👍
wink-nlp - Fixed a bug

Published by sanjayaksaxena over 1 year ago

Version 1.14.2 July 1, 2023

🐛 Fixes

wink-nlp - Squashed a bug

Published by sanjayaksaxena over 1 year ago

Version 1.14.1 June 11, 2023

🐛 Fixes

wink-nlp - Introducing helper for extracting important sentences from a document

Published by sanjayaksaxena over 1 year ago

Version 1.14.0 May 20, 2023

✨ Features

  • You can now use its.sentenceWiseImprotance helper to obtain sentence wise importance (on a scale of 0 to 1) of a document, if it is supported by language model. 📚📊🤓
  • Checkout live example How to visualize key sentences in a document? 👀
wink-nlp - Operational update

Published by sanjayaksaxena over 1 year ago

Version 1.13.1 March 27, 2023

⚙️ Updates

  • Some behind the scene model improvements. 😎 🤓
  • Add clarity on typescript configuration in README. ✅
wink-nlp - Improving mark's functionality in custom entities

Published by sanjayaksaxena almost 2 years ago

Version 1.13.0 December 09, 2022

✨ Features

  • Mark allows marking w.r.t. the last element of the pattern. For example if a pattern matches a fluffy cat then mark: [-2, -1] will extract fluffy cat — especially useful when the match length is unknown. 💃
  • Improved error handling while processing mark's arguments. 🙌
wink-nlp - Operational update

Published by sanjayaksaxena almost 2 years ago

Version 1.12.3 November 18, 2022

⚙️ Updates

  • README is now more informative and links to examples and benchmarks 👍
  • Benchmarked on latest machine, browser versions 🖥
wink-nlp - Ready for Node.js version 18

Published by sanjayaksaxena about 2 years ago

Version 1.12.2 October 13, 2022

🐛 Fixes

  • Fixed incorrect install command in README ✅

⚙️ Updates

  • We have tested winkNLP on Node.js version 18 including its models. 🙌 🎉
wink-nlp - Ready for Node.js version 18

Published by sanjayaksaxena about 2 years ago

Version 1.12.1 October 13, 2022

⚙️ Updates

  • Ready for future — we have tested winkNLP on Node.js version 18 including its models. 🙌 🎉
wink-nlp - Some enhancements plus earned OpenSSF best practices passing badge

Published by sanjayaksaxena over 2 years ago

Version 1.12.0 May 13, 2022

✨ Features

wink-nlp - Enhancing custom entities & BM25Vectorizer

Published by sanjayaksaxena over 2 years ago

Version 1.11.0 January 30, 2022

✨ Features

  • Obtain bag-of-words for a tokenized text from BM25Vectorizer using .bowOf() api — useful for bow based similarity computation. 👍
  • learnCustomEntities() displays a console warning, if a complex short hand pattern is likely to cause learning/execution slow down.🤞❗️
wink-nlp - Enabling loading of BM25Vectorizer model

Published by sanjayaksaxena almost 3 years ago

Version 1.10.0 November 18, 2021

✨ Features

  • Easily load BM25Vectorizer's model using newly introduced .loadModel() api. 🎉
wink-nlp - Enhancing Typescript support

Published by sanjayaksaxena almost 3 years ago

Version 1.9.0 November 06, 2021

✨ Features

  • We have enhanced typescript support to allow easy addition of new typescript enabled language models. 👏

⚙️ Updates

  • Added naive wikification showcase in README. 😎
wink-nlp - Operational update

Published by sanjayaksaxena about 3 years ago

Version 1.8.1 September 22, 2021

⚙️ Updates

  • Included NLP Pipe details in the README file. 🤓