Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval
Hummingbird compiles trained ML models into tensor computation for faster inference.
AiLearning: data analysis + hands-on machine learning + linear algebra + PyTorch + NLTK + TF2
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Data search & enrichment library for Machine Learning → Easily find and add relevant features to ...
Powerful topic model visualization in Python
An experiment about re-implementing supervised learning models based on shallow neural network ap...
The "Python Machine Learning (1st edition)" book code repository and info resource
General Assembly's 2015 Data Science course in Washington, DC
Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and ...
Portfolio of data science projects completed by me for academic, self-learning, and hobby purposes.
A comprehensive toolkit for building Retrieval-Augmented Generation (RAG) pipelines, including da...
Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CI...
An approach to document exploration using Machine Learning. Let's cluster similar research articl...
Some fundamental machine learning and data-analysis techniques are explained through realistic ex...
Text vectorization tool to outperform TFIDF for classification tasks