A Scala feature transformation library for data science and machine learning
Apache-2.0 License
Scala toolchain for InfluxDB
A recommender system for discovering GitHub repos, built with Apache Spark
Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Spark-Transformers: Library for exporting Apache Spark MLlib models to use them in any Java appli...
Simple and Distributed Machine Learning
Some Data Science examples using Groovy
Basic framework utilities to quickly start writing production-ready Apache Spark applications
PySpark + Scikit-learn = Sparkit-learn
Apache Flink
A PySpark companion for data science tasks.
A simple Spark-powered ETL framework that just works 🍺
This project gives Kotlin bindings and several extensions for Apache Spark. We are looking to ha...
MLeap: Deploy ML Pipelines to Production
A collection of test cases and related reference material gathered while using Scala and Spark
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars cod...