Distributed scikit-learn meta-estimators in PySpark
APACHE-2.0 License
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Easy to use library to bring Tensorflow on Apache Spark
Apache Spark Machine Learning project using MLlib and Linear Regression on Databricks!
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython /...
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteo...
PySpark + Scikit-learn = Sparkit-learn
A commandline tool for analysis of big biological data sets for distributed HPC clusters.
A Pyspark companion for data science tasks.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Ka...
Simple and Distributed Machine Learning
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Apache Spark - A unified analytics engine for large-scale data processing
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Asynchronous actions for PySpark
MLeap: Deploy ML Pipelines to Production