A collection of utilities for handling PySpark's SparkContext
MIT License
Collection of Apache Spark docker images for OKDP
pyspark🍒🥭 is delicious, just eat it!😋😋
PySpark + Scikit-learn = Sparkit-learn
An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.
Asynchronous actions for PySpark
Distributed scikit-learn meta-estimators in PySpark
A PySpark companion for data science tasks.
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteo...
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Performance Observability for Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing
Basic framework utilities to quickly start writing production-ready Apache Spark applications
Official Dockerfile for Apache Spark
Chinese translation of the official Apache Spark documentation
FITS data source for Spark SQL and DataFrames