FITS data source for Spark SQL and DataFrames
Scala API for Apache Spark SQL high-order functions
Base classes to use when writing tests with Spark
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditab...
REST job server for Apache Spark
A Spark plugin for reading and writing Excel files
A cluster computing framework for processing large-scale geospatial data
All the things about TPC-DS in Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing
Apache Spark based framework for analysis A/B experiments
Code to accompany Advanced Analytics with Spark from O'Reilly Media
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteo...
A simple Spark-powered ETL framework that just works 🍺