A simple Spark TDD example
MIT License
This is a very basic example of how to use Test Driven Development (TDD) in the context of PySpark, Spark's Python API.
brew install apache-spark
cd /usr/local/Cellar/apache-spark/2.1.0/libexec/conf
cp log4j.properties.template log4j.properties
log4j.rootCategory=ERROR, console
export SPARK_HOME="/usr/local/Cellar/apache-spark/2.1.0/libexec/"
nosetests -vs test_clustering.py