EMR Bootstrap PySpark with Anaconda
This code should help to jump start PySpark with Anaconda on AWS.
Getting Started
conda env create -f environment.yml
- Fill in all the required information e.g. aws access key, secret acess key etc. into the
config.yml.example
file and rename it to config.yml
- Run it
python emr_loader.py
Requirements
Copyright
See LICENSE for details.
Copyright (c) 2016 Dat Tran.