My findings from the Capital One Data Science Challenge
Important files
Main.py
- Script that parses the text file into a dataframe
DataCleaning.ipynb
- Examines data for any missing values
Visualizations.ipynb
- Explores the dataset for notable patterns
- Builds a better understanding of the data
Preprocessing.ipynb
- Prepares data for different modeling approaches
- Adds some custom features
ModelResults.ipynb
- Shows results from classification models
ClusteringResults.ipynb
- Shows findings from k-means clustering