fast_eda

Fast data exploration using Sweetviz

Stars
18

Table of Contents

  1. What is this repo about?
  2. Quick setup
  3. Example
  4. Let's connect!

What is this repo about?

Tired of spending too much time doing data exploration before training your Machine Learning models?

Looking for a faster way to understand data issues and patterns, before you dive into the fun part of training your ML model?

Wanna learn how to train better ML models, by finding and fixing issues in your data?

In this repo you can find a short Python script that used the Sweetviz library to do data exploration at the speed of light.

I also invite you to read the original article I published on my site:

Fast And Easy Data Exploration For Machine Learning

Quick setup

You need Python >= 3.7 and

$ pip install sweetviz

Example

I attached a sample dataset for a churn prediction model, that you can quickly explore with a one-liner.

$ python eda.py --file v1.csv --target Churn

Let's connect

If you want to learn more about real-world ML topics and become a better data scientist

Subscribe to the datamachines newsletter.

Follow me on Twitter and LinkedIn.

Related Projects