datacompy

Pandas and Spark DataFrame comparison for humans and more!

License: Apache-2.0

Downloads: 901.8K
Stars: 399
Committers: 26

datacompy - Version 0.7.0

Published by fdosani over 4 years ago

  • Dropped Python 2 support (Python 3.5+ only)
  • Added a datacompy[spark] pip install option (#54)
  • Strip spaces in join columns (#62)
  • Added a new function, all_mismatch, which returns every mismatching row as a DataFrame so users can export, query, or analyze the mismatches when there are many of them (#64)
  • Created MANIFEST.in (#66)

datacompy - Version 0.6.0

Published by theianrobertson over 5 years ago

Small bug fix for Python 2.7 installations.

datacompy - Version 0.5.2

Published by theianrobertson over 5 years ago

Changes since 0.5.1:

  • Added ignore_spaces and ignore_case flags for more flexible string comparisons
  • Fixed a bug (#35) with duplicate matching when nulls are present in the join columns
  • Added pre-commit and black for code formatting

datacompy - Version 0.5.1

Published by theianrobertson over 6 years ago

Added rel_tol, abs_tol, show_all_columns, and match_rates to SparkCompare.

datacompy - Version 0.5.0

Published by theianrobertson over 6 years ago

First release to public GitHub!