Test-driven data analysis (TDDA) fuses and builds upon the ideas of test-driven development and reproducible research to support higher-quality data analysis. This talk will extend the foundational parts of TDDA to cover tight constraints on string fields, using automatically discovered regular expressions, and automatically discovered relationships between datasets.
Test-driven data analysis fuses and builds upon the ideas of test-driven development and reproducible research to support higher quality data analysis.
Foundational concepts are:
* Level 0: Reference Tests
* Level 1: Automatic constraint discovery and validation
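To make Level 1 concrete, here is a minimal sketch of what constraint discovery and validation could look like. It is a hand-rolled illustration in plain Python, not the TDDA tooling itself: the function names `discover_constraints` and `verify`, the constraint keys, and the sample records are all assumptions made for this example.

```python
# Illustrative only: a hand-rolled stand-in for constraint discovery and
# validation. All names and data here are hypothetical.

def discover_constraints(rows):
    """Infer simple per-field constraints from a list of dict records."""
    constraints = {}
    fields = {name for row in rows for name in row}
    for name in sorted(fields):
        values = [row.get(name) for row in rows]
        present = [v for v in values if v is not None]
        field = {
            # Type of the first non-null value seen for this field.
            "type": type(present[0]).__name__ if present else None,
            # 0 means "no nulls were seen, so none are allowed".
            "max_nulls": 0 if len(present) == len(values) else None,
        }
        if present and all(isinstance(v, (int, float)) for v in present):
            field["min"] = min(present)
            field["max"] = max(present)
        constraints[name] = field
    return constraints


def verify(rows, constraints):
    """Return (row_index, field, reason) for every constraint violation."""
    failures = []
    for i, row in enumerate(rows):
        for name, c in constraints.items():
            value = row.get(name)
            if value is None:
                if c["max_nulls"] == 0:
                    failures.append((i, name, "null"))
                continue
            if c["type"] and type(value).__name__ != c["type"]:
                failures.append((i, name, "type"))
                continue
            if "min" in c and value < c["min"]:
                failures.append((i, name, "min"))
            if "max" in c and value > c["max"]:
                failures.append((i, name, "max"))
    return failures


# Discover constraints on data believed to be good...
reference = [
    {"age": 34, "country": "UK"},
    {"age": 51, "country": "FR"},
    {"age": 27, "country": "UK"},
]
constraints = discover_constraints(reference)

# ...then check a new batch against them.
new_batch = [
    {"age": 45, "country": "DE"},
    {"age": -3, "country": None},
]
failures = verify(new_batch, constraints)
```

Constraints discovered this way are only as representative as the reference data: a legitimate age of 60 would also be flagged here, so in practice discovered constraints are a starting point to review rather than a final specification.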
This talk will extend these to cover:
* tight constraints on string fields, using regular expressions automatically discovered with rexpy
* constraints between datasets

and probably more.
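As a rough illustration of the idea behind discovering regular expressions from example strings, the sketch below collapses each string into runs of character classes and merges strings that share the same sequence of classes. This is a deliberately naive approach written for this example; it is not rexpy's algorithm or API, and the names `char_class`, `shape` and `infer_regex` are hypothetical.

```python
import re

# Illustrative only: a naive regex-inference sketch, not rexpy itself.

def char_class(ch):
    """Map a character to a regex fragment describing its class."""
    if "0" <= ch <= "9":
        return "[0-9]"
    if "A" <= ch <= "Z":
        return "[A-Z]"
    if "a" <= ch <= "z":
        return "[a-z]"
    return re.escape(ch)  # anything else is kept as an escaped literal


def shape(s):
    """Collapse a string into runs of [class, run_length]."""
    runs = []
    for ch in s:
        cls = char_class(ch)
        if runs and runs[-1][0] == cls:
            runs[-1][1] += 1
        else:
            runs.append([cls, 1])
    return runs


def infer_regex(examples):
    """Build one anchored regex that matches every example string."""
    groups = {}  # class sequence -> [min_len, max_len] per run
    for s in examples:
        runs = shape(s)
        key = tuple(cls for cls, _ in runs)
        lengths = [n for _, n in runs]
        if key not in groups:
            groups[key] = [[n, n] for n in lengths]
        else:
            for bounds, n in zip(groups[key], lengths):
                bounds[0] = min(bounds[0], n)
                bounds[1] = max(bounds[1], n)
    alternatives = []
    for key, all_bounds in groups.items():
        parts = []
        for cls, (lo, hi) in zip(key, all_bounds):
            if lo == hi == 1:
                parts.append(cls)
            elif lo == hi:
                parts.append("%s{%d}" % (cls, lo))
            else:
                parts.append("%s{%d,%d}" % (cls, lo, hi))
        alternatives.append("".join(parts))
    return "^(?:%s)$" % "|".join(alternatives)


examples = ["AB-1234", "C-56", "XYZ-789"]
pattern = infer_regex(examples)

# The inferred pattern can then act as a tight constraint on a string field.
accepted = [s for s in examples if re.match(pattern, s)]
rejected = [s for s in ["ab-12", "AB1234"] if not re.match(pattern, s)]
```

The resulting pattern accepts every training example and rejects strings with a different structure, which is what makes it usable as a constraint. A real tool has to make harder choices that this sketch ignores, such as how far to generalise and how to handle many distinct structures.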
Background material: