Nick Radcliffe - Developments in Test-Driven Data Analysis

Published on Jul 26, 2017

Description
Test-driven data analysis fuses and builds upon the ideas of test-driven development and reproducible research to support higher quality data analysis. This talk extends the foundations of TDDA with tight constraints on string fields, expressed as automatically discovered regular expressions, and with automatically discovered relationships between datasets.

Abstract
Test-driven data analysis fuses and builds upon the ideas of test-driven development and reproducible research to support higher quality data analysis.

Foundational concepts are:
* Level 0: Reference Tests
* Level 1: Automatic constraint discovery and validation.
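To make Level 1 concrete, here is a minimal sketch of what constraint discovery and validation could look like. It is an illustration of the idea only, not the tdda library's API or algorithm: the function names `discover_constraints` and `verify`, the dictionary layout, and the sample records are all assumptions made up for this example.

```python
from typing import Any


def discover_constraints(rows: list[dict[str, Any]]) -> dict[str, dict[str, Any]]:
    """Infer simple per-field constraints from reference data.

    Hypothetical helper: records type, min, max and nullability only.
    """
    constraints: dict[str, dict[str, Any]] = {}
    fields = {name for row in rows for name in row}
    for name in sorted(fields):
        values = [row.get(name) for row in rows]
        present = [v for v in values if v is not None]
        field: dict[str, Any] = {"allow_null": len(present) < len(values)}
        types = {type(v).__name__ for v in present}
        if len(types) == 1:  # only constrain fields with one consistent type
            field["type"] = types.pop()
            field["min"] = min(present)
            field["max"] = max(present)
        constraints[name] = field
    return constraints


def verify(rows: list[dict[str, Any]], constraints: dict[str, dict[str, Any]]) -> list[str]:
    """Return a description of every constraint violation found in rows."""
    failures = []
    for i, row in enumerate(rows):
        for name, c in constraints.items():
            value = row.get(name)
            if value is None:
                if not c["allow_null"]:
                    failures.append(f"row {i}: {name} is null")
                continue
            if "type" in c and type(value).__name__ != c["type"]:
                failures.append(f"row {i}: {name} has type {type(value).__name__}")
                continue
            if "min" in c and value < c["min"]:
                failures.append(f"row {i}: {name} is below the minimum {c['min']!r}")
            if "max" in c and value > c["max"]:
                failures.append(f"row {i}: {name} is above the maximum {c['max']!r}")
    return failures


if __name__ == "__main__":
    # Invented sample data, for illustration only.
    reference = [{"age": 31, "code": "A1"}, {"age": 47, "code": "B2"}]
    discovered = discover_constraints(reference)
    for problem in verify([{"age": 52, "code": "A9"}], discovered):
        print(problem)
```

The design point is that constraints are discovered from data believed to be good and then applied, unchanged, to later data, so a failure signals that either the data or the assumptions have shifted.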

This talk will extend these to cover:
* tight constraints on string fields, using regular expressions automatically discovered with rexpy
* constraints between datasets

and probably more.
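As a rough illustration of how a regular expression might be discovered from example strings, the sketch below groups strings by their sequence of character classes and merges run lengths into ranges. This is a toy approach chosen for brevity, not rexpy's actual algorithm or API; `char_class`, `shape`, `infer_patterns` and the sample strings are hypothetical names and data invented for this example.

```python
import re
from itertools import groupby


def char_class(ch: str) -> str:
    """Map a character to a regex fragment for its class."""
    if ch.isascii() and ch.isdigit():
        return r"\d"
    if ch.isascii() and ch.isupper():
        return "[A-Z]"
    if ch.isascii() and ch.islower():
        return "[a-z]"
    return re.escape(ch)  # punctuation and anything else is kept literal


def shape(text: str) -> list[tuple[str, int]]:
    """Run-length encode a string as (character class, run length) pairs."""
    return [(cls, len(list(run))) for cls, run in groupby(text, key=char_class)]


def infer_patterns(examples: list[str]) -> list[str]:
    """Return one anchored regex per distinct sequence of character classes."""
    groups: dict[tuple[str, ...], list[list[int]]] = {}
    for text in examples:
        runs = shape(text)
        key = tuple(cls for cls, _ in runs)
        groups.setdefault(key, []).append([n for _, n in runs])

    patterns = []
    for key, all_lengths in groups.items():
        parts = []
        # zip(*all_lengths) yields, per position, the run lengths seen there.
        for cls, counts in zip(key, zip(*all_lengths)):
            lo, hi = min(counts), max(counts)
            if lo == hi == 1:
                quantifier = ""
            elif lo == hi:
                quantifier = f"{{{lo}}}"
            else:
                quantifier = f"{{{lo},{hi}}}"
            parts.append(cls + quantifier)
        patterns.append("^" + "".join(parts) + "$")
    return patterns


if __name__ == "__main__":
    # Invented identifiers, for illustration only.
    for pattern in infer_patterns(["AB-123", "CD-4567", "x9"]):
        print(pattern)
```

The inferred patterns can then act as a tight constraint on a string field: a new value that matches none of them is flagged as a violation.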

Background material:

PyCon UK Talk, Cardiff, Test-Driven Data Analysis https://www.youtube.com/watch?v=FIw_7...
Blog: http://tdda.info, especially posts http://www.tdda.info/the-new-referenc... and http://www.tdda.info/constraint-disco...
Overview: http://www.predictiveanalyticsworld.c...

For some of the new material that will be covered in this talk, see:
http://www.tdda.info/introducing-rexp...
http://rexpy.herokuapp.com

www.pydata.org

PyData is an educational program of NumFOCUS, a 501(c)(3) non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.
