Test-driven Data Analysis - Softcover

Radcliffe, Nicholas J.

 
9781032896700: Test-driven Data Analysis

Synopsis

Test-driven data analysis is the application of ideas from test-driven development of software to data-intensive work, including data science, data analysis, and data engineering. It is a methodology for improving the quality of data and of analytical pipelines and processes. It can be thought of as data analysis as if the answers actually matter.

Test-driven data analysis can be thought of as a sibling to reproducible research, with similar concerns, but greater emphasis on automated testing, and less requirement for a human to reproduce results. Extensive checklists are provided that can be used to improve quality before, during, and after analysis.

Key Features:

  • Prevents costly errors in analytical processes before they reach production through automated data validation and reference testing of data pipelines.
  • Provides actionable checklists for issues beyond the reach of automated testing.
  • Equips readers with open-source Python tools and language-agnostic command-line interfaces.
  • Addresses testing challenges for modern LLM-based systems including chat-bots and coding assistants.
  • Instills in analysts an inner voice that is always asking: “How is this misleading data misleading me?”

The information provided in the "Synopsis" section may refer to another edition of this title.

About the author

Nicholas Radcliffe is the Founder and Director of Stochastic Solutions Limited, a Scottish company specializing in consulting in data science, data analysis, and data engineering. He has also, since 1995, been a Visiting Professor in the Operations Research Group in the School of Mathematics at the University of Edinburgh. He is known for developing forma analysis (sic) of genetic algorithms and uplift modeling, before more recent work on test-driven data analysis.

The information provided in the "About this book" section may refer to another edition of this title.

Other popular editions of the same title

9781032897158: Test-Driven Data Analysis

Featured edition

ISBN 10: 1032897155 ISBN 13: 9781032897158
Publisher: Chapman and Hall/CRC, 2026
Hardcover