Discover how to describe your data in detail, identify data issues, and find out how to solve them using commonly used techniques and tips and tricks
Key features
Book Description
Getting clean data to reveal insights is essential, as directly jumping into data analysis without proper data cleaning may lead to incorrect results. This book shows you tools and techniques that you can apply to clean and handle data with Python. You'll begin by getting familiar with the shape of data by using practices that can be deployed routinely with most data sources. Then, the book teaches you how to manipulate data to get it into a useful form. You'll also learn how to filter and summarize data to gain insights and better understand what makes sense and what does not, along with discovering how to operate on data to address the issues you've identified. Moving on, you'll perform key tasks, such as handling missing values, validating errors, removing duplicate data, monitoring high volumes of data, and handling outliers and invalid dates. Next, you'll cover recipes on using supervised learning and Naive Bayes analysis to identify unexpected values and classification errors, and generate visualizations for exploratory data analysis (EDA) to visualize unexpected values. Finally, you'll build functions and classes that you can reuse without modification when you have new data.
By the end of this Python book, you'll be equipped with all the key skills that you need to clean data and diagnose problems within it.
What you will learn
Who this book is for
This book is for anyone looking for ways to handle messy, duplicate, and poor data using different Python tools and techniques. The book takes a recipe-based approach to help you to learn how to clean and manage data. Working knowledge of Python programming is all you need to get the most out of the book.
Les informations fournies dans la section « Synopsis » peuvent faire référence à une autre édition de ce titre.
Michael Walker has worked as a data analyst for over 30 years at a variety of educational institutions. He has also taught data science, research methods, statistics, and computer programming to undergraduates since 2006. He generates public sector and foundation reports and conducts analyses for publication in academic journals.
Les informations fournies dans la section « A propos du livre » peuvent faire référence à une autre édition de ce titre.
Vendeur : HPB-Red, Dallas, TX, Etats-Unis
paperback. Etat : Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority! N° de réf. du vendeur S_427708316
Quantité disponible : 1 disponible(s)
Vendeur : Better World Books, Mishawaka, IN, Etats-Unis
Etat : Very Good. Pages intact with possible writing/highlighting. Binding strong with minor wear. Dust jackets/supplements may not be included. Stock photo provided. Product includes identifying sticker. Better World Books: Buy Books. Do Good. N° de réf. du vendeur 55491962-6
Quantité disponible : 1 disponible(s)
Vendeur : ThriftBooks-Atlanta, AUSTELL, GA, Etats-Unis
Paperback. Etat : Fair. No Jacket. Readable copy. Pages may have considerable notes/highlighting. ~ ThriftBooks: Read More, Spend Less. N° de réf. du vendeur G1800565666I5N00
Quantité disponible : 1 disponible(s)
Vendeur : ThriftBooks-Atlanta, AUSTELL, GA, Etats-Unis
Paperback. Etat : As New. No Jacket. Pages are clean and are not marred by notes or folds of any kind. ~ ThriftBooks: Read More, Spend Less. N° de réf. du vendeur G1800565666I2N00
Quantité disponible : 1 disponible(s)
Vendeur : ThriftBooks-Atlanta, AUSTELL, GA, Etats-Unis
Paperback. Etat : Good. No Jacket. Pages can have notes/highlighting. Spine may show signs of wear. ~ ThriftBooks: Read More, Spend Less. N° de réf. du vendeur G1800565666I3N00
Quantité disponible : 1 disponible(s)
Vendeur : HPB-Ruby, Dallas, TX, Etats-Unis
paperback. Etat : Very Good. Connecting readers with great books since 1972! Used books may not include companion materials, and may have some shelf wear or limited writing. We ship orders daily and Customer Service is our top priority! N° de réf. du vendeur S_460921673
Quantité disponible : 1 disponible(s)
Vendeur : AwesomeBooks, Wallingford, Royaume-Uni
Etat : Very Good. This book is in very good condition and will be shipped within 24 hours of ordering. The cover may have some limited signs of wear but the pages are clean, intact and the spine remains undamaged. This book has clearly been well maintained and looked after thus far. Money back guarantee if you are not satisfied. See all our books here, order more than 1 book and get discounted shipping. . N° de réf. du vendeur 7719-9781800565661
Quantité disponible : 1 disponible(s)
Vendeur : Bahamut Media, Reading, Royaume-Uni
Etat : Very Good. Shipped within 24 hours from our UK warehouse. Clean, undamaged book with no damage to pages and minimal wear to the cover. Spine still tight, in very good condition. Remember if you are not happy, you are covered by our 100% money back guarantee. N° de réf. du vendeur 6545-9781800565661
Quantité disponible : 1 disponible(s)
Vendeur : Black Falcon Books, Wellesley, MA, Etats-Unis
Soft cover. Etat : Near Fine. 1st Edition. First published: December 2020, stated. The book is square and unmarked; spine and wraps uncreased; Mylar protected. N° de réf. du vendeur 017285
Quantité disponible : 1 disponible(s)
Vendeur : Bookbot, Prague, Rébublique tchèque
Softcover. Etat : Fine. Leichte Kratzer / Abnutzungen / Druckstellen; Gebrochener Buchrucken. Discover how to detail your data, identify issues, and solve them with effective techniques. Key features include mastering various data cleaning methods to uncover insights, manipulating data to meet business needs, and validating large volumes of data to diagnose problems before analysis. This guide emphasizes the importance of clean data for accurate insights, illustrating tools and techniques for data handling with Python. You'll start by understanding data shape through routine practices applicable to most sources, then learn to manipulate data into a useful format. The book covers filtering and summarizing data to enhance comprehension and address identified issues. Key tasks include managing missing values, validating errors, removing duplicates, monitoring large datasets, and handling outliers and invalid dates. You'll also explore supervised learning and Naive Bayes analysis for detecting unexpected values and classification errors, along with generating visualizations for exploratory data analysis (EDA). By the end, you'll have the skills to clean data effectively and diagnose problems. Learn to read and analyze data from various sources, summarize attributes, filter relevant data, tackle messy issues, improve productivity with method chaining, and use visualizations for insights. This resource is ideal for anyone aiming to manage poor data using Python tools, requiring only a basic understanding of Python prog. N° de réf. du vendeur 4433f8a7-735e-4e5f-a85a-6468f7504063
Quantité disponible : 1 disponible(s)