This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.
This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.
It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.
Les informations fournies dans la section « Synopsis » peuvent faire référence à une autre édition de ce titre.
Vendeur : Grand Eagle Retail, Bensenville, IL, Etats-Unis
Paperback. Etat : new. Paperback. This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.Key FeaturesBuild memory-efficient pipelines using PyArrow backends and targeted dtype choices.Write Copy-on-Write-safe assignment patterns that work on pandas 2.3 and 3.0.Engineer rich ML features using ratios, bins, group statistics, and interaction terms.Handle class imbalance with SMOTE, ADASYN, and quantified pandas-based profiling.Scale datasets beyond RAM using Dask lazy evaluation and distributed cluster computing.Accelerate custom scoring functions with Numba JIT and JAX-compiled batch operations.Extract sentiment, topics, and clusters from raw text using TF-IDF and LDA pipelines.Perform spatial joins, buffer analysis, and geocoding with GeoPandas and geopy.Preserve named DataFrames throughout sklearn Pipelines using the set_output API.Migrate confidently from legacy pandas patterns to pandas 2.3 production standards.Table of ContentGetting Started with Pandas 2.3Data Read, Storage, and File FormatsIndexing and Selecting DataData Manipulation and TransformationTime Series and DateTime OperationsPerformance Optimization and ScalingMachine Learning with Pandas 2.3Text Mining and NLPGeospatial Data Analysis This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. N° de réf. du vendeur 9789349174665
Quantité disponible : 1 disponible(s)
Vendeur : California Books, Miami, FL, Etats-Unis
Etat : New. N° de réf. du vendeur I-9789349174665
Quantité disponible : Plus de 20 disponibles
Vendeur : PBShop.store US, Wood Dale, IL, Etats-Unis
PAP. Etat : New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. N° de réf. du vendeur L0-9789349174665
Quantité disponible : Plus de 20 disponibles
Vendeur : PBShop.store UK, Fairford, GLOS, Royaume-Uni
PAP. Etat : New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. N° de réf. du vendeur L0-9789349174665
Quantité disponible : Plus de 20 disponibles
Vendeur : BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Allemagne
Taschenbuch. Etat : Neu. This item is printed on demand - it takes 3-4 days longer - Neuware 220 pp. Englisch. N° de réf. du vendeur 9789349174665
Quantité disponible : 2 disponible(s)
Vendeur : AussieBookSeller, Truganina, VIC, Australie
Paperback. Etat : new. Paperback. This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.Key FeaturesBuild memory-efficient pipelines using PyArrow backends and targeted dtype choices.Write Copy-on-Write-safe assignment patterns that work on pandas 2.3 and 3.0.Engineer rich ML features using ratios, bins, group statistics, and interaction terms.Handle class imbalance with SMOTE, ADASYN, and quantified pandas-based profiling.Scale datasets beyond RAM using Dask lazy evaluation and distributed cluster computing.Accelerate custom scoring functions with Numba JIT and JAX-compiled batch operations.Extract sentiment, topics, and clusters from raw text using TF-IDF and LDA pipelines.Perform spatial joins, buffer analysis, and geocoding with GeoPandas and geopy.Preserve named DataFrames throughout sklearn Pipelines using the set_output API.Migrate confidently from legacy pandas patterns to pandas 2.3 production standards.Table of ContentGetting Started with Pandas 2.3Data Read, Storage, and File FormatsIndexing and Selecting DataData Manipulation and TransformationTime Series and DateTime OperationsPerformance Optimization and ScalingMachine Learning with Pandas 2.3Text Mining and NLPGeospatial Data Analysis This item is printed on demand. Shipping may be from our Sydney, NSW warehouse or from our UK or US warehouse, depending on stock availability. N° de réf. du vendeur 9789349174665
Quantité disponible : 1 disponible(s)
Vendeur : CitiRetail, Stevenage, Royaume-Uni
Paperback. Etat : new. Paperback. This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.Key FeaturesBuild memory-efficient pipelines using PyArrow backends and targeted dtype choices.Write Copy-on-Write-safe assignment patterns that work on pandas 2.3 and 3.0.Engineer rich ML features using ratios, bins, group statistics, and interaction terms.Handle class imbalance with SMOTE, ADASYN, and quantified pandas-based profiling.Scale datasets beyond RAM using Dask lazy evaluation and distributed cluster computing.Accelerate custom scoring functions with Numba JIT and JAX-compiled batch operations.Extract sentiment, topics, and clusters from raw text using TF-IDF and LDA pipelines.Perform spatial joins, buffer analysis, and geocoding with GeoPandas and geopy.Preserve named DataFrames throughout sklearn Pipelines using the set_output API.Migrate confidently from legacy pandas patterns to pandas 2.3 production standards.Table of ContentGetting Started with Pandas 2.3Data Read, Storage, and File FormatsIndexing and Selecting DataData Manipulation and TransformationTime Series and DateTime OperationsPerformance Optimization and ScalingMachine Learning with Pandas 2.3Text Mining and NLPGeospatial Data Analysis This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. N° de réf. du vendeur 9789349174665
Quantité disponible : 1 disponible(s)
Vendeur : buchversandmimpf2000, Emtmannsberg, BAYE, Allemagne
Taschenbuch. Etat : Neu. This item is printed on demand - Print on Demand Titel. Neuware Libri GmbH, Europaallee 1, 36244 Bad Hersfeld 220 pp. Englisch. N° de réf. du vendeur 9789349174665
Quantité disponible : 1 disponible(s)
Vendeur : preigu, Osnabrück, Allemagne
Taschenbuch. Etat : Neu. Learning Pandas 2, Second Edition | Master Data Wrangling, NLP, Geospatial Analysis, and Production ML Pipelines using pandas 2.3 | Matthew Rosch | Taschenbuch | Englisch | 2026 | GitforGits | EAN 9789349174665 | Verantwortliche Person für die EU: Libri GmbH, Europaallee 1, 36244 Bad Hersfeld, gpsr[at]libri[dot]de | Anbieter: preigu Print on Demand. N° de réf. du vendeur 135413820
Quantité disponible : 5 disponible(s)
Vendeur : AHA-BUCH GmbH, Einbeck, Allemagne
Taschenbuch. Etat : Neu. nach der Bestellung gedruckt Neuware - Printed after ordering - This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.Key FeaturesBuild memory-efficient pipelines using PyArrow backends and targeted dtype choices.Write Copy-on-Write-safe assignment patterns that work on pandas 2.3 and 3.0.Engineer rich ML features using ratios, bins, group statistics, and interaction terms.Handle class imbalance with SMOTE, ADASYN, and quantified pandas-based profiling.Scale datasets beyond RAM using Dask lazy evaluation and distributed cluster computing.Accelerate custom scoring functions with Numba JIT and JAX-compiled batch operations.Extract sentiment, topics, and clusters from raw text using TF-IDF and LDA pipelines.Perform spatial joins, buffer analysis, and geocoding with GeoPandas and geopy.Preserve named DataFrames throughout sklearn Pipelines using the set_output API.Migrate confidently from legacy pandas patterns to pandas 2.3 production standards.Table of ContentGetting Started with Pandas 2.3Data Read, Storage, and File FormatsIndexing and Selecting DataData Manipulation and TransformationTime Series and DateTime OperationsPerformance Optimization and ScalingMachine Learning with Pandas 2.3Text Mining and NLPGeospatial Data Analysis. N° de réf. du vendeur 9789349174665
Quantité disponible : 2 disponible(s)