Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0 - Couverture souple

Karslioglu, Svetlana

9781801074483: Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Couverture souple

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Editeur : Packt Publishing, 2022

Afficher les exemplaires de cette �dition comportant l'ISBN

2 D'occasion

De EUR 52,83

18 Neuf

De EUR 46,81

Create scalable and reliable data pipelines easily with Pachyderm

Key Features

Learn how to build an enterprise-level reproducible data science platform with Pachyderm
Deploy Pachyderm on cloud platforms such as AWS EKS, Google Kubernetes Engine, and Microsoft Azure Kubernetes Service
Integrate Pachyderm with other data science tools, such as Pachyderm Notebooks

Book Description

Pachyderm is an open source project that enables data scientists to run reproducible data pipelines and scale them to an enterprise level. This book will teach you how to implement Pachyderm to create collaborative data science workflows and reproduce your ML experiments at scale.

You'll begin your journey by exploring the importance of data reproducibility and comparing different data science platforms. Next, you'll explore how Pachyderm fits into the picture and its significance, followed by learning how to install Pachyderm locally on your computer or a cloud platform of your choice. You'll then discover the architectural components and Pachyderm's main pipeline principles and concepts. The book demonstrates how to use Pachyderm components to create your first data pipeline and advances to cover common operations involving data, such as uploading data to and from Pachyderm to create more complex pipelines. Based on what you've learned, you'll develop an end-to-end ML workflow, before trying out the hyperparameter tuning technique and the different supported Pachyderm language clients. Finally, you'll learn how to use a SaaS version of Pachyderm with Pachyderm Notebooks.

By the end of this book, you will learn all aspects of running your data pipelines in Pachyderm and manage them on a day-to-day basis.

What you will learn

Understand the importance of reproducible data science for enterprise
Explore the basics of Pachyderm, such as commits and branches
Upload data to and from Pachyderm
Implement common pipeline operations in Pachyderm
Create a real-life example of hyperparameter tuning in Pachyderm
Combine Pachyderm with Pachyderm language clients in Python and Go

Who this book is for

This book is for new as well as experienced data scientists and machine learning engineers who want to build scalable infrastructures for their data science projects. Basic knowledge of Python programming and Kubernetes will be beneficial. Familiarity with Golang will be helpful.

The Problem of Data Reproducibility
Pachyderm Basics�
Pachyderm Pipeline Specification
Installing Pachyderm Locally
Installing Pachyderm on a Cloud Platform
Creating Your First Pipeline
Pachyderm Operations
Creating an End-to-End Machine Learning Workflow�
Distributed Hyperparameter Tuning with Pachyderm
Pachyderm Language Clients
Using Pachyderm Notebooks

Les informations fournies dans la section � Synopsis � peuvent faire r�f�rence � une autre �dition de ce titre.

� propos de l'auteur

Svetlana Karslioglu is a seasoned documentation professional with over 10 years of experience in top Silicon Valley companies. During her tenure at Pachyderm, she authored much of the open source documentation for Pachyderm and was also in charge of the documentation infrastructure. Throughout her career, she has spoken at local conferences and given talks advocating for open infrastructure and unbiased research in artificial intelligence. When Svetlana is not busy writing books, she spends time with her three children and her husband, Murat.

Les informations fournies dans la section � A propos du livre � peuvent faire r�f�rence � une autre �dition de ce titre.

�diteur: Packt Publishing
Date d'�dition: 2022
Langue: anglais
ISBN 10: 1801074488
ISBN 13: 9781801074483
Reliure: Broch�
Nombre de pages: 364
Coordonn�es du fabricant: non disponible
Personne responsable: GPSR Kontakt
gpsr@libri.de

Europaallee 1
Bad Hersfeld
36244
Allemagne

Acheter D'occasion

�tat : Comme neuf

Unread book in perfect condition...

Afficher cet article

EUR 52,83

Exp�dition �EUR 2,31
Exp�dition nationale�: Etats-Unis

Ajouter au panier

Acheter neuf

Afficher cet article

EUR 46,81

Exp�dition �EUR 2,31
Exp�dition nationale�: Etats-Unis

Ajouter au panier

R�sultats de recherche pour Reproducible Data Science with Pachyderm: Learn how...

Image d'archives

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Svetlana Karslioglu

Edit� par Packt Publishing, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Couverture souple

Vendeur : GreatBookPrices, Columbia, MD, Etats-Unis

�valuation du vendeur 5 sur 5 �toiles

Etat : New. N� de r�f. du vendeur 44238536-n

Contacter le vendeur

Acheter neuf

EUR 46,81

Exp�dition �EUR 2,31
Exp�dition nationale�: Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image fournie par le vendeur

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0 (Paperback or Softback)

Karslioglu, Svetlana

Edit� par Packt Publishing 3/18/2022, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Paperback or Softback

Vendeur : BargainBookStores, Grand Rapids, MI, Etats-Unis

�valuation du vendeur 5 sur 5 �toiles

Paperback or Softback. Etat : New. Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0. Book. N� de r�f. du vendeur BBS-9781801074483

Contacter le vendeur

Acheter neuf

EUR 49,20

Livraison gratuite
Exp�dition nationale�: Etats-Unis

Quantit� disponible : 5 disponible(s)

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Svetlana Karslioglu

Edit� par Packt Publishing, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Couverture souple

Vendeur : California Books, Miami, FL, Etats-Unis

�valuation du vendeur 4 sur 5 �toiles

Etat : New. N� de r�f. du vendeur I-9781801074483

Contacter le vendeur

Acheter neuf

EUR 50,45

Livraison gratuite
Exp�dition nationale�: Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Svetlana Karslioglu

Edit� par Packt Publishing, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Ancien ou d'occasion Couverture souple

Vendeur : GreatBookPrices, Columbia, MD, Etats-Unis

�valuation du vendeur 5 sur 5 �toiles

Etat : As New. Unread book in perfect condition. N� de r�f. du vendeur 44238536

Contacter le vendeur

Acheter D'occasion

EUR 52,83

Exp�dition �EUR 2,31
Exp�dition nationale�: Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm

Svetlana Karslioglu

Edit� par Packt Publishing Limited, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf PAP

impression � la demande

Vendeur : PBShop.store US, Wood Dale, IL, Etats-Unis

�valuation du vendeur 5 sur 5 �toiles

PAP. Etat : New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. N� de r�f. du vendeur L0-9781801074483

Contacter le vendeur

Acheter neuf

EUR 57,73

Livraison gratuite
Exp�dition nationale�: Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm

Svetlana Karslioglu

Edit� par Packt Publishing Limited, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf PAP

impression � la demande

Vendeur : PBShop.store UK, Fairford, GLOS, Royaume-Uni

�valuation du vendeur 5 sur 5 �toiles

PAP. Etat : New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. N� de r�f. du vendeur L0-9781801074483

Contacter le vendeur

Acheter neuf

EUR 54,51

Exp�dition �EUR 3,86
Exp�dition depuis Royaume-Uni vers Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm

Svetlana Karslioglu

Edit� par Packt Publishing Limited, GB, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Paperback

Vendeur : Rarewaves USA, OSWEGO, IL, Etats-Unis

�valuation du vendeur 5 sur 5 �toiles

Paperback. Etat : New. Create scalable and reliable data pipelines easily with PachydermKey FeaturesLearn how to build an enterprise-level reproducible data science platform with PachydermDeploy Pachyderm on cloud platforms such as AWS EKS, Google Kubernetes Engine, and Microsoft Azure Kubernetes ServiceIntegrate Pachyderm with other data science tools, such as Pachyderm NotebooksBook DescriptionPachyderm is an open source project that enables data scientists to run reproducible data pipelines and scale them to an enterprise level. This book will teach you how to implement Pachyderm to create collaborative data science workflows and reproduce your ML experiments at scale.You'll begin your journey by exploring the importance of data reproducibility and comparing different data science platforms. Next, you'll explore how Pachyderm fits into the picture and its significance, followed by learning how to install Pachyderm locally on your computer or a cloud platform of your choice. You'll then discover the architectural components and Pachyderm's main pipeline principles and concepts. The book demonstrates how to use Pachyderm components to create your first data pipeline and advances to cover common operations involving data, such as uploading data to and from Pachyderm to create more complex pipelines. Based on what you've learned, you'll develop an end-to-end ML workflow, before trying out the hyperparameter tuning technique and the different supported Pachyderm language clients. Finally, you'll learn how to use a SaaS version of Pachyderm with Pachyderm Notebooks.By the end of this book, you will learn all aspects of running your data pipelines in Pachyderm and manage them on a day-to-day basis.What you will learnUnderstand the importance of reproducible data science for enterpriseExplore the basics of Pachyderm, such as commits and branchesUpload data to and from PachydermImplement common pipeline operations in PachydermCreate a real-life example of hyperparameter tuning in PachydermCombine Pachyderm with Pachyderm language clients in Python and GoWho this book is forThis book is for new as well as experienced data scientists and machine learning engineers who want to build scalable infrastructures for their data science projects. Basic knowledge of Python programming and Kubernetes will be beneficial. Familiarity with Golang will be helpful. N� de r�f. du vendeur LU-9781801074483

Contacter le vendeur

Acheter neuf

EUR 60,25

Livraison gratuite
Exp�dition nationale�: Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm

Svetlana Karslioglu

Edit� par Packt Publishing, Limited, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Couverture souple

Vendeur : Books Puddle, New York, NY, Etats-Unis

�valuation du vendeur 4 sur 5 �toiles

Etat : New. pp. 364. N� de r�f. du vendeur 26394968213

Contacter le vendeur

Acheter neuf

EUR 62,43

Exp�dition �EUR 3,49
Exp�dition nationale�: Etats-Unis

Quantit� disponible : 4 disponible(s)

Ajouter au panier

Image fournie par le vendeur

Reproducible Data Science with Pachyderm

Svetlana Karslioglu

Edit� par Packt Publishing Limited, GB, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Paperback

Vendeur : Rarewaves.com USA, London, LONDO, Royaume-Uni

�valuation du vendeur 5 sur 5 �toiles

Contacter le vendeur

Acheter neuf

EUR 67,05

Livraison gratuite
Exp�dition depuis Royaume-Uni vers Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

Image d'archives

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Svetlana Karslioglu

Edit� par Packt Publishing, 2022

ISBN 10 : 1801074488 ISBN 13 : 9781801074483

Neuf Couverture souple

Vendeur : Ria Christie Collections, Uxbridge, Royaume-Uni

�valuation du vendeur 5 sur 5 �toiles

Etat : New. In. N� de r�f. du vendeur ria9781801074483_new

Contacter le vendeur

Acheter neuf

EUR 53,95

Exp�dition �EUR 14,05
Exp�dition depuis Royaume-Uni vers Etats-Unis

Quantit� disponible : Plus de 20 disponibles

Ajouter au panier

There are 10 autres exemplaires de ce livre sont disponibles

Afficher tous les r�sultats pour ce livre

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0 - Couverture souple

Synopsis

Key Features

Book Description

What you will learn

Who this book is for

Table of Contents

� propos de l'auteur

R�sultats de recherche pour Reproducible Data Science with Pachyderm: Learn how...

Acheter neuf

Acheter neuf

Acheter neuf

Acheter D'occasion

Acheter neuf

Acheter neuf

Acheter neuf

Acheter neuf

Acheter neuf

Acheter neuf

There are 10 autres exemplaires de ce livre sont disponibles