HDInsight Essentials - Second Edition - Couverture souple

Nadipalli, Rajesh

 
9781784399429: HDInsight Essentials - Second Edition

Synopsis

Learn how to build and deploy a modern big data architecture to empower your business

About This Book

  • Learn how to quickly provision a Hadoop cluster using Windows Azure Cloud Services
  • Build an end-to-end application for a big data problem using open source software
  • Discover more about modern data architecture with this guide, to help you understand the transition from legacy relational Enterprise Data Warehouse

Who This Book Is For

If you want to discover one of the latest tools designed to produce stunning Big Data insights, this book features everything you need to get to grips with your data. Whether you are a data architect, developer, or a business strategist, HDInsight adds value in everything from development, administration, and reporting.

What You Will Learn

  • Explore core features of Hadoop, including the HDFS2 and YARN, the new resource manager for Hadoop
  • Build your HDInsight cluster in minutes and learn how to administer it using Azure PowerShell
  • Discover what's new in Hadoop 2.X and the reference architecture for a modern data lake based on Hadoop
  • Find out more about a data lake vision and its core capabilities
  • Ingest and organize your data into HDInsight
  • Utilize open source software to transform data including Hive, Pig, and MapReduce, and make it available for decision makers
  • Get to grips with architectural considerations for scalability, maintainability, and security

In Detail

Traditional relational databases are today ineffective with dealing with the challenges presented by Big Data. A Hadoop-based architecture offers a radical solution, as it is designed specifically to handle huge sets of unstructured data.

This book takes you through the journey of building a modern data lake architecture using HDInsight, a Hadoop-based service that allows you to successfully manage high volume and velocity data in the Microsoft Azure Cloud. Featuring a wealth of practical examples, you'll find tips and techniques to provision your own HDInsight cluster to ingest, organize, transform, and analyze data.

While guided through HDInsight, you'll explore the wider Hadoop ecosystem with plenty of working examples on Hadoop technologies including Hive, Pig, MapReduce, HBase, Storm, and analytics solutions including using Excel PowerQuery, PowerMap, and PowerBI.

Les informations fournies dans la section « Synopsis » peuvent faire référence à une autre édition de ce titre.

Présentation de l'éditeur

Learn how to build and deploy a modern big data architecture to empower your business

About This Book

  • Learn how to quickly provision a Hadoop cluster using Windows Azure Cloud Services
  • Build an end-to-end application for a big data problem using open source software
  • Discover more about modern data architecture with this guide, to help you understand the transition from legacy relational Enterprise Data Warehouse

Who This Book Is For

If you want to discover one of the latest tools designed to produce stunning Big Data insights, this book features everything you need to get to grips with your data. Whether you are a data architect, developer, or a business strategist, HDInsight adds value in everything from development, administration, and reporting.

What You Will Learn

  • Explore core features of Hadoop, including the HDFS2 and YARN, the new resource manager for Hadoop
  • Build your HDInsight cluster in minutes and learn how to administer it using Azure PowerShell
  • Discover what's new in Hadoop 2.X and the reference architecture for a modern data lake based on Hadoop
  • Find out more about a data lake vision and its core capabilities
  • Ingest and organize your data into HDInsight
  • Utilize open source software to transform data including Hive, Pig, and MapReduce, and make it available for decision makers
  • Get to grips with architectural considerations for scalability, maintainability, and security

In Detail

Traditional relational databases are today ineffective with dealing with the challenges presented by Big Data. A Hadoop-based architecture offers a radical solution, as it is designed specifically to handle huge sets of unstructured data.

This book takes you through the journey of building a modern data lake architecture using HDInsight, a Hadoop-based service that allows you to successfully manage high volume and velocity data in the Microsoft Azure Cloud. Featuring a wealth of practical examples, you'll find tips and techniques to provision your own HDInsight cluster to ingest, organize, transform, and analyze data.

While guided through HDInsight, you'll explore the wider Hadoop ecosystem with plenty of working examples on Hadoop technologies including Hive, Pig, MapReduce, HBase, Storm, and analytics solutions including using Excel PowerQuery, PowerMap, and PowerBI.

Biographie de l'auteur

Rajesh Nadipalli

Rajesh Nadipalli currently manages software architecture and delivery of Zaloni's Bedrock Data Management Platform, which enables customers to quickly and easily realize true Hadoop-based Enterprise Data Lakes. Rajesh is also an instructor and a content provider for Hadoop training, including Hadoop development, Hive, Pig, and HBase. In his previous role as a senior solutions architect, he evaluated big data goals for his clients, recommended a target state architecture, and conducted proof of concepts and production implementation. His clients include Verizon, American Express, NetApp, Cisco, EMC, and UnitedHealth Group. Prior to Zaloni, Rajesh worked for Cisco Systems for 12 years and held a technical leadership position. His key focus areas have been data management, enterprise architecture, business intelligence, data warehousing, and Extract Transform Load (ETL). He has demonstrated success by delivering scalable data management and BI solutions that empower business to make informed decisions. Rajesh authored the first version of the book HDInsight Essentials, Packt Publishing, released in September 2013, the first book in print for HDInsight, providing data architects, developers, and managers with an introduction to the new Hadoop distribution from Microsoft. He has over 18 years of IT experience. He holds an MBA from North Carolina State University and a BSc degree in Electronics and Electrical from the University of Mumbai, India.

Les informations fournies dans la section « A propos du livre » peuvent faire référence à une autre édition de ce titre.