Try for free

Hadoop with Python

Zachary Radtka, Donald Miner

Information

  • Publisher
  • ISBN
  • ePub ISBN
  • O’Reilly Media
  • 9781491942277
  • -
  • Published at
  • Pressing
  • 2015
  • 1

About this book

Hadoop is mostly written in Java, but that doesn’t exclude the use of other programming languages with this distributed storage and processing framework, particularly Python. With this concise book, you’ll learn how to use Python with the Hadoop Distributed File System (HDFS), MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework. Authors Zachary Radtka and Donald Miner from the data science firm Miner & Kasch take you through the basic concepts behind Hadoop, MapReduce, Pig, and Spark. Then, through multiple examples and use cases, you’ll learn how to work with these technologies by applying various Python tools.

Note: Some books are only available in specific countries.

Therefore, always check if your books are available in your country before subscribing by using the search function in the app at buku.app.