Hadoop in Practice

Computers / Data Processing, Computers / Databases / Data Mining, Computers / Software Development & Engineering / Tools, Ebook

Summary

Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you’ll face, like querying big data using Pig or writing a log file loader. You’ll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you’ll find yourself growing more comfortable with Hadoop and at home in the world of big data.

About the Technology

Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.

About the Book

Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You’ll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book’s examples create a well-structured and understandable codebase you can tweak to meet your own needs.

This book assumes the reader knows the basics of Hadoop.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.

What’s Inside

  • Conceptual overview of Hadoop and MapReduce
  • 85 practical, tested techniques
  • Real problems, real solutions
  • How to integrate MapReduce and R

Table of Contents

  • PART 1 BACKGROUND AND FUNDAMENTALS
  • Hadoop in a heartbeat PART 2 DATA LOGISTICS
  • Moving data in and out of Hadoop
  • Data serialization?working with text and beyond PART 3 BIG DATA PATTERNS
  • Applying MapReduce patterns to big data
  • Streamlining HDFS for big data
  • Diagnosing and tuning performance problems PART 4 DATA SCIENCE
  • Utilizing data structures and algorithms
  • Integrating R and Hadoop for statistics and more
  • Predictive analytics with Mahout PART 5 TAMING THE ELEPHANT
  • Hacking with Hive
  • Programming pipelines with Pig
  • Crunch and other technologies
  • Testing and debugging
  • Download Now Read Online

    Hadoop In Practice


    Download Now Read Online

    Author by : Alex Holmes
    Languange Used : en
    Release Date : 2014-10-12
    Publisher by : Manning Publications

    Annotation 'Summary Hadoop in Practice' provides over 100 tested, instantly useful techniques that will help y

    Hadoop In Practice


    Download Now Read Online

    Author by : Alex Holmes
    Languange Used : en
    Release Date : 2012
    Publisher by : Manning Publications

    Presents information and techniques of using Hadoop to query and analyze data which is distributed across larg

    Hadoop In Action


    Download Now Read Online

    Author by : Chuck Lam
    Languange Used : en
    Release Date : 2011-01-01
    Publisher by :

    Special Features: · Introduction to MapReduce· Examples illustrating ideas in practice· Hadoop's Streaming

    Hadoop The Definitive Guide


    Download Now Read Online

    Author by : Tom White
    Languange Used : en
    Release Date : 2012-05-10
    Publisher by : "O'Reilly Media, Inc."

    Ready to unlock the power of your data? With this comprehensive guide, you’ll learn how to build and maintai

    Professional Hadoop Solutions


    Download Now Read Online

    Author by : Boris Lublinsky
    Languange Used : en
    Release Date : 2013-09-12
    Publisher by : John Wiley & Sons

    The go-to guidebook for deploying Big Data solutions with Hadoop Today's enterprise architects need to underst

    Hadoop Application Architectures


    Download Now Read Online

    Author by : Mark Grover
    Languange Used : en
    Release Date : 2015-06-30
    Publisher by : "O'Reilly Media, Inc."

    Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many source

    Big Data In Practice


    Download Now Read Online

    Author by : Bernard Marr
    Languange Used : en
    Release Date : 2016-03-22
    Publisher by : John Wiley & Sons

    The best-selling author of Big Data is back, this time with a unique and in-depth insight into how specific co

    Leave a Reply