null
Loading... Please wait...
FREE SHIPPING on All Unbranded Items LEARN MORE
Print This Page

Field Guide to Hadoop (An Introduction to Hadoop, Its Ecosystem, and Aligned Technologies)

List Price: $39.99
SKU:
9781491947937
Quantity:
Minimum Purchase
25 unit(s)
  • Availability: Confirm prior to ordering
  • Branding: minimum 50 pieces (add’l costs below)
  • Check Freight Rates (branded products only)

Branding Options (v), Availability & Lead Times

  • 1-Color Imprint: $2.00 ea.
  • Promo-Page Insert: $2.50 ea. (full-color printed, single-sided page)
  • Belly-Band Wrap: $2.50 ea. (full-color printed)
  • Set-Up Charge: $45 per decoration
FULL DETAILS
  • Availability: Product availability changes daily, so please confirm your quantity is available prior to placing an order.
  • Branded Products: allow 10 business days from proof approval for production. Branding options may be limited or unavailable based on product design or cover artwork.
  • Unbranded Products: allow 3-5 business days for shipping. All Unbranded items receive FREE ground shipping in the US. Inquire for international shipping.
  • RETURNS/CANCELLATIONS: All orders, branded or unbranded, are NON-CANCELLABLE and NON-RETURNABLE once a purchase order has been received.
  • Product Details

    Author:
    Kevin Sitto, Marshall Presser
    Format:
    Paperback
    Pages:
    130
    Publisher:
    O'Reilly Media (April 21, 2015)
    Language:
    English
    ISBN-13:
    9781491947937
    ISBN-10:
    1491947934
    Dimensions:
    6" x 9"
    File:
    TWO RIVERS-PERSEUS-Metadata_Only_Perseus_Distribution_Customer_Group_Metadata_20251022163324-20251022.xml
    Folder:
    TWO RIVERS
    List Price:
    $39.99
    As low as:
    $34.39
    Publisher Identifier:
    P-PER
    Discount Code:
    C
    Case Pack:
    60
    Country of Origin:
    United States
    Pub Discount:
    60
    Weight:
    6.4oz
    Imprint:
    O'Reilly Media
  • Overview

    If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You’ll quickly understand how Hadoop’s projects, subprojects, and related technologies work together.

    Each chapter introduces a different topic—such as core technologies or data transfer—and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you’ll have a good grasp of the playing field.

    Topics include:

    • Core technologies—Hadoop Distributed File System (HDFS), MapReduce, YARN, and Spark
    • Database and data management—Cassandra, HBase, MongoDB, and Hive
    • Serialization—Avro, JSON, and Parquet
    • Management and monitoring—Puppet, Chef, Zookeeper, and Oozie
    • Analytic helpers—Pig, Mahout, and MLLib
    • Data transfer—Scoop, Flume, distcp, and Storm
    • Security, access control, auditing—Sentry, Kerberos, and Knox
    • Cloud computing and virtualization—Serengeti, Docker, and Whirr