null
Loading... Please wait...
FREE SHIPPING on All Unbranded Items LEARN MORE
Print This Page

Spark in Action, Second Edition (Covers Apache Spark 3 with Examples in Java, Python, and Scala)

List Price: $59.99
SKU:
9781617295522
Quantity:
Minimum Purchase
25 unit(s)
  • Availability: Confirm prior to ordering
  • Branding: minimum 50 pieces (add’l costs below)
  • Check Freight Rates (branded products only)

Branding Options (v), Availability & Lead Times

  • 1-Color Imprint: $2.00 ea.
  • Promo-Page Insert: $2.50 ea. (full-color printed, single-sided page)
  • Belly-Band Wrap: $2.50 ea. (full-color printed)
  • Set-Up Charge: $45 per decoration
FULL DETAILS
  • Availability: Product availability changes daily, so please confirm your quantity is available prior to placing an order.
  • Branded Products: allow 10 business days from proof approval for production. Branding options may be limited or unavailable based on product design or cover artwork.
  • Unbranded Products: allow 3-5 business days for shipping. All Unbranded items receive FREE ground shipping in the US. Inquire for international shipping.
  • RETURNS/CANCELLATIONS: All orders, branded or unbranded, are NON-CANCELLABLE and NON-RETURNABLE once a purchase order has been received.
  • Product Details

    Author:
    Jean-Georges Perrin
    Format:
    Paperback
    Pages:
    576
    Publisher:
    Manning (June 2, 2020)
    Language:
    English
    ISBN-13:
    9781617295522
    ISBN-10:
    1617295523
    Weight:
    31.2oz
    Dimensions:
    7.38" x 9.25" x 1.3"
    File:
    Eloquence-SimonSchuster_05022026_P10038138_onix30_Complete-20260502.xml
    Folder:
    Eloquence
    List Price:
    $59.99
    Case Pack:
    14
    As low as:
    $53.99
    Publisher Identifier:
    P-SS
    Discount Code:
    G
    Pub Discount:
    37
    Imprint:
    Manning
  • Overview

    Summary
    The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.

    Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

    About the technology
    Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

    About the book
    Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.

    What's inside

        Writing Spark applications in Java
        Spark application architecture
        Ingestion through files, databases, streaming, and Elasticsearch
        Querying distributed datasets with Spark SQL

    About the reader
    This book does not assume previous experience with Spark, Scala, or Hadoop.

    About the author
    Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years.

    Table of Contents

    PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES

    1 So, what is Spark, anyway?

    2 Architecture and flow

    3 The majestic role of the dataframe

    4 Fundamentally lazy

    5 Building a simple app for deployment

    6 Deploying your simple app

    PART 2 - INGESTION

    7 Ingestion from files

    8 Ingestion from databases

    9 Advanced ingestion: finding data sources and building

    your own

    10 Ingestion through structured streaming

    PART 3 - TRANSFORMING YOUR DATA

    11 Working with SQL

    12 Transforming your data

    13 Transforming entire documents

    14 Extending transformations with user-defined functions

    15 Aggregating your data

    PART 4 - GOING FURTHER

    16 Cache and checkpoint: Enhancing Spark’s performances

    17 Exporting data and building full data pipelines

    18 Exploring deployment