null
Loading... Please wait...
FREE SHIPPING on All Unbranded Items LEARN MORE
Print This Page

Scaling Machine Learning with Spark (Distributed ML with MLlib, TensorFlow, and PyTorch)

List Price: $79.99
SKU:
9781098106829
Quantity:
Minimum Purchase
25 unit(s)
  • Availability: Confirm prior to ordering
  • Branding: minimum 50 pieces (add’l costs below)
  • Check Freight Rates (branded products only)

Branding Options (v), Availability & Lead Times

  • 1-Color Imprint: $2.00 ea.
  • Promo-Page Insert: $2.50 ea. (full-color printed, single-sided page)
  • Belly-Band Wrap: $2.50 ea. (full-color printed)
  • Set-Up Charge: $45 per decoration
FULL DETAILS
  • Availability: Product availability changes daily, so please confirm your quantity is available prior to placing an order.
  • Branded Products: allow 10 business days from proof approval for production. Branding options may be limited or unavailable based on product design or cover artwork.
  • Unbranded Products: allow 3-5 business days for shipping. All Unbranded items receive FREE ground shipping in the US. Inquire for international shipping.
  • RETURNS/CANCELLATIONS: All orders, branded or unbranded, are NON-CANCELLABLE and NON-RETURNABLE once a purchase order has been received.
  • Product Details

    Author:
    Adi Polak
    Format:
    Paperback
    Pages:
    291
    Publisher:
    O'Reilly Media (April 11, 2023)
    Language:
    English
    ISBN-13:
    9781098106829
    ISBN-10:
    1098106822
    Dimensions:
    7" x 9.19"
    File:
    TWO RIVERS-PERSEUS-Metadata_Only_Perseus_Distribution_Customer_Group_Metadata_20251023163248-20251023.xml
    Folder:
    TWO RIVERS
    List Price:
    $79.99
    Case Pack:
    12
    As low as:
    $68.79
    Publisher Identifier:
    P-PER
    Discount Code:
    C
    Country of Origin:
    United States
    Pub Discount:
    60
    Weight:
    16.64oz
    Imprint:
    O'Reilly Media
  • Overview

    Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals--allowing data and ML practitioners to collaborate and understand each other better.

    Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.

    You will:

    • Explore machine learning, including distributed computing concepts and terminology
    • Manage the ML lifecycle with MLflow
    • Ingest data and perform basic preprocessing with Spark
    • Explore feature engineering, and use Spark to extract features
    • Train a model with MLlib and build a pipeline to reproduce it
    • Build a data system to combine the power of Spark with deep learning
    • Get a step-by-step example of working with distributed TensorFlow
    • Use PyTorch to scale machine learning and its internal architecture