null
Loading... Please wait...
FREE SHIPPING on All Unbranded Items LEARN MORE
Print This Page

Effective Data Science Infrastructure (How to make data scientists productive)

List Price: $59.99
SKU:
9781617299193
Quantity:
Minimum Purchase
25 unit(s)
  • Availability: Confirm prior to ordering
  • Branding: minimum 50 pieces (add’l costs below)
  • Check Freight Rates (branded products only)

Branding Options (v), Availability & Lead Times

  • 1-Color Imprint: $2.00 ea.
  • Promo-Page Insert: $2.50 ea. (full-color printed, single-sided page)
  • Belly-Band Wrap: $2.50 ea. (full-color printed)
  • Set-Up Charge: $45 per decoration
FULL DETAILS
  • Availability: Product availability changes daily, so please confirm your quantity is available prior to placing an order.
  • Branded Products: allow 10 business days from proof approval for production. Branding options may be limited or unavailable based on product design or cover artwork.
  • Unbranded Products: allow 3-5 business days for shipping. All Unbranded items receive FREE ground shipping in the US. Inquire for international shipping.
  • RETURNS/CANCELLATIONS: All orders, branded or unbranded, are NON-CANCELLABLE and NON-RETURNABLE once a purchase order has been received.
  • Product Details

    Author:
    Ville Tuulos
    Format:
    Paperback
    Pages:
    352
    Publisher:
    Manning (August 16, 2022)
    Language:
    English
    ISBN-13:
    9781617299193
    ISBN-10:
    1617299197
    Dimensions:
    7.375" x 9.25" x 0.8"
    File:
    Eloquence-SimonSchuster_05022026_P10038138_onix30_Complete-20260502.xml
    Folder:
    Eloquence
    List Price:
    $59.99
    As low as:
    $53.99
    Publisher Identifier:
    P-SS
    Discount Code:
    G
    Weight:
    19.2oz
    Case Pack:
    22
    Pub Discount:
    37
    Imprint:
    Manning
  • Overview

    Simplify data science infrastructure to give data scientists an efficient path from prototype to production.

    In Effective Data Science Infrastructure you will learn how to:

        Design data science infrastructure that boosts productivity
        Handle compute and orchestration in the cloud
        Deploy machine learning to production
        Monitor and manage performance and results
        Combine cloud-based tools into a cohesive data science environment
        Develop reproducible data science projects using Metaflow, Conda, and Docker
        Architect complex applications for multiple teams and large datasets
        Customize and grow data science infrastructure

    Effective Data Science Infrastructure: How to make data scientists more productive is a hands-on guide to assembling infrastructure for data science and machine learning applications. It reveals the processes used at Netflix and other data-driven companies to manage their cutting edge data infrastructure. In it, you’ll master scalable techniques for data storage, computation, experiment tracking, and orchestration that are relevant to companies of all shapes and sizes. You’ll learn how you can make data scientists more productive with your existing cloud infrastructure, a stack of open source software, and idiomatic Python.

    The author is donating proceeds from this book to charities that support women and underrepresented groups in data science.

    Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

    About the technology
    Growing data science projects from prototype to production requires reliable infrastructure. Using the powerful new techniques and tooling in this book, you can stand up an infrastructure stack that will scale with any organization, from startups to the largest enterprises.

    About the book
    Effective Data Science Infrastructure teaches you to build data pipelines and project workflows that will supercharge data scientists and their projects. Based on state-of-the-art tools and concepts that power data operations of Netflix, this book introduces a customizable cloud-based approach to model development and MLOps that you can easily adapt to your company’s specific needs. As you roll out these practical processes, your teams will produce better and faster results when applying data science and machine learning to a wide array of business problems.

    What's inside

        Handle compute and orchestration in the cloud
        Combine cloud-based tools into a cohesive data science environment
        Develop reproducible data science projects using Metaflow, AWS, and the Python data ecosystem
        Architect complex applications that require large datasets and models, and a team of data scientists

    About the reader
    For infrastructure engineers and engineering-minded data scientists who are familiar with Python.

    About the author
    At Netflix, Ville Tuulos designed and built Metaflow, a full-stack framework for data science. Currently, he is the CEO of a startup focusing on data science infrastructure.

    Table of Contents
    1 Introducing data science infrastructure
    2 The toolchain of data science
    3 Introducing Metaflow
    4 Scaling with the compute layer
    5 Practicing scalability and performance
    6 Going to production
    7 Processing data
    8 Using and operating models
    9 Machine learning with the full stack