null
Loading... Please wait...
FREE SHIPPING on All Unbranded Items LEARN MORE
Print This Page

Data Science with Python and Dask

List Price: $49.99
SKU:
9781617295607
Quantity:
Minimum Purchase
25 unit(s)
  • Availability: Confirm prior to ordering
  • Branding: minimum 50 pieces (add’l costs below)
  • Check Freight Rates (branded products only)

Branding Options (v), Availability & Lead Times

  • 1-Color Imprint: $2.00 ea.
  • Promo-Page Insert: $2.50 ea. (full-color printed, single-sided page)
  • Belly-Band Wrap: $2.50 ea. (full-color printed)
  • Set-Up Charge: $45 per decoration
FULL DETAILS
  • Availability: Product availability changes daily, so please confirm your quantity is available prior to placing an order.
  • Branded Products: allow 10 business days from proof approval for production. Branding options may be limited or unavailable based on product design or cover artwork.
  • Unbranded Products: allow 3-5 business days for shipping. All Unbranded items receive FREE ground shipping in the US. Inquire for international shipping.
  • RETURNS/CANCELLATIONS: All orders, branded or unbranded, are NON-CANCELLABLE and NON-RETURNABLE once a purchase order has been received.
  • Product Details

    Author:
    Jesse Daniel
    Format:
    Paperback
    Pages:
    296
    Publisher:
    Manning (July 30, 2019)
    Language:
    English
    ISBN-13:
    9781617295607
    ISBN-10:
    1617295604
    Weight:
    18.48oz
    Dimensions:
    7.38" x 9.25" x 0.6"
    File:
    Eloquence-SimonSchuster_05022026_P10038138_onix30_Complete-20260502.xml
    Folder:
    Eloquence
    List Price:
    $49.99
    Case Pack:
    26
    As low as:
    $44.99
    Publisher Identifier:
    P-SS
    Discount Code:
    G
    Pub Discount:
    37
    Imprint:
    Manning
  • Overview

    Summary

    Dask is a native parallel analytics tool designed to integrate seamlessly with the libraries you're already using, including Pandas, NumPy, and Scikit-Learn. With Dask you can crunch and work with huge datasets, using the tools you already have. And Data Science with Python and Dask is your guide to using Dask for your data projects without changing the way you work!

    Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. You'll find registration instructions inside the print book.

    About the Technology

    An efficient data pipeline means everything for the success of a data science project. Dask is a flexible library for parallel computing in Python that makes it easy to build intuitive workflows for ingesting and analyzing large, distributed datasets. Dask provides dynamic task scheduling and parallel collections that extend the functionality of NumPy, Pandas, and Scikit-learn, enabling users to scale their code from a single laptop to a cluster of hundreds of machines with ease.

    About the Book

    Data Science with Python and Dask teaches you to build scalable projects that can handle massive datasets. After meeting the Dask framework, you'll analyze data in the NYC Parking Ticket database and use DataFrames to streamline your process. Then, you'll create machine learning models using Dask-ML, build interactive visualizations, and build clusters using AWS and Docker.

    What's inside

    • Working with large, structured and unstructured datasets
    • Visualization with Seaborn and Datashader
    • Implementing your own algorithms
    • Building distributed apps with Dask Distributed
    • Packaging and deploying Dask apps

    About the Reader

    For data scientists and developers with experience using Python and the PyData stack.

    About the Author

    Jesse Daniel is an experienced Python developer. He taught Python for Data Science at the University of Denver and leads a team of data scientists at a Denver-based media technology company.

    Table of Contents

      PART 1 - The Building Blocks of scalable computing

    1. Why scalable computing matters
    2. Introducing Dask
    3. PART 2 - Working with Structured Data using Dask DataFrames

    4. Introducing Dask DataFrames
    5. Loading data into DataFrames
    6. Cleaning and transforming DataFrames
    7. Summarizing and analyzing DataFrames
    8. Visualizing DataFrames with Seaborn
    9. Visualizing location data with Datashader
    10. PART 3 - Extending and deploying Dask

    11. Working with Bags and Arrays
    12. Machine learning with Dask-ML
    13. Scaling and deploying Dask