October 29th, 2020 | 10:30 AM PDT

Next-Generation Big Data Pipelines with Prefect and Dask

Aaron Richter, Saturn Cloud

Data pipelines are crucial to an organization’s data science efforts. They ensure data is collected and organized in a timely and accurate manner, and is made available for analysis and modeling. In this talk, we’ll introduce the next-generation stack for big data pipelines built upon Prefect and Dask, and compare it to popular tools like Spark, Airflow, and the Hadoop ecosystem.