Random Forest on GPUs: 2000x Faster than Apache Spark
This blog post compares using RAPIDS and Dask vs Apache Spark for model training

Senseye uses Saturn Cloud to train machine learning models on GPUs at a massive scale.
Read article →
This blog post compares using RAPIDS and Dask vs Apache Spark for model training

The distributed computing framework Dask is great for hyperparameter tuning, since you can train different parameter sets concurrently.

Data science has unique workflows that don't always match those of software engineers and require special setup for Kubernetes.

Python is a great language to base your DS/ML framework on, and allows you to avoid being locked into one vendor specific framework.

This article covers efficient ways to load data from Snowflake into a Dask distributed cluster.

It's not always clear when using the distributed framework Dask is the right choice.

Being able to run a Python script on a schedule is an important part of many data science tasks. This blog post walks through three …

Artificial Intelligence has been witnessing monumental growth in bridging the gap between the capabilities of humans and machines. …
Learn how to set a default environment for your Anaconda and Jupyter workflows for a seamless and streamlined data science experience.