Blog

Data Science & ML

Back to Blog ⏎
Article featured image

Computer Vision at Scale With Dask and PyTorch

This tutorial walks through how to use PyTorch and Dask to train an image recognition model across a GPU cluster.

See more

Article featured image

Cross-Entropy Loss Function

When working on a Machine Learning or a Deep Learning Problem, loss/cost functions are used to optimize the model during training. The …

See more

Article featured image

Random Forest on GPUs: 2000x Faster than Apache Spark

This blog post compares using RAPIDS and Dask vs Apache Spark for model training

See more

Article featured image

Supercharging Hyperparameter Tuning with Dask

The distributed computing framework Dask is great for hyperparameter tuning, since you can train different parameter sets concurrently.

See more

Article featured image

Practical Issues Setting up Kubernetes for Data Science on AWS

Data science has unique workflows that don't always match those of software engineers and require special setup for Kubernetes.

See more

Article featured image

Setting Up Your Data Science & Machine Learning Capability in Python

Python is a great language to base your DS/ML framework on, and allows you to avoid being locked into one vendor specific framework.

See more

Article featured image

Snowflake and Dask

This article covers efficient ways to load data from Snowflake into a Dask distributed cluster.

See more

Article featured image

3 Ways to Schedule and Execute Python Jobs

Being able to run a Python script on a schedule is an important part of many data science tasks. This blog post walks through three …

See more