Go to file
2024-10-23 02:01:28 +08:00
cluster_experiment_scripts update gcloud instructions 2024-10-23 01:59:06 +08:00
data update gcloud instructions 2024-10-23 01:59:06 +08:00
experiment_configs update gcloud instructions 2024-10-23 01:59:06 +08:00
local_experiment_scripts update gcloud instructions 2024-10-23 01:59:06 +08:00
notes update notes 2024-10-23 02:01:28 +08:00
.gitignore update gcloud instructions 2024-10-23 01:59:06 +08:00
arg_extractor.py update gcloud instructions 2024-10-23 01:59:06 +08:00
data_augmentations.py update gcloud instructions 2024-10-23 01:59:06 +08:00
data_providers.py update gcloud instructions 2024-10-23 01:59:06 +08:00
experiment_builder.py update gcloud instructions 2024-10-23 01:59:06 +08:00
install.sh update gcloud instructions 2024-10-23 01:59:06 +08:00
model_architectures.py update gcloud instructions 2024-10-23 01:59:06 +08:00
README.md update gcloud instructions 2024-10-23 01:59:06 +08:00
storage_utils.py update gcloud instructions 2024-10-23 01:59:06 +08:00
train_evaluate_emnist_classification_system.py update gcloud instructions 2024-10-23 01:59:06 +08:00

MLP Compute Engines Tutorials Branch

A short code repo that guides you through the process of running experiments on the Google Cloud Platform.

Why do I need it?

Most Deep Learning experiments require a large amount of compute as you have noticed in term 1. Usage of GPU can accelerate experiments around 30-50x therefore making experiments that require a large amount of time feasible by slashing their runtimes down by a massive factor. For a simple example consider an experiment that required a month to run, that would make it infeasible to actually do research with. Now consider that experiment only requiring 1 day to run, which allows one to iterate over methodologies, tune hyperparameters and overall try far more things. This simple example expresses one of the simplest reasons behind the GPU hype that surrounds machine learning research today.

Introduction

The material available includes tutorial documents and code, as well as tooling that provides more advanced features to aid you in your quests to train lots of learnable differentiable computational graphs.

Getting Started

Google Cloud Platform

Google Cloud Platform (GCP) is a cloud computing service that provides a number of services, including the ability to run virtual machines (VMs) on their infrastructure. The VMs are called Compute Engine instances.

As an MLP course student, you will be given 50$ worth of credits. This is enough to run a number of experiments on the cloud.

To get started with GCP, please read the this getting started guide.

The guide will take you through the process of setting up a GCP account, creating a project, creating a VM instance, and connecting to it. The VM instance will be a GPU-endowed Linux machine that already includes the necessary PyTorch packages for you to run your experiments.