Difficulty: Beginner
Estimated Time: 10-15 min

In this lesson we will build a simple ML project from scratch. It explores the NLP problem of predicting tags for a given StackOverflow question. For example, we want the model to classify posts about the Python language by tagging them with python.

However the aim is to show how DVC can be used, so ML details will not be discussed. We will assume that the python scripts process the data properly and will not explain how it works.

Initialize

Step 1 of 3

Step 1

Create a Git repository

mkdir example-get-started

cd example-get-started/

git init

DVC doesn't require Git and can work without it, but in practice it is almost always used with Git.

This tab will not be visible to users and provides only information to help authors when creating content.

Creating Katacoda Scenarios

Thanks for creating Katacoda scenarios. This tab is designed to help you as an author have quick access the information you need when creating scenarios.

Here are some useful links to get you started.

Running Katacoda Workshops

If you are planning to use Katacoda for workshops, please contact [email protected] to arrange capacity.

Debugging Scenarios

Below is the response from any background scripts run or files uploaded. This stream can aid debugging scenarios.

If you still need assistance, please contact [email protected]