Difficulty: Beginner
Estimated Time: 10-15 min

In this lesson we will build a simple ML project from scratch. It explores the NLP problem of predicting tags for a given StackOverflow question. For example, we want the model to classify posts about the Python language by tagging them with python.

However the aim is to show how DVC can be used, so ML details will not be discussed. We will assume that the python scripts process the data properly and will not explain how it works.

Initialize

Step 1 of 3

Step 1

Create a Git repository

mkdir example-get-started

cd example-get-started/

git init

DVC doesn't require Git and can work without it, but in practice it is almost always used with Git.