What is a dataset identifier?

A dataset identifier is a string that uniquely identifies a snapshot of a dataset. It is composed of three substrings:

  • team slug
  • dataset slug
  • version

And it looks like this: <team-slug>/<dataset-slug>:<version>.

Slugs here are normalized versions of the values they represent, usually without spaces and with special characters either removed or replaced with hyphens.
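
To make the normalization concrete, here is a minimal Python sketch of how a name could be turned into a slug. The slugify function and its exact rules (lowercasing, dropping punctuation, collapsing whitespace into a single hyphen) are assumptions for illustration only; the rules actually applied when a team or dataset is created may differ.

```python
import re


def slugify(name: str) -> str:
    """Hypothetical helper: turn a display name into a slug-like string."""
    lowered = name.lower()
    # Remove anything that is not a letter, digit, whitespace, or hyphen.
    cleaned = re.sub(r"[^a-z0-9\s-]", "", lowered)
    # Collapse runs of whitespace and hyphens into a single hyphen.
    hyphenated = re.sub(r"[\s-]+", "-", cleaned)
    # Drop any hyphens left at the start or end.
    return hyphenated.strip("-")


print(slugify("Andrea's Team"))
print(slugify("Cars"))
```

Under these assumed rules, a team named "Andrea's Team" (a made-up input) would become andreas-team and a dataset named "Cars" would become cars.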

So, for our cars dataset example, a possible identifier could be: andreas-team/cars:latest.
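
As an illustration of the format, the sketch below splits an identifier into its three parts. ParsedIdentifier and parse_identifier are hypothetical names made up for this example, not part of any library, and the sketch assumes all three parts are always present.

```python
from typing import NamedTuple


class ParsedIdentifier(NamedTuple):
    """Hypothetical container for the three parts of an identifier."""
    team_slug: str
    dataset_slug: str
    version: str


def parse_identifier(identifier: str) -> ParsedIdentifier:
    """Split '<team-slug>/<dataset-slug>:<version>' into its parts."""
    # Everything before the first "/" is the team slug.
    team_slug, slash, rest = identifier.partition("/")
    # Everything after the first ":" in the remainder is the version.
    dataset_slug, colon, version = rest.partition(":")
    # Assumption: all three parts are required in this sketch.
    if not (slash and colon and team_slug and dataset_slug and version):
        raise ValueError(f"not a full dataset identifier: {identifier!r}")
    return ParsedIdentifier(team_slug, dataset_slug, version)


parsed = parse_identifier("andreas-team/cars:latest")
print(parsed.team_slug, parsed.dataset_slug, parsed.version)
```

For the cars example, this yields the team slug andreas-team, the dataset slug cars, and the version latest.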


