Register a Data Family

Data Families allow you to group related Datasets that you register with MarkovML.

For example, you can group all your datasets for sentiment classifier under Sentiment Analysis Data Family . Note that datasets on MarkovML are immutable, i.e., once registered, you should not update an existing dataset. Data Family can also be considered as a logical collection of all versions of datasets for a specific domain.

When you register a dataset, you are required to specify the parent data family for the dataset.

There are two ways of creating a data family

circle-info

A data family should be created before registering any associated datasets.

Create a Data Family Using the Web UI

You can add a new data family as part of the workflow to register a dataset from the MarkovML web application. Once logged in, navigate to the Datasets page. Click the "Add New Dataset" button at the top of the screen.

Begin workflow to add a new dataset

In the final step ("Confirm features"), open the Data family dropdown menu, and you'll see an option to add a new data family.

Click option to add a data family

Give the data family a unique name, and a brief optional description, then click save.

Create a new data family

Create a Data Family Using Python Library

You can create a Data Family on Markov using a single line of Python code.

Now that you've successfully created a data family let's move on to registering a dataset with MarkovML.

Last updated