Data Families allow you to group related Datasets that you register with MarkovML.
For example, you can group all your datasets for sentiment classifier under Sentiment Analysis Data Family . Note that datasets on MarkovML are immutable, i.e., once registered, you should not update an existing dataset. Data Family can also be considered as a logical collection of all versions of datasets for a specific domain.
When you register a dataset, you are required to specify the parent data family for the dataset.
There are two ways of creating a data family
Create a Data Family Using the Web UI
You can add a new data family as part of the workflow to register a dataset from the MarkovML web application.
Once logged in, navigate to the Datasets page. Click the "Add New Dataset" button at the top of the screen.
Begin workflow to add a new dataset In the final step ("Confirm features"), open the Data family dropdown menu, and you'll see an option to add a new data family.
Click option to add a data family Give the data family a unique name, and a brief optional description, then click save.
Create a Data Family Using Python Library
You can create a Data Family on Markov using a single line of Python code.
Now that you've successfully created a data family let's move on to registering a dataset with MarkovML.