LogoLogo
MarkovML HomeLogin to appSDK Docs
  • MarkovML Developer Hub
  • Guides
    • 🖥️Setup MarkovML SDK
      • Upgrading your SDK version
    • 📊Working with Datasets
      • Register Datasets
        • Register Access Credentials (Cloud Storage)
        • Register a Data Family
        • Register a Dataset
      • Read Datasets
        • List Datasets
        • Fetch Dataset
        • Download Dataset Segment
        • Get Dataset Preview
      • Compare Datasets
      • Data Quality
      • Data Family Operations
    • 📚Manage Models and Projects
      • Manage Projects
        • Create a Project
        • Access an existing Project
      • Track Experiments
        • Create Experiment Recorder
        • Add Records to an Experiment Recorder
        • Add Summary to an Experiment Recorder
        • Integrations
        • Examples
          • Image Classifier
      • Record Model Evaluations
        • Create an Evaluation Recorder
        • Add Evaluation Records
      • Complete Examples
        • Sentiment Classifier
        • MNIST Data Classifier
    • 📧Email Notification
  • Fundamentals
    • Datasets & Data Families
    • Projects
    • Models
    • Experiments
    • Model Evaluations
  • 📖CLI Docs
  • CHANGELOG
    • July 5, 2023
    • June 14, 2023
    • May 30, 2023
    • May 1, 2023
    • March 15 2023
    • March 2 2023
    • February 14 2023
    • January 12 2023
    • December 7 2022
    • November 17 2022
Powered by GitBook
On this page
  • Create a Data Family Using the Web UI
  • Create a Data Family Using Python Library

Was this helpful?

  1. Guides
  2. Working with Datasets
  3. Register Datasets

Register a Data Family

PreviousRegister Access Credentials (Cloud Storage)NextRegister a Dataset

Last updated 2 years ago

Was this helpful?

allow you to group related that you register with MarkovML.

For example, you can group all your datasets for sentiment classifier under Sentiment Analysis Data Family . Note that datasets on MarkovML are immutable, i.e., once registered, you should not update an existing dataset. Data Family can also be considered as a logical collection of all versions of datasets for a specific domain.

When you register a dataset, you are required to specify the parent data family for the dataset.

There are two ways of creating a data family

A data family should be created before registering any associated datasets.

Create a Data Family Using the Web UI

You can add a new data family as part of the workflow to from the MarkovML web application. Once logged in, navigate to the Datasets page. Click the "Add New Dataset" button at the top of the screen.

Give the data family a unique name, and a brief optional description, then click save.

Create a Data Family Using Python Library

import markov

# Create a new data family for the dataset
df_reg_resp = markov.data.register_datafamily(
    name="Hate Speech Data Family",
    notes="This is a data family for hate speech datasets",
    lang="en-us",
    source="SOURCE_OF_THIS_DATASET",#e.g kaggle, customer_alpha, annotation_
)

Now that you've successfully created a data family let's move on to registering a dataset with MarkovML.

In the final step ("Confirm features"), open the dropdown menu, and you'll see an option to add a new data family.

You can create a on Markov using a single line of Python code.

📊
Data Family
Datasets
register a dataset
Through UI
Through SDK
Begin workflow to add a new dataset
Click option to add a data family
Create a new data family
Data Families
Data family