In this guide we’ll walk you through the process of creating a dataset using the Hume API. In future sections you’ll use a dataset to train your own model.
Uploading media files to Hume
Upload media files to Hume that you want to exist in your custom dataset. These should be images, videos, audio, or text files.
The API response will show an array of files newly registered with the Hume.
Making your dataset file
We will create a CSV file that has a column for media file IDs and another column for labels.
The file ID column is required and must be named file_id
. The label column can be named whatever you want. And you can even have multiple label columns, but only one will be used for training your model.
Here we’ll add a label column called expressions
and an extra column just for housekeeping called file_name
.
file_name | file_id | expressions |
---|---|---|
neutral_face.jpeg | b3cd5662-ea89-4f00-8eae-86218a556027 | Neutral |
positive_face.jpeg | 44bc2ac8-41d5-401e-8c88-df179b993be7 | Positive |
Registering your dataset
Now that we have our media files registered and a CSV associating those files with labels, we can register our dataset.
Success! Your dataset is registered.