Audio classification — Step-by-step instructions

The challenge

We have a set of voice recordings from different people. We need to get these classified according to the speaker’s gender. We ask performers to listen to the recordings and decide whether it is a man or a woman speaking.
Here’s what it might look like:

Video: 6 basic steps to run a project
Create
a project
Create
a task pool
Upload a file
with data
Create
control tasks
Launch
the pool
Get the
results

Create a project

Interface code

{
  "view": {
    "type": "view.list",
    "items": [
      {
        "type": "view.audio",
        "url": {
          "type": "data.input",
          "path": "path"
        },
        "validation": {
          "type": "condition.played",
          "hint": "you need to listen to the audio"
        }
      },
      {
        "type": "field.button-radio-group",
        "label": "Is it a male or female speaker?",
        "options": [
          {
            "label": "Female",
            "value": "female"
          },
          {
            "label": "Male",
            "value": "male"
          }
        ],
        "data": {
          "type": "data.output",
          "path": "result"
        },
        "validation": {
          "type": "condition.required"
        }
      }
    ]
  },
  "plugins": [
    {
      "type": "plugin.toloka",
      "layout": {
        "kind": "scroll",
        "taskWidth": 300
      }
    },
    {
      "1": {
        "type": "action.set",
        "data": {
          "type": "data.output",
          "path": "result"
        },
        "payload": "female"
      },
      "2": {
        "type": "action.set",
        "data": {
          "type": "data.output",
          "path": "result"
        },
        "payload": "male"
      },
      "type": "plugin.hotkeys"
    }
  ]
}

Create a task pool

Upload a file with data

Dataset
Prepare a TSV file with tasks as shown in our example.
Origin
Bibtex:
@article{adigwe2018emotional,
title={The emotional voices database: Towards controlling the emotion dimension in voice generation systems},
author={Adigwe, Adaeze and Tits, No{'e} and Haddad, Kevin El and Ostadabbas, Sarah and Dutoit, Thierry},
journal={arXiv preprint arXiv:1806.09514},
year={2018}
}

Create control tasks

Launch the pool

Get the results

Application for corporate training
We are offering corporate training to help you solve existing challenges and develop
an internal team of Crowd Science Architects (CSA).
Thu Sep 09 2021 12:58:11 GMT+0300 (Moscow Standard Time)