Audio collection — Step-by-step instructions

The challenge

We have a set of texts and need voice recordings of them. We ask performers to read each text aloud and record themselves. Recordings like these are used to train voice assistants.
Here’s what it might look like:

Video: 5 basic steps to run a project (create a project, create a task pool, upload a file with data, launch the pool, get the results)

Create a project

Interface code

{
  "view": {
    "type": "view.list",
    "items": [
      {
        "type": "view.text",
        "content": {
          "type": "data.input",
          "path": "text"
        }
      },
      {
        "type": "field.audio",
        "data": {
          "type": "data.output",
          "path": "audio_file"
        },
        "validation": {
          "type": "condition.required"
        }
      }
    ]
  },
  "plugins": [
    {
      "type": "plugin.toloka",
      "layout": {
        "kind": "scroll",
        "taskWidth": 500
      }
    }
  ]
}
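
The interface code above binds one input field and one output field. As a rough sketch of how to check which field names the rest of the project has to agree with, the Python snippet below walks a trimmed copy of that config and lists every `data.input` and `data.output` path. The helper name `collect_paths` is made up for this illustration and is not part of any Toloka tooling.

```python
import json

# Trimmed copy of the interface config from this tutorial (plugins omitted).
CONFIG = json.loads("""
{
  "view": {
    "type": "view.list",
    "items": [
      {"type": "view.text",
       "content": {"type": "data.input", "path": "text"}},
      {"type": "field.audio",
       "data": {"type": "data.output", "path": "audio_file"},
       "validation": {"type": "condition.required"}}
    ]
  }
}
""")


def collect_paths(node, kind):
    """Recursively gather the "path" of every object whose "type" equals kind."""
    found = []
    if isinstance(node, dict):
        if node.get("type") == kind and "path" in node:
            found.append(node["path"])
        for value in node.values():
            found.extend(collect_paths(value, kind))
    elif isinstance(node, list):
        for item in node:
            found.extend(collect_paths(item, kind))
    return found


input_fields = collect_paths(CONFIG, "data.input")
output_fields = collect_paths(CONFIG, "data.output")
print("input fields:", input_fields)    # ['text']
print("output fields:", output_fields)  # ['audio_file']
```

In this config the task data must supply a `text` field, and each response will carry an `audio_file` field.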

Create a task pool

Upload a file with data

Dataset
Prepare a TSV file with tasks as shown in our example.
Origin
Valentini-Botinhao, Cassia. (2017). Noisy speech database for training speech enhancement algorithms and TTS models, 2016 [sound]. University of Edinburgh. School of Informatics. Centre for Speech Technology Research (CSTR). https://doi.org/10.7488/ds/2117.
License: CC BY 4.0
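
If your texts live in a Python list, a file like this could be generated with a few lines of code. The sketch below is only an illustration: the sample sentences and the file name `tasks.tsv` are hypothetical, and the `INPUT:text` header assumes the usual convention of prefixing the input field name (`text` in the interface code) with `INPUT:`. Compare the result with the example file before uploading.

```python
import csv
import io

# Hypothetical sample texts; replace them with the sentences you want recorded.
texts = [
    "Please call Stella.",
    "Ask her to bring these things with her from the store.",
]


def build_tasks_tsv(texts):
    """Return TSV content with one task per line under an INPUT:text header."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    # Assumed header format: "INPUT:" followed by the input field name.
    writer.writerow(["INPUT:text"])
    for text in texts:
        # Tabs and line breaks would break the row layout, so collapse
        # every run of whitespace into a single space.
        writer.writerow([" ".join(text.split())])
    return buffer.getvalue()


tsv_content = build_tasks_tsv(texts)

# Hypothetical output file name.
with open("tasks.tsv", "w", encoding="utf-8", newline="") as f:
    f.write(tsv_content)
```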

Launch the pool

Get the results

Automated review

Another way to review tasks is to have other performers check them. We recommend this option when you have limited resources for checking tasks yourself.

  • Create an audio classification project based on our demo.
  • Ask the performers whether the recordings match the texts.
  • After the answers are collected, go back to the initial pool and upload the review results on the Review assignments page.
  • The file should contain the acceptance verdict and a comment. A template can be found in the Download results section.
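
The last two steps can be sketched in Python: collapse the reviewers' answers into one verdict per assignment, then write a file with the verdict and a comment. Everything specific here is an assumption made for illustration: the answer labels `OK` and `BAD`, the assignment IDs, the majority-vote rule, and the column names `ASSIGNMENT:assignment_id`, `ACCEPT:verdict`, and `ACCEPT:comment`. Take the real column names and verdict values from the template in the Download results section.

```python
import csv
import io
from collections import Counter

# Hypothetical review answers: (assignment ID, reviewer's answer).
# "OK" means the recording matches the text, "BAD" means it does not.
review_answers = [
    ("assignment-1", "OK"), ("assignment-1", "OK"), ("assignment-1", "BAD"),
    ("assignment-2", "BAD"), ("assignment-2", "BAD"), ("assignment-2", "OK"),
]


def majority_verdicts(answers):
    """Map each assignment ID to "+" (accept) or "-" (reject) by majority vote."""
    votes = {}
    for assignment_id, answer in answers:
        votes.setdefault(assignment_id, Counter())[answer] += 1
    # Ties are rejected here; that is a choice of this sketch, not a platform rule.
    return {
        assignment_id: "+" if counts["OK"] > counts["BAD"] else "-"
        for assignment_id, counts in votes.items()
    }


def build_review_tsv(verdicts):
    """Return TSV content with one row per assignment: ID, verdict, comment."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    # Assumed column names; copy the real ones from the downloaded template.
    writer.writerow(["ASSIGNMENT:assignment_id", "ACCEPT:verdict", "ACCEPT:comment"])
    for assignment_id, verdict in sorted(verdicts.items()):
        if verdict == "+":
            comment = "Recording matches the text"
        else:
            comment = "Recording does not match the text"
        writer.writerow([assignment_id, verdict, comment])
    return buffer.getvalue()


verdicts = majority_verdicts(review_answers)
review_tsv = build_review_tsv(verdicts)
print(review_tsv)
```

Using an odd number of reviewers per recording avoids ties, so the tie rule rarely matters in practice.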
Thu Sep 09 2021 12:57:28 GMT+0300 (Moscow Standard Time)