Text classification — Step-by-step instructions

The challenge

We have a set of news article headlines. We need to get these classified according to whether they are clickbait or not. We ask performers to read a headline and decide whether it’s clickbait. Here’s what it might look like:

Video: 7 basic steps to run a project
Create
a project
Create
a training pool
Create
a task pool
Upload a file
with data
Create
control tasks
Launch
the pool
Get the
results

Create a project

Interface code

{
  "view": {
    "type": "view.list",
    "items": [
      {
        "type": "view.group",
        "content": {
          "type": "view.list",
          "items": [
            {
              "type": "view.text",
              "content": {
                "type": "helper.join",
                "items": [
                  "Headline: ",
                  {
                    "type": "data.input",
                    "path": "headline"
                  }
                ],
                "by": ""
              }
            },
            {
              "type": "field.radio-group",
              "label": "Is this headline clickbait?",
              "options": [
                {
                  "label": "Clickbait",
                  "value": "clickbait"
                },
                {
                  "label": "Not clickbait",
                  "value": "notclickbait"
                }
              ],
              "data": {
                "type": "data.output",
                "path": "category"
              },
              "validation": {
                "type": "condition.required",
                "hint": "you need to select one answer"
              }
            }
          ]
        }
      }
    ]
  },
  "plugins": [
    {
      "type": "plugin.toloka",
      "layout": {
        "kind": "scroll",
        "taskWidth": 300
      }
    },
    {
      "1": {
        "type": "action.set",
        "data": {
          "type": "data.output",
          "path": "category"
        },
        "payload": "clickbait"
      },
      "2": {
        "type": "action.set",
        "data": {
          "type": "data.output",
          "path": "category"
        },
        "payload": "notclickbait"
      },
      "type": "plugin.hotkeys"
    }
  ]
}

Create a training pool

Training dataset
Prepare a training dataset as shown in our example.
Origin
License: MIT

Create a task pool

Upload a file with data

Dataset
Prepare a TSV file with tasks as shown in our example.
Origin
License: MIT

Create control tasks

Launch the pool

Get the results

Application for corporate training
We are offering corporate training to help you solve existing challenges and develop
an internal team of Crowd Science Architects (CSA).
Thu Sep 09 2021 13:00:10 GMT+0300 (Moscow Standard Time)