# How to edit tasks by yourself

What makes a good hint?

Avoid wordings like: “You answered incorrectly, please provide the correct response”. The Toloker learns when the hint explains the essence of their mistake.

Make the hints clear. Explain which response should be chosen and why.

Restriction

Task markup is available only for training pools and pools uploaded with “smart mixing”.

## How do I mark up tasks?

1. Make sure the pool isn't running.

2. Click Edit in the Pool tasks block.

3. Open the General, Control, or Training tab. Create a control or a training task from another type of task.

4. Select the responses that should be checked. (The list of the output data fields is on the right.)

5. Click Save and go to next.

In this example, the correct response is added for the control task. Only the choice in the result field (Good/Bad/Loading error) is checked.

## Improve the reliability of control tasks

After you have created the control tasks, make sure that different variations of correct responses occur with the same frequency. This will help avoid random guessing in responses.

1. Go to the task markup page.

2. Open the Training tasks → Distribution of correct responses for control tasks tab.

The distribution of responses is shown as a percentage.

Tip

#### Example of response distribution

1. Make sure the pool isn't running.

2. Click Edit in the Pool tasks block.

3. Click the tab with the type of the task.

4. Find the task in the list and click . Editing mode opens.

## How to delete a task from the pool

1. Make sure the pool isn't running.

2. Click Edit in the Pool tasks block.

3. Click the tab with the type of the task.

4. Find the task in the list and click . You can also delete a task from the pool in editing mode.

Tip

If you set the overlap to 0 via the API, the task won't be visible to Tolokers, and you won't have to delete it.

## Troubleshooting

How do I view the processing log?

To view the processing log, click More on uploading errors. The processing log is written in JSON format. Objects inside result match the line number of the uploaded file. Lines that were processed with an error have the status "success": false.

To work with a large log conveniently, copy it to the text editor.

If the column headings are incorrect, the whole file is rejected. Otherwise, Toloka specifies the number of tasks with processing errors.

#### Processing errors tables

"parsing_error_of": "https://tlk.s3.yandex.net/wsdm2020/photos/2d5f63a3184919ce7e3e7068cf93da4b.jpg\t\t",
"exception_msg": "the nameMapping array and the sourceList should be the same size (nameMapping length = 1, sourceList size = 3)"

 Overview How to fix Extra tabs. If the uploaded file contains more \t column separators after the data or the link than the number of columns set in the input data, you get an error message. For example, if 1 column is set in the input data, and two more \t\t tabs are added in the file after the link, you get 3 columns, 2 of which are excessive. Remove extra column separators in the above example — both \t\t characters.
"exception_msg": "the nameMapping array and the sourceList should be the same size (nameMapping length = 4, sourceList size = 6)"

 Overview How to fix The number of fields in the header and in the row doesn't match. Make sure that: The number of tabs in the file structure is correct. String values with tab characters are enclosed in quotation marks" ".
"code": "VALUE_REQUIRED", "message": "Value must be present and not equal to null"

 Overview How to fix The value is missing for a required input field. Make sure that columns with required input data fields are filled.
"code": "INVALID_URL_SYNTAX", "message": "Value must be in valid url format"

 Overview How to fix Invalid data in a “link” (“url”) field. Make sure that: Links start with the http://, https:// or www prefix.
"exception_msg": "unexpected end of file while reading quoted column beginning on line 2 and ending on line 4"

 Overview How to fix Unpaired quotation mark in a string. Check that all quotation marks are escaped.
How do I know how many tasks a Toloker will see on the page?

How do I upload the file with the accepted assignments back to Toloka for projects with manual review? Where do I find the format of the upload data?

Assignments are reviewed in a file.

Why haven't I received assignments since I launched my first project, and all the uploaded assignments are marked as "Training"?

Check the hint field. For the general tasks, this field must be empty.

How do I create the task file properly so that there are no errors?

In the file with the general tasks, the columns with the INPUT headers must be filled out. You can see those headers if you download a sample file from the pool.

If you are creating control tasks, fill out the GOLDEN columns with the correct responses.

If you are creating a training task, you also need to fill in the HINT:text column. For the general tasks you don't need any columns other than INPUT, so feel free to delete them.

The file format must be TSV, XLSX or JSON, and the encoding must be UTF-8.

Why do I see a syntax error when I upload a task where a user has to view an image and write feedback?

The error might occur if the expected input type is URL, but a string is received.

There may be two reasons:

• The input field has the "link" type.

• The pool was created for an outdated project version. It means that the pool was created before you changed the input field type.

What is the maximum number of tasks per page?

It depends on the task. Technically, you can use as many tasks you want.

But users are reluctant to take lengthy tasks. They'd rather do 10 tasks that take one minute each than one task that takes 10 minutes.

In addition, if you use a large number of tasks on the page, there might be issues with uploading the files to be labeled. This problem might occur with images.

The third thing to consider is quality control and manual review. If you use recompletion of assignments from banned users, you should split the task into smaller parts so that fewer assignments are recompleted. You are more likely to meet your budget this way.

How do I specify smart mixing settings in the interface when uploading a file?

Smart mixing settings are specified for the file rather than for the pool.

The settings specified during the first file upload are applied to all the files that are uploaded to this pool later on.

What is the right time limit for the task completion?

The same task appeared on different pages

The same task may appear on different pages if:

• Dynamic overlap is used (incremental relabeling, IRL). As an example, let's say there were 5 tasks on a page. For 4 of them, responses coincided and the common response was counted as correct. The fifth task was mixed into another set because it didn't get into the final response and it needs to be “reassessed”.

• Different tasks have different overlap. Tasks with higher overlap will be additionally shown in sets with the other remaining tasks in the pool.

• If a quality control rule changes a task's overlap, it will appear in a different set.

How many tasks should be in a suite?

The number of tasks depends on how difficult and time-consuming the tasks are. Keep the size reasonably small. Large task suites are unpopular, partly because they are inconvenient for Tolokers (for example, if the internet connection is unstable).

We recommend adding at least 1% of control tasks in the pool. And for small pools — 5–10%.

Why's that?

Each control task is shown to the Toloker only once. If you use smart mixing, you determine how many control tasks should be in a suite. If each suite contains one control task, then the maximum number of suites the Toloker can complete is equal to the number of control tasks in the pool. If you increase the number of control tasks in a suite, the number of suites available to the Toloker decreases by the same number.

There shouldn't be too few pages available. Otherwise:

• You won't be able to correctly evaluate the quality of the Toloker's responses.

• The Toloker won't be interested in completing such tasks because they'll spend a lot of time studying instructions but won't earn much.

Example

#### A large pool with 1% of control tasks (good)

There are 10,000 tasks in the pool, and 100 of them are control tasks (1%). Each suite contains 10 tasks, and 1 of them is a control task. Hence, a user can complete up to 100 suites.

There are 100 tasks in the pool, and 1 of them is a control task (1%). Each suite contains 10 tasks, and 1 of them is a control task. Hence, each user can only complete 1 suite.

#### A small pool with 10% control tasks (good)

There are 100 tasks in the pool, and 10 of them are control tasks (10%). Each suite contains 10 tasks, and 1 of them is a control task. Hence, each user can complete up to 100 suites

What for

In a large pool with few control tasks, a situation might occur when users who have completed a lot of tasks in the project stop getting new task suites. This happens when the Toloker completes all control tasks in the pool.

Note

To filter out Tolokers, use the Control tasks quality control rule. To rank Tolokers by the quality of responses in control tasks, use a skill.

How are the correct responses to control questions counted?

The Control tasks rule starts working after the Toloker completes the number of control tasks you specified. If your pool contains both training and control tasks, you can take into account the responses in both of them (the Number of responses parameter) or only in control tasks (the Number of control responses parameter).

As soon as the needed number of responses is collected, Toloka calculates the percentage of correct and incorrect responses and performs an action (assigns a skill, or blocks the Toloker in the pool or in the project). Then this percentage is updated as the tasks are completed by the Toloker. The number of the Toloker's recent responses that's used in the calculation is set in the Recent control task responses to use field. If you leave it empty, all the responses from the Toloker in the pool are counted.