Ways to group tasks in suites

Note.

The Keep task order option is described in another section. Learn more.

By empty row

You divide the tasks into suites yourself in the TSV file. To do this, add an empty line after each task suite in the file. After you upload the file to the pool, Toloka will put the tasks that are between two empty lines into one suite.

This method is appropriate for labeling groups of data around a single object, for example, links for search queries. In this case, each suite may have a different number of tasks grouped according to certain criteria.

Set manually

Enter the number of tasks per suite. Task suites are formed from the tasks in the order they are placed in the TSV file.

This method is appropriate if you need your tasks to have a certain number of suites and don't want to divide them into suites yourself.

Smart mixing

Specify how many tasks of each type should be in each task suite. For example, 8 main tasks, 1 training and 1 control task. If necessary, specify the minimum number of tasks for each type in additional settings.

When to use

This method is useful if the created pool:

Sample settings


Features
  • Tasks are divided into lists: regular, control, and training.

  • The number of tasks of the given type that you specified in the settings is added from each list. By default, tasks are randomly selected.

  • If the Keep task order option is enabled, tasks are added in the same order as they were listed in the source TSV file. This takes into account the overlap: the task that goes first will be assigned until it reaches the desired overlap.

  • Tasks in the suite are mixed up before the page is shown to the performer.

  • If there aren't enough main tasks and the Assign partial page option is set, the performer is given an incomplete task suite. Please note that the number of control and training tasks in this case must be complete.

Attention. If you upload a file via “Smart mixing”, you won't be able to use other ways of task distribution on the pages in this pool.

After uploading the tasks with smart mixing you will be able to mark up tasks and set selective majority vote checking.

Setting overlap

If you upload tasks from the Toloka interface, infinite overlap is set automatically for control and training tasks, so that there is enough to mark up all main tasks.

You can set the overlap via the Toloka API or use the following ways to upload tasks: By empty row and Set manually.

Important.

Set infinite overlap for control tasks.

If another overlap value is set, control tasks may end during labeling and the pool will stop being labeled.

Smart mixing and keeping the task order

More info about keeping task order.

Smart mixing without "Keep task order"

If the Keep task order option is disabled, task suites won't be formed in order (from top to bottom), and users will get different control tasks within identical suites.

Smart mixing + "Keep task order"

If the Keep task order option is enabled, task suites will be formed in order (from top to bottom) and users will get the same control tasks within identical suites.

How to distribute tasks as suites
Characteristics/upload type By empty row and Set manually By empty row and Set manually (keep task order) Smart mixing Smart mixing (keep task order)
To generate task suites, tasks are taken in the order of rows (from top to bottom) in an uploaded file Yes Yes No Yes
Tasks are mixed within a suite No No Yes Yes
Task suites are distributed to performers in the same order No Yes Yes Yes
Within identical task suites, control tasks are the same for all performers Yes Yes No Yes
How to distribute tasks as suites
Characteristics/upload type By empty row and Set manually By empty row and Set manually (keep task order) Smart mixing Smart mixing (keep task order)
To generate task suites, tasks are taken in the order of rows (from top to bottom) in an uploaded file Yes Yes No Yes
Tasks are mixed within a suite No No Yes Yes
Task suites are distributed to performers in the same order No Yes Yes Yes
Within identical task suites, control tasks are the same for all performers Yes Yes No Yes

Tips and recommendations

  • If you used Set manually, you can find out the number of tasks per suite in the pool settings. But some suites may be incomplete.

  • If you uploaded tasks in a different way, you can check how they're grouped into suites in the Toloka interface for requesters. To do this, on the pool page, click filesDownload all tasks. You can also check task distribution across suites using the Toloka API.

Troubleshooting

How do I specify smart mixing settings in the interface when uploading a file?

Smart mixing settings are specified for the file rather than for the pool.

The settings specified during the first file upload are applied to all the files that are uploaded to this pool later on.

What is the maximum number of tasks per page?

It depends on the task. Technically, you can use as many tasks you want.

But users are reluctant to take lengthy tasks. They'd rather do 10 tasks that take one minute each than one task that takes 10 minutes.

In addition, if you use a large number of tasks on the page, there might be issues with uploading the files to be labeled. This problem might occur with images.

The third thing to consider is quality control and assignment review. If you use recompletion of assignments from banned users, you should split the task into smaller parts so that fewer assignments are recompleted. You are more likely to meet your budget this way.

The same task appeared on different pages

The same task may appear on different pages if:

  • Dynamic overlap is used (incremental relabeling, IRL). As an example, let's say there were 5 tasks on a page. For 4 of them, responses coincided and the common response was counted as correct. The fifth task was mixed into another set because it didn't get into the final response and it needs to be “reassessed”.
  • Different tasks have different overlap. Tasks with higher overlap will be additionally shown in sets with the other remaining tasks in the pool.
  • If a quality control rule changes a task's overlap, it will appear in a different set.
How do I upload the file with the accepted assignments back to Toloka for projects with non-automatic acceptance? Where do I find the format of the upload data?

Use the button Upload review results to upload your file. You can see the format here.

Assignments are reviewed in a TSV file.