How to upload tasks
To upload file with tasks to a pool:
-
Go to the Prepare and upload data step if you're creating a pool or click the Upload button on the pool page.
-
Attach a prepared dataset or media files.
A prepared datasetMedia files-
Download the template and replace the sample data in it with your own data.
-
Click Select prepared dataset and choose your file with tasks.
Use sample dataIf you want to see what your project will look like after the launch, but you don't have any labeling tasks yet, you can upload ready-made sample data to the pool. Sample data is available for the templates:
- Image classification
- Product search relevance
- Object recognition & detection
- Clickbait or not?
Click Use sample data next to Attach the prepared file with data. This lets you avoid any additional actions with files.
Once you've finished working with the sample data and everything looks good, prepare your data and upload it to the pool.
Upload media files from your device to avoid difficulties with placing links in the file with tasks.
To the right of the Select prepared dataset link, click Select media files and choose files.
Features:
-
This option is available only for the projects that have a single
URL
type input field in the specification. -
After uploading, tasks are automatically created with links to the uploaded files.
Note
Supported file formats:
- Images — GIF, JPG, JPEG, PNG, WebP.
- Audio — FLAC, MP3, WAV, M4A.
- Video — MP4.
-
-
Click Continue.
-
Choose a way to put tasks into suites and specify the number of tasks per suite. For more information, see Ways to group tasks in suites.
-
Click Combine tasks into suites.
-
Wait for the result.
To delete all the tasks in the pool, click Delete.
Restriction
You can add up to one million tasks to the pool. To upload more tasks, create another pool.
Tips and recommendations
-
Set the number of tasks per suite depending on the complexity and time allocated for a task.
We recommend that you distribute them so that each task suite takes no more than five minutes to complete.
-
If you get a processing error, it means that the data file is not formatted correctly. For example, there are unnecessary tabs in the file or some lines, headers, or quotes are missing.
In this case, click Cancel, correct the mistakes, and then upload the file again.
Processing errors
If an error occurred while uploading the file, use the table.
To view the processing log, click More on uploading errors. The processing log is written in JSON format. Objects inside result
match the line number of the uploaded file. Lines that were processed with an error have the status "success": false
.
Tip
To work with a large log conveniently, copy it to the text editor.
"parsing_error_of": "https://tlk.s3.yandex.net/wsdm2020/photos/2d5f63a3184919ce7e3e7068cf93da4b.jpg\t\t",
"exception_msg": "the nameMapping array and the sourceList should be the same size (nameMapping length = 1, sourceList size = 3)"
Overview |
How to fix |
Extra tabs. If the uploaded file contains more For example, if 1 column is set in the input data, and two more |
Remove extra column separators in the above example — both |
"exception_msg": "the nameMapping array and the sourceList should be the same size (nameMapping length = 4, sourceList size = 6)"
Overview |
How to fix |
The number of fields in the header and in the row doesn't match. |
Make sure that:
|
"code": "VALUE_REQUIRED", "message": "Value must be present and not equal to null"
Overview |
How to fix |
The value is missing for a required input field. |
Make sure that columns with required input data fields are filled. |
"code": "INVALID_URL_SYNTAX", "message": "Value must be in valid url format"
Overview |
How to fix |
Invalid data in a “link” (“url”) field. |
Make sure that:
|
"exception_msg": "unexpected end of file while reading quoted column beginning on line 2 and ending on line 4"
Overview |
How to fix |
Unpaired quotation mark in a string. |
Check that all quotation marks are escaped. |
What's next
-
Learn more about ways to group tasks in suites.
-
Check the project settings and tasks with the help of special labeling mode before you send them to real Tolokers.
-
If you haven't labeled your control and training tasks in the file, edit the tasks in the interface.
See also
For developers
Troubleshooting
If the column headings are incorrect, the whole file is rejected. Otherwise, Toloka specifies the number of tasks with processing errors.
Check the hint
field. For the general tasks, this field must be empty.
The number of tasks depends on how difficult and time-consuming the tasks are. Keep the size reasonably small. Large task suites are unpopular, partly because they are inconvenient for Tolokers (for example, if the internet connection is unstable).
Try completing the tasks yourself. Ask your colleagues and friends to complete them. Find out average completion time and add 50% to it.
The error might occur if the expected input type is URL, but a string is received.
There may be two reasons:
-
The input field has the "link" type.
-
The pool was created for an outdated project version. It means that the pool was created before you changed the input field type.
It depends on the task. Technically, you can use as many tasks you want.
But Tolokers don't like to take lengthy tasks. They'd rather do 10 tasks that take one minute each than one task that takes 10 minutes.
In addition, if you use a large number of tasks on the page, there might be issues with uploading the files to be labeled. This problem might occur with images.
The third thing to consider is quality control and manual review. If you allow recompletion of assignments by banned Tolokers, you should split the task into smaller parts so that fewer assignments are recompleted. You are more likely to meet your budget this way.
Toloka will group the uploaded image links in the file into suites depending on the task distribution method you use. For more information, see Ways to group tasks in suites.
A task means a separate task. A task suite means a page with tasks. The Toloker gets paid for a task suite.