Take control of your data labeling

Data labeling is critical for machine learning. Don’t outsource it. Crowdsource it! Keep full control over the labeling process, timeframe, and quality requirements, 24/7. Upload any amount of unlabeled data using our free, powerful API and get your ML project going at a fraction of the cost!
Top-quality data
Collect and annotate training data that meets or exceeds industry quality standards, thanks to the multiple quality control methods and mechanisms available in Yandex.Toloka.
Scalable projects
Have any amount of image, text, speech, audio, or video data collected and labeled for you by millions of skilled Yandex.Toloka users across the globe.
Cost-efficiency
Save time and money with this purpose-built platform for handling large-scale data collection and annotation projects, on demand 24/7, at your own price and within your timeframe.
Free, powerful API
Build scalable and fully automated human-in-the-loop machine learning pipelines with a powerful open API.

Yandex.Toloka Platform

Designed by engineers for engineers, Yandex.Toloka lets you integrate an on-demand workforce directly into your processes. Our cloud-based crowdsourcing platform is a fast and efficient way to collect and label large data sources for machine learning and other business purposes.
How to collect and label data in 3 steps
Start with raw data (text, images, URLs, video, audio, or any other type)
Configure the project and quality control, train and screen users
Get results with annotated data
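The three steps above can be sketched in a few lines of Python. This is only an illustration, not the Yandex.Toloka API: the `aggregate_majority` helper, the sample task and performer IDs, and the choice of majority voting over overlapping answers are all assumptions made for the example. This page does not describe the platform's actual quality control mechanisms.

```python
from collections import Counter, defaultdict

def aggregate_majority(answers):
    """Collapse several performers' answers per task into one label.

    `answers` is an iterable of (task_id, performer_id, label) tuples.
    Returns {task_id: (label, agreement)}, where agreement is the share
    of performers who chose the winning label. Ties go to the label
    that was submitted first.
    """
    by_task = defaultdict(list)
    for task_id, _performer_id, label in answers:
        by_task[task_id].append(label)

    results = {}
    for task_id, labels in by_task.items():
        label, votes = Counter(labels).most_common(1)[0]
        results[task_id] = (label, votes / len(labels))
    return results

# Step 1: raw data (hypothetical image IDs).
# Step 2: each task is shown to three performers (an assumed overlap of 3).
# Step 3: the answers are aggregated into annotated data.
raw_answers = [
    ("img-1", "p1", "cat"), ("img-1", "p2", "cat"), ("img-1", "p3", "dog"),
    ("img-2", "p1", "dog"), ("img-2", "p2", "dog"), ("img-2", "p3", "dog"),
]
labeled = aggregate_majority(raw_answers)
```

The agreement share gives a simple signal for which tasks may need a second look. A low value suggests the performers disagreed.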

Use Cases

Yandex.Toloka helps improve models of any kind, including audio and natural language processing, computer vision, chatbots and voice assistants, and search and information retrieval. It also offers solutions for business challenges and projects of any scale.
Object Recognition & Detection
Train your computer vision model by labeling image elements with bounding boxes, polygons or key points, as well as utilizing image segmentation and tagging based on your own ontology.
Price for 1000 tasks: $15.
Turnaround time: 3 hours.*
Image & Video Classification
Collect a library of annotated images or videos — grade image quality, classify by type, identify objects or content, or receive any other judgments you need.
Price for 1000 tasks: $4.50.
Turnaround time: 1 hour.*
Image Transcription
Get the text in PDF files annotated and transcribed to train your algorithms to better identify specific parts of documents, or validate and fine-tune the output of your own OCR models.
Price for 1000 tasks: $15.
Turnaround time: 3 hours.*
Side-by-Side Comparison
Perform side-by-side comparisons to decide which option works best and check that your images match descriptions, or get any other judgments you need to verify or clean up your data.
Price for 100 image pairs: starting at $1.50.
Turnaround time: 1 hour.*
Image & Video Moderation
Make sure inappropriate content isn’t uploaded to your site. Set rules for all types of images and videos with instructions and details about what is acceptable and what is not.
Price for 1000 tasks: $4.50.
Turnaround time: 1 hour.*
Image & Video Collection
Create collections of images and videos based on specific topics or motifs, lighting, angles or environments. Ask crowd performers to record video snippets according to your specifications and instructions.
Price for 100 tasks: $6.
Turnaround time: 5 hours.*
Search Relevance
Use Yandex.Toloka to evaluate the performance of your search engine, find out which ranking model works best, and improve the search algorithm.
Price for 1000 tasks: $18.
Turnaround time: 4 hours.*
Text Classification
Categorize any kind of text into predefined categories or ontologies to get the training data you need to optimize your NLP model.
Price for 1000 tasks: $18.
Turnaround time: 2 hours.*
Sentiment Analysis
Categorize any kind of text by sentiment and reasons behind the sentiment for any purpose, from understanding customer reviews to spam filtering.
Price for 1000 tasks: $4.50.
Turnaround time: 1 hour.*
Intent Classification
Train your chatbot, voice assistant, or any other conversational agent to better understand your users’ intent.
Price for 100 tasks: $6.
Turnaround time: 1 hour.*
Utterance Collection
Power your conversational agent with text utterances collected by performers based on instructions or scenarios that you provide.
Price for 100 tasks: $12.
Turnaround time: 4 hours.*
Named Entity Recognition
Create your own custom ontology to identify parts of speech, classify proper nouns, or label any other entities for your project or named entity recognition (NER) model.
Price for 1000 tasks: $18.
Turnaround time: 1 hour.*
Audio Data Collection
Create or fine-tune a voice interface using speech samples recorded by Yandex.Toloka performers according to your instructions, with fast turnaround and low prices. Enhance TTS (Text-to-Speech) and speech synthesis technologies with high-quality audio data.
Price for 1000 tasks: $18.
Turnaround time: 3 hours.*
Audio Transcription
Convert audio to text with Yandex.Toloka transcribers, or ask performers to check transcriptions for correctness and use the resulting data to improve speech recognition models. Transcribe any number of audio files quickly and accurately.
Price for 1000 tasks: $7.
Turnaround time: 4 hours.*
Audio Classification
Use Yandex.Toloka to detect emotion, categorize topics, or identify events in audio samples or conversations to improve your model.
Price for 1000 tasks: $7.50.
Turnaround time: 2 hours.*
Business data
Collect information like website URLs or business hours, or categorize businesses by type, location, or size. Use Yandex.Toloka to enrich business data and gather the information you need.
Price for 1000 tasks: $7.20.
Turnaround time: 4 hours.*
Surveys
Conduct surveys of thousands of Yandex.Toloka performers with a variety of backgrounds using various types of survey formats.
Price for 100 tasks: $6.
Turnaround time: 1 hour.*
Content Generation
Generate any type and amount of content, such as product descriptions, recipes, or user manuals, for any kind of project, on demand 24/7.
Price for 100 tasks: $6.
Turnaround time: 1 hour.*
Offline data collection
Gather information about businesses like name, phone number, menu, and opening hours. Digitize offline data for your projects and business goals.
Price: starting at $40. Up to 750 points visited in one city per week.*
Price monitoring
Get information about actual in-store prices for your products and competitors' discounts at specific retail outlets, then adjust your pricing to match.
Price per item: starting at $0.03.
Timeframe for price checks: 10 days.*
Merchandising
Check where your products are placed and the allocated share of shelf space. Compare this data with information about competitors' products.
Price per shelf: starting at $0.10.
Timeframe for store monitoring: 10 days.*
Secret buyer
Get insights into staff performance, quality of service, and demand for your products.
Price per secret online purchase: starting at $1.
Timeframe for shopping: 2 weeks.*
Foot traffic
Use Yandex.Toloka to learn about foot traffic and road traffic for analyzing strategic retail points.
Price per traffic check (one retail point): starting at $0.50.* 
Ad monitoring
Collect data for analyzing and monitoring outdoor advertising. Track promo campaigns.
Price: starting at $150. Up to 800 points visited in one city per month.*
* Approximate cost. Includes 20% Yandex.Toloka commission. Not a public offer. Price and turnaround time for tasks are set by the requester and depend on the type of task, input data, and other factors.
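As a rough illustration of how the listed prices scale, the arithmetic can be sketched in Python. The helper names are hypothetical, and the sketch assumes the 20% commission is charged on top of the performer reward (total = reward × 1.2). The footnote does not say how the commission is calculated, so treat the split as an estimate only.

```python
# Commission rate from the footnote; assumed to be added on top of the
# performer reward rather than taken as a share of the total.
COMMISSION_RATE = 0.20

def estimate_cost(listed_price, tasks_per_price, task_count):
    """Scale a listed price (e.g. $15 per 1000 tasks) to task_count tasks."""
    return round(listed_price * task_count / tasks_per_price, 2)

def split_commission(total, rate=COMMISSION_RATE):
    """Split a commission-inclusive total into (performer_reward, commission)."""
    reward = round(total / (1 + rate), 2)
    return reward, round(total - reward, 2)

# Example: 2,500 object detection tasks at the listed $15 per 1000 tasks.
total = estimate_cost(15.0, 1000, 2500)
reward, commission = split_commission(total)
```

Because requesters set the actual price, these figures are a budgeting aid rather than a quote.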
Success stories
Yandex.Toloka News
Receive information about platform updates, partners, training materials, and other news.
Wed Oct 21 2020 22:10:45 GMT+0300 (Moscow Standard Time)