In this tutorial, we will introduce data labeling via public crowdsourcing marketplaces and present the key components of efficient label collection.
In this tutorial, leading researchers and engineers from Toloka will share their unique industry experience in achieving efficient natural language annotation with crowdsourcing. We will introduce data labeling via public crowdsourcing marketplaces and present the key components of efficient label collection.
Then, in the practice session, participants will choose one real language resource production task, experiment with selecting settings for the labeling process, and launch their label collection project on Toloka, one of the world’s largest crowdsourcing marketplaces.
During the tutorial session, all projects will be run on the real Toloka crowd. We will also present useful quality control techniques and give the attendees an opportunity to discuss their own annotation ideas.
You can watch our hands-on tutorial here.
Now that you've learnt about data labeling for ML, try Toloka with this $15 promocode:
DAI21
Read about how to activate your promo code here.
Data-Driven AI Meetup: Tackling the Unique Challenges of Online Marketplaces
Our next Data-Driven AI meetup focuses on practical challenges in e-commerce. We’ll talk about search relevance evaluation and product matching for online marketplaces that have millions of SKUs.
Hosts:
Microsoft Azure Pakistan Community: How to set up an ML data labeling pipeline: best practices and examples
In this session, Magda shows you how to build data labeling pipelines through crowdsourcing. Crowdsourcing is a scalable approach that can be applied to a variety of domains. Magda will share some examples of real-life labeling projects and show you what best practices to apply in the process.
Hosts:
Be sure to attend our informative workshops,
tutorials, and webinars.