ICML 2021 Essentials: Toloka Workshop on July 18

Toloka Team
by Toloka Team
Image

Subscribe to Toloka News

Subscribe to Toloka News

We're counting down to the International Conference on Machine Learning (ICML) — the leading global conference in the field of AI and ML. As a Gold Level sponsor of this year's event, Toloka will present a comprehensive 4-hour virtual workshop titled "High-Quality Data Labeling at Scale with Toloka" on July 18.

About the workshop

Today, AI development rests on three pillars: algorithms, hardware, and data. For successful advances, the industry needs solutions that can bolster data quality, scalability, and flexibility. Crowdsourcing can feasibly offer these.

Our workshop will show you what the Toloka platform has to offer. We'll talk about the infrastructure for data production, practical aspects of designing data labeling tasks, the emerging profession of Crowd Solutions Architect, and the future of work for performers.

Speakers

Image
Speakers

Our lineup of experts will share their practical experience and insights into crowdsourcing:

  • Olga Megorskaya, Toloka's CEO: "Evolution of Data Production Paradigm in AI. Key Components of Future Success." Olga will discuss how AI industry needs can be adequately met during data production and processing.
  • Omar Alonso, Senior Engineering Manager at Instacart: "The Practice of Crowdsourcing." Omar will discuss practical considerations for designing and completing high-quality tasks that require the combined efforts of both humans and machines.
  • Daria Baidakova, Director of Educational Programs at Toloka: "Data Annotation at Scale. A Core Expertise of Modern ML." Daria will offer insights into the job of a Crowd Solutions Architect and explain Toloka's research grants program and Crowd Science initiative.
  • Saiph Savage, Assistant Professor at Northeastern University and Co-Director of the Civic Innovation Lab at UNAM: "The Future of Work for Performers. Empowering the People Behind AI." Saiph will propose a framework that facilitates a painless transition to new AI jobs that are unlikely to be automated in the future.

Demo

The keynote part will be followed by a demo that shows how crowdsourcing can tackle an e-commerce item retrieval and ranking task. Software developers, Dmitry Ustalov and Vladimir Losev, and Crowd Solutions Architect, Oleg Pavlov, will join the Toloka team for a hands-on tutorial. The audience will have a chance to learn how to build a human-in-the-loop pipeline from scratch:

  • Raw crowdsourced data will be integrated into ML models to obtain a ready-to-use dataset.
  • The team will show how interdependent data labeling processes can be combined using the Toloka Kit (our open-source Python library).
  • The Crowd Kit (quality control library) will be used to complete the task and get clean, AI-ready data.

The event will conclude with a Q&A session in real time, during which the team will address all of your questions.

Learn more
Article written by:
Toloka Team
Toloka Team
Updated: 

Recent articles

Have a data labeling project?

Take advantage of Toloka technologies. Chat with our expert to learn how to get reliable training data for machine learning at any scale.
Fractal