At Toloka, we are committed to unlocking AI opportunities. Every day, our researchers tackle pressing AI and ML challenges,
make appearances at prominent global events, and publish their findings in scientific journals.
Browse through some of our latest work.
The BigCode community introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention.
In this tutorial, we describe the general framework of RLHF and explain the technical procedures required to apply this framework.
We present a human-in-the-loop approach to learning the most useful combination of prompt keywords using a genetic algorithm.
Our experiments on two different image datasets, dresses from Zalando's FEIDEGGER and shoes from the Toloka Shoes Dataset, confirm that one can obtain meaningful clusters purely through crowdsourcing, with no machine learning algorithms.
Find out how Toloka powers top-tier research across the globe.
The challenge solicited solutions that processed RAW camera images captured in night scenes. The organizers used Toloka to evaluate the visual appearance of the results. Mean opinion scores were calculated for a “people’s choice” ranking of solutions.
Labeling a large number of images can be labor-intensive and time-consuming, and labeling images in planetary science often requires the help of crowdsourcing. In their book “Machine Learning for Planetary Science,” NASA researchers acknowledge Toloka.
Saiph Savage, an Assistant Professor at Northeastern University and Director of the Northeastern Civic AI Lab, collaborated with Toloka to lead a research initiative called “A.I. For Good Framework to Empower Digital Workers” to help rural workers get better wages and conditions.
As part of the conference, teams competed to devise the best system for machine translation of articles into different languages. Toloka provided human judgements as the ground truth for translation quality evaluation. All relevant language pairs were covered for fast evaluations using Toloka’s global crowd.
We regularly hold tutorials and lead workshops at some of the biggest AI conferences around the globe.
We thrive on continuous improvement and international cooperation. Contact us on LinkedIn if you’d like to collaborate.
Toloka partners with universities across the world to incorporate crowd science techniques.