Toloka Research
Our mission
New data collection methods for SFT and RLHF that leverage synthetic data, AI feedback, and expert human-generated data.
Improved approaches to model training and alignment
that enhance model capabilities in long-horizon reasoning and autonomous behavior.
High-quality evaluation metrics & benchmarks to measure performance in coding, math, reasoning, multilingualism, multimodality, and other complex tasks.
Red-teaming methods for identifying model vulnerabilities and developing safety metrics such as harmfulness, security and CBRN risks, social bias, and more.
Our projects
BigCode: Open-scientific collaboration working on the responsible development of Large Language Models for Code
Publications
Conferences and events
We regularly hold tutorials and lead workshops at some of the biggest AI conferences around the globe.
Blog
Applied ML at Toloka
Open job positions: