Blog
Explore our updates, case studies,
technology articles and insights.

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025

LLM Evaluation In Action: Should You Trust Automated Metrics or Human Judgment?
Mar 3, 2025

Toloka’s Commitment to Responsible AI: How We Prioritize Ethics, Safety, and Excellence
Feb 27, 2025

Introducing Toloka’s Bug Bounty Program: Strengthening Security with Ethical Hacking
Feb 12, 2025

R1 is not on par with o1, and the difference is qualitative, not quantitative
Feb 12, 2025

Mindrift: Elevating the standard of data for GenAI
Feb 11, 2025

The Data Behind DeepSeek’s Success
Feb 10, 2025

Responsible AI: 3 Steps to Improve AI Model Safety
Jan 17, 2025

Multimodal SFT data for the win: Pushing model performance in the real world
Jan 14, 2025
Load More

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025

LLM Evaluation In Action: Should You Trust Automated Metrics or Human Judgment?
Mar 3, 2025

Toloka’s Commitment to Responsible AI: How We Prioritize Ethics, Safety, and Excellence
Feb 27, 2025

Introducing Toloka’s Bug Bounty Program: Strengthening Security with Ethical Hacking
Feb 12, 2025

R1 is not on par with o1, and the difference is qualitative, not quantitative
Feb 12, 2025

Mindrift: Elevating the standard of data for GenAI
Feb 11, 2025

The Data Behind DeepSeek’s Success
Feb 10, 2025

Responsible AI: 3 Steps to Improve AI Model Safety
Jan 17, 2025

Multimodal SFT data for the win: Pushing model performance in the real world
Jan 14, 2025
Load More

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025

LLM Evaluation In Action: Should You Trust Automated Metrics or Human Judgment?
Mar 3, 2025

Toloka’s Commitment to Responsible AI: How We Prioritize Ethics, Safety, and Excellence
Feb 27, 2025

Introducing Toloka’s Bug Bounty Program: Strengthening Security with Ethical Hacking
Feb 12, 2025

R1 is not on par with o1, and the difference is qualitative, not quantitative
Feb 12, 2025

Mindrift: Elevating the standard of data for GenAI
Feb 11, 2025

The Data Behind DeepSeek’s Success
Feb 10, 2025

Responsible AI: 3 Steps to Improve AI Model Safety
Jan 17, 2025

Multimodal SFT data for the win: Pushing model performance in the real world
Jan 14, 2025
Load More

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.
SOLUTIONS
SOLUTIONS
© 2025 Toloka AI BV
SOLUTIONS