Blog
Explore our updates, case studies,
technology articles and insights.

Creating domain-ready datasets: How Toloka's hybrid approach generates realistic and high-quality data
Aug 4, 2025

Agentic AI & the Future of Coding
Jul 29, 2025

Detecting hidden harm in long contexts: How Toloka built an advanced safety dataset
Jul 14, 2025

Does Your Agent Work? AI Agent Benchmarks Explained
Jul 7, 2025

Beyond Next-Token Prediction: How Post-Training Teaches LLMs to Reason
Jul 1, 2025

Agent Evaluation: Why Simulated Environments are the New Frontier for Data
Jun 17, 2025

Toxicity detection: Why we still need human-labeled data
Jun 2, 2025

Evaluating Model Reasoning with Rubrics: Building a Domain-Specific Evaluation Dataset
May 27, 2025

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025
Load More

Creating domain-ready datasets: How Toloka's hybrid approach generates realistic and high-quality data
Aug 4, 2025

Agentic AI & the Future of Coding
Jul 29, 2025

Detecting hidden harm in long contexts: How Toloka built an advanced safety dataset
Jul 14, 2025

Does Your Agent Work? AI Agent Benchmarks Explained
Jul 7, 2025

Beyond Next-Token Prediction: How Post-Training Teaches LLMs to Reason
Jul 1, 2025

Agent Evaluation: Why Simulated Environments are the New Frontier for Data
Jun 17, 2025

Toxicity detection: Why we still need human-labeled data
Jun 2, 2025

Evaluating Model Reasoning with Rubrics: Building a Domain-Specific Evaluation Dataset
May 27, 2025

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025
Load More

Creating domain-ready datasets: How Toloka's hybrid approach generates realistic and high-quality data
Aug 4, 2025

Agentic AI & the Future of Coding
Jul 29, 2025

Detecting hidden harm in long contexts: How Toloka built an advanced safety dataset
Jul 14, 2025

Does Your Agent Work? AI Agent Benchmarks Explained
Jul 7, 2025

Beyond Next-Token Prediction: How Post-Training Teaches LLMs to Reason
Jul 1, 2025

Agent Evaluation: Why Simulated Environments are the New Frontier for Data
Jun 17, 2025

Toxicity detection: Why we still need human-labeled data
Jun 2, 2025

Evaluating Model Reasoning with Rubrics: Building a Domain-Specific Evaluation Dataset
May 27, 2025

Human-powered evaluation: Actionable feedback for next‑gen video diffusion models
May 20, 2025

Standardizing AI safety with MLCommons
May 15, 2025

Toloka Fuels Next Stage of Growth with Investment Led by Bezos Expeditions
May 6, 2025

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Introducing JEEM: A new benchmark for evaluating low-resource Arabic dialects
Apr 14, 2025

The personality paradox: Teaching AI agents to act like real people
Apr 10, 2025

Fixing SWE-bench: A Smarter Way to Evaluate Coding AI
Mar 17, 2025
Load More

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.

Subscribe to Toloka News
Case studies, product news, and other articles straight to your inbox.
SOLUTIONS
SOLUTIONS
© 2025 Toloka AI BV
SOLUTIONS