Success Stories
Learn how companies around the world are pushing the boundaries of AI with LLM post-training and evaluation.

AI agents under attack: A case study on advanced agent red-teaming
Apr 28, 2025

Multi-domain, multi-language SFT dataset pushes LLM performance to the next level
Oct 22, 2024

Toloka helps ServiceNow increase evaluation throughput multiple times
Oct 11, 2024

LLM for code generation: a scalable pipeline to gather SFT data
Apr 29, 2024

Building a lead classification system for 10x client leads
Dec 13, 2023

LLM costs vs quality: How Eightify picked the right GPT model
Dec 11, 2023

Perplexity enhances LLMs with holistic quality evaluation
Dec 8, 2023

Chatfuel drives chatbot quality with Toloka Deep Evaluation
Dec 7, 2023

Spoke.ai: Summarization as rocket fuel
Dec 6, 2023