Alice voice assistant

Trained a voice assistant with language data

Accelerate your
e-commerce AI
Talk to our AI expert
Accelerate your
e-commerce AI
Talk to our AI expert


A top voice assistant developer needed accurate training data in several languages for expansion into new markets.


Directly translating the existing voice assistant requests and responses yielded very low accuracy (about 12%), so the next strategy was to collect language-specific datasets for training the models.


The Toloka crowd provided data in the target languages: speech recordings, audio transcription for speech recognition, request classification, and answer relevance evaluations.

Business impact

The voice assistant's accuracy in the new languages hit ~62% (~2 correct responses for every 3 requests) — a major step up from a baseline of ~12%. With Toloka's contribution this result was reached in less than a year.

Similar success stories

Accelerate your e-commerce AI

Let's talk about the ideal solution for your data needs.