Conference

Aligning LLMs to Low-Resource Languages

This tutorial provides a detailed guide on collecting data for aligning large language models (LLMs) with low-resource languages (LRLs).

Feb 22, 2024

00:00 GMT+2

Nazar Beknazarov
Ahmet Üstün
Marzieh Fadaee

Overview

This tutorial is a detailed guide to collecting data for aligning large language models (LLMs) with low-resource languages (LRLs). It addresses data scarcity in these languages and introduces a pipeline for generating high-quality data, using Swahili as the primary example. It covers strategies for collecting datasets and aligning LLMs to LRLs, and shows how to produce and use high-quality data to develop language technology for under-resourced languages.

Materials

Notebooks

Speakers

Nazar Beknazarov

Toloka AI

Ahmet Üstün

Cohere for AI

Marzieh Fadaee

Toloka

Natalia Fedorova

Toloka Partnership Manager, Toloka

Sergey Koshelev

Toloka AI

Alisa Smirnova

Toloka AI
