Task 1: Far-Field Text-Dependent Speaker Verification from a Single Microphone Array

Training Data

The training data includes 120 speakers, each with 3 visits. Each visit contains multiple text-dependent utterances (“ni hao mi ya”) as well as multiple text-independent utterances. For each utterance, recordings from five devices are provided for training: one close-talk microphone, one cellphone at a distance of 25 cm, and three randomly selected microphone arrays (4 channels per array).

Any publicly available, freely accessible database shared on openslr.org before February 1, 2020 (including HI-MIA) may be used in this task.

Development Data

The development data includes 35 speakers, each with 3 visits. Each visit contains multiple text-dependent utterances (“ni hao mi ya”) as well as multiple text-independent utterances. For each utterance, recordings from five devices are provided: one close-talk microphone, one cellphone at a distance of 25 cm, and three randomly selected microphone arrays (4 channels per array).

Evaluation Data

The evaluation data includes 80 speakers, each with 3 visits. Each visit contains multiple “ni hao mi ya” utterances. For each utterance, recordings from two devices are provided: one cellphone at a distance of 25 cm and one randomly selected microphone array (4 channels).

Recordings from the cellphone at 25 cm are used for enrollment, and recordings from a single far-field microphone array are used for testing. For any true trial, the enrollment and test utterances come from different visits of the same speaker.
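The pairing rule above can be illustrated with a minimal Python sketch. It is not the organizers' trial-generation procedure: the `Utterance` fields, the device labels "cellphone" and "array", the file names, and the exhaustive pairing for non-target trials are all assumptions made for illustration. Skipping same-speaker, same-visit pairs is likewise an assumed reading of the different-visits rule.

```python
from itertools import product
from typing import NamedTuple


class Utterance(NamedTuple):
    speaker: str  # hypothetical speaker ID
    visit: int    # visit index, 1 to 3
    device: str   # "cellphone" or "array" (hypothetical labels)
    path: str     # hypothetical file name


def build_trials(utterances):
    """Pair every cellphone (enrollment) utterance with every array (test) utterance."""
    enroll = [u for u in utterances if u.device == "cellphone"]
    test = [u for u in utterances if u.device == "array"]
    trials = []
    for e, t in product(enroll, test):
        same_speaker = e.speaker == t.speaker
        if same_speaker and e.visit == t.visit:
            # Assumption: a true trial must span different visits, so drop this pair.
            continue
        label = "target" if same_speaker else "nontarget"
        trials.append((e.path, t.path, label))
    return trials


# Toy example with made-up file names.
demo = [
    Utterance("spk1", 1, "cellphone", "spk1_v1_cell.wav"),
    Utterance("spk1", 1, "array", "spk1_v1_arr.wav"),
    Utterance("spk1", 2, "array", "spk1_v2_arr.wav"),
    Utterance("spk2", 1, "array", "spk2_v1_arr.wav"),
]
for trial in build_trials(demo):
    print(trial)
```

In this toy set, the same-visit pair for `spk1` is dropped, the cross-visit pair is labeled as a target trial, and the pair with `spk2` is labeled as a non-target trial.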

There is no speaker overlap among the evaluation data of Task 1, Task 2, and Task 3.