Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings

Michał Affek; Marek Sylwester Tatara

doi:10.1007/978-3-031-16159-9_14

Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings

Abstrakt

The paper proposes an approach for extending deep neural networks-based solutions to closed-set speaker identification toward the open-set problem. The idea is built on the characteristics of deep neural networks trained for the classification tasks, where there is a layer consisting of a set of deep features extracted from the analyzed inputs. By extracting this vector and performing anomaly detection against the set of known speakers, new speakers can be detected and modeled for further re-identification. The approach is tested on the basis of NeMo toolkit with SpeakerNet architecture. The algorithm is shown to be working with multiple new speakers introduced.

Cytowania

0

CrossRef
0

Web of Science
0

Scopus

Autorzy (2)

Cytuj jako

Pełna treść

pobierz publikację

pobrano 73 razy

Wersja publikacji: Accepted albo Published Version
Licencja: Copyright (2023 The Author(s), under exclusive license to Springer Nature Switzerland AG)

Słowa kluczowe

Informacje szczegółowe

Kategoria:

Publikacja monograficzna

Typ:

rozdział, artykuł w książce - dziele zbiorowym /podręczniku w języku o zasięgu międzynarodowym

Język:

angielski

Rok wydania:

2022

Opis bibliograficzny:

Affek M., Tatara M.: Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings// Intelligent and Safe Computer Systems in Control and Diagnostics/ : , 2022, s.167-177

DOI:

10.1007/978-3-031-16159-9_14

Źródła finansowania:

Publikacja bezkosztowa

Weryfikacja:

Politechnika Gdańska

wyświetlono 168 razy

K. Bobkowska,
N. Wawrzyniak

2019

Texture Features for the Detection of Playback Attacks: Towards a Robust Solution

M. Smiatacz

2020

Meta Tagi

Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings

Abstrakt

Cytowania

Autorzy (2)

Michał Affek

Marek Sylwester Tatara (dawniej: Marek Tatara) dr inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Playback detection using machine learning with spectrogram features approach

Examining Influence of Distance to Microphone on Accuracy of Speech Recognition

The Hough transform in the classification process of inland ships

Texture Features for the Detection of Playback Attacks: Towards a Robust Solution

Wyszukiwarka

Open-Set Speaker Identification Using Closed-Set Pretrained Embeddings

Abstrakt

Cytowania

Autorzy (2)

Michał Affek

Marek Sylwester Tatara (dawniej: Marek Tatara) dr inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Playback detection using machine learning with spectrogram features approach

Examining Influence of Distance to Microphone on Accuracy of Speech Recognition

The Hough transform in the classification process of inland ships

Texture Features for the Detection of Playback Attacks: Towards a Robust Solution