Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules

This thesis focuses on the robustness issues of speaker verification (SV) systems. Al-though current SV systems perform well under clean condition, their performance de-grades dramatically under real-world uncontrolled environments. The reliability of cur-rent SV systems is also questionable under sp...

Full description

Bibliographic Details
Main Author: Du, Steven
Other Authors: Chng Eng Siong, School of Computer Engineering, Emerging Research Lab
Format: Thesis
Language:English
Published: 2015
Subjects:
Online Access:https://hdl.handle.net/10356/65396
https://doi.org/10.32657/10356/65396
id ftnanyangtu:oai:dr.ntu.edu.sg:10356/65396
record_format openpolar
spelling ftnanyangtu:oai:dr.ntu.edu.sg:10356/65396 2023-05-15T15:14:35+02:00 Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules Du, Steven Chng Eng Siong School of Computer Engineering Emerging Research Lab 2015 81 p. application/pdf https://hdl.handle.net/10356/65396 https://doi.org/10.32657/10356/65396 en eng Du, S. (2015). Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/65396 doi:10.32657/10356/65396 DRNTU::Engineering::General Thesis 2015 ftnanyangtu https://doi.org/10.32657/10356/65396 2023-03-10T01:21:13Z This thesis focuses on the robustness issues of speaker verification (SV) systems. Al-though current SV systems perform well under clean condition, their performance de-grades dramatically under real-world uncontrolled environments. The reliability of cur-rent SV systems is also questionable under spoofing attacks. These pitfalls severely limit it’s deployment in many applications. This thesis presents approaches to combat these two robustness issues, namely noise robustness and spoofing attacks. To address the noise robustness issue, the use of deep neural networks (DNN) as a feature compensation method in the front-end module of the SV system is proposed. The motivation to use DNN is due to its success in various related speech fields, and its ability to model nonlinear relationships between high dimensional input and output. In this work, DNN is used to convert noisy input features into clean features. The proposed method is evaluated using the benchmarking speaker recognition evaluation (SRE) 2010 dataset provided by the National Institute of Standards and Technology(NIST). To focus on the effect of feature pre-processing, the SV system is trained using noise free speech and evaluated on noise corrupted speech. Results show that the proposed DNN feature compensation improves the equal error rate (EER) by 2%-25% under different unseen noise types for various SNR levels. To address the spoofing attacks issue, the use of long temporal high dimensional speech features derived from both magnitude and phase spectra as input features to neural network (NN) classifiers is proposed. The long term temporal information is in-corporated by concatenating 31 successive frames as input feature to the NN classifier. The classifier is then used to predict the posterior probability of the test speech being spoofing speech. Four speakers of CMU-ARCTIC database are selected for spoofing data generation and methods evaluation. Spoofing data is generated by four synthesis meth-ods, namely: AHOcoder, STRAIGHT, JD-GMM with maximum likelihood ... Thesis Arctic DR-NTU (Digital Repository at Nanyang Technological University, Singapore) Arctic
institution Open Polar
collection DR-NTU (Digital Repository at Nanyang Technological University, Singapore)
op_collection_id ftnanyangtu
language English
topic DRNTU::Engineering::General
spellingShingle DRNTU::Engineering::General
Du, Steven
Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
topic_facet DRNTU::Engineering::General
description This thesis focuses on the robustness issues of speaker verification (SV) systems. Al-though current SV systems perform well under clean condition, their performance de-grades dramatically under real-world uncontrolled environments. The reliability of cur-rent SV systems is also questionable under spoofing attacks. These pitfalls severely limit it’s deployment in many applications. This thesis presents approaches to combat these two robustness issues, namely noise robustness and spoofing attacks. To address the noise robustness issue, the use of deep neural networks (DNN) as a feature compensation method in the front-end module of the SV system is proposed. The motivation to use DNN is due to its success in various related speech fields, and its ability to model nonlinear relationships between high dimensional input and output. In this work, DNN is used to convert noisy input features into clean features. The proposed method is evaluated using the benchmarking speaker recognition evaluation (SRE) 2010 dataset provided by the National Institute of Standards and Technology(NIST). To focus on the effect of feature pre-processing, the SV system is trained using noise free speech and evaluated on noise corrupted speech. Results show that the proposed DNN feature compensation improves the equal error rate (EER) by 2%-25% under different unseen noise types for various SNR levels. To address the spoofing attacks issue, the use of long temporal high dimensional speech features derived from both magnitude and phase spectra as input features to neural network (NN) classifiers is proposed. The long term temporal information is in-corporated by concatenating 31 successive frames as input feature to the NN classifier. The classifier is then used to predict the posterior probability of the test speech being spoofing speech. Four speakers of CMU-ARCTIC database are selected for spoofing data generation and methods evaluation. Spoofing data is generated by four synthesis meth-ods, namely: AHOcoder, STRAIGHT, JD-GMM with maximum likelihood ...
author2 Chng Eng Siong
School of Computer Engineering
Emerging Research Lab
format Thesis
author Du, Steven
author_facet Du, Steven
author_sort Du, Steven
title Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
title_short Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
title_full Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
title_fullStr Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
title_full_unstemmed Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules
title_sort robust speaker verification system with anti-spoofing detection and dnn feature enhancement modules
publishDate 2015
url https://hdl.handle.net/10356/65396
https://doi.org/10.32657/10356/65396
geographic Arctic
geographic_facet Arctic
genre Arctic
genre_facet Arctic
op_relation Du, S. (2015). Robust speaker verification system with anti-spoofing detection and DNN feature enhancement modules. Master's thesis, Nanyang Technological University, Singapore.
https://hdl.handle.net/10356/65396
doi:10.32657/10356/65396
op_doi https://doi.org/10.32657/10356/65396
_version_ 1766345024904101888