Detection of Voice Conversion Spoofing Attacks Using Voiced Speech

  • Arun Sankar Muttathu Sivasankara Pillai
  • Phillip L. De Leon
  • Utz Roedig

Research output: Chapter in Book/Report/Conference proceedings › Chapter › peer-review

Abstract

Speech consists of voiced and unvoiced segments that differ in their production process and exhibit different characteristics. In this paper, we investigate the spectral differences between bona fide and spoofed speech for voiced and unvoiced speech segments. We observe that the largest spectral differences lie in the 0–4 kHz band of voiced speech. Based on this observation, we propose a low-complexity pre-processing stage that subsamples voiced frames prior to spoofing detection. The proposed pre-processing stage is applied to two systems, LFCC+GMM and IA/IF+KNN, that differ entirely in the features and classifier used for spoofing detection. Our results show improvement with both systems in detecting the ASVspoof 2019 A17 voice conversion attack, which is recognized as having one of the highest spoofing capabilities. We also show improvements on the A18 and A19 voice conversion attacks for the IA/IF+KNN system. The resulting A17 EERs are lower than those of all reported systems for which A17 is the worst attack, except the Capsule Network. Finally, we note that the proposed pre-processing stage reduces the speech data by more than 4× through subsampling and using only voiced frames, while maintaining a pooled EER similar to that of the baseline systems, which may be advantageous for resource-constrained spoofing detectors.
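
The pre-processing idea in the abstract (keep only voiced frames, then subsample so that only the 0–4 kHz band remains) can be illustrated with a minimal sketch. The abstract does not state how voicing is decided or which subsampling filter is used, so everything here is an assumption: a 16 kHz input sampling rate, 20 ms non-overlapping frames, an energy plus zero-crossing-rate heuristic as a stand-in voicing detector, and a windowed-sinc low-pass filter followed by decimation by 2. The function names and thresholds are hypothetical and are not taken from the paper.

```python
import math

def frame_signal(x, frame_len):
    """Split x into non-overlapping frames of frame_len samples."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, frame_len)]

def is_voiced(frame, energy_thresh, zcr_thresh):
    """Heuristic voicing decision (assumption): high energy, low zero-crossing rate."""
    energy = sum(s * s for s in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    zcr = crossings / (len(frame) - 1)
    return energy > energy_thresh and zcr < zcr_thresh

def lowpass_taps(num_taps=31, cutoff=0.25):
    """Hamming-windowed sinc low-pass FIR; cutoff in cycles/sample (0.25 = fs/4)."""
    mid = (num_taps - 1) // 2
    taps = []
    for n in range(num_taps):
        t = n - mid
        h = 2 * cutoff if t == 0 else math.sin(2 * math.pi * cutoff * t) / (math.pi * t)
        w = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(h * w)
    total = sum(taps)
    return [t / total for t in taps]  # normalise to unity gain at DC

def decimate_by_2(frame, taps):
    """Low-pass filter the frame (zero-padded edges), keeping every second sample."""
    half = len(taps) // 2
    out = []
    for i in range(0, len(frame), 2):
        acc = 0.0
        for k, h in enumerate(taps):
            j = i + k - half
            if 0 <= j < len(frame):
                acc += h * frame[j]
        out.append(acc)
    return out

def preprocess(x, fs=16000, frame_ms=20, energy_thresh=1e-3, zcr_thresh=0.25):
    """Return the voiced frames of x, each subsampled by 2 (0 to fs/4 band kept)."""
    frame_len = fs * frame_ms // 1000
    taps = lowpass_taps()
    frames = frame_signal(x, frame_len)
    return [decimate_by_2(f, taps) for f in frames
            if is_voiced(f, energy_thresh, zcr_thresh)]
```

In this sketch the data reduction comes from two sources, as in the abstract: decimation halves the sample count of every kept frame, and discarding unvoiced frames removes the rest. The overall factor therefore depends on the proportion of voiced frames in the input. The frames returned by `preprocess` would then be passed to whatever feature extractor and classifier follow.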

Original language: English
Title of host publication: Secure IT Systems - 27th Nordic Conference, NordSec 2022, Proceedings
Editors: Hans P. Reiser, Marcel Kyas
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 159-175
Number of pages: 17
ISBN (Print): 9783031222948
DOIs
Publication status: Published - 2022
Externally published: Yes
Event: 27th Nordic Conference on Secure IT Systems, NordSec 2022 - Reykjavik, Iceland
Duration: 30 Nov 2022 → 2 Dec 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13700 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 27th Nordic Conference on Secure IT Systems, NordSec 2022
Country/Territory: Iceland
City: Reykjavik
Period: 30/11/22 → 2/12/22

Keywords

  • Computer security
  • Speech processing
  • Spoofing detection
  • Voice biometrics
