• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Classifiers for Synthetic Speech Detection: A Comparison

by Tomi Kinnunen, Md Sahidullah, R Sizov
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

A comparison of features for synthetic speech detection

by Md Sahidullah, Tomi Kinnunen - in INTERSPEECH , 2015
"... The performance of biometric systems based on automatic speaker recognition technology is severely degraded due to spoofing attacks with synthetic speech generated using different voice conversion (VC) and speech synthesis (SS) techniques. Various countermeasures are proposed to detect this type of ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
The performance of biometric systems based on automatic speaker recognition technology is severely degraded due to spoofing attacks with synthetic speech generated using different voice conversion (VC) and speech synthesis (SS) techniques. Various countermeasures are proposed to detect this type of at-tack, and in this context, choosing an appropriate feature extrac-tion technique for capturing relevant information from speech is an important issue. This paper presents a concise experi-mental review of different features for synthetic speech detec-tion task. A wide variety of features considered in this study include previously investigated features as well as some other potentially useful features for characterizing real and synthetic speech. The experiments are conducted on recently released ASVspoof 2015 corpus containing speech data from a large number of VC and SS technique. Comparative results using two different classifiers indicate that features representing spectral information in high-frequency region, dynamic information of speech, and detailed information related to subband characteris-tics are considerably more useful in detecting synthetic speech. Index Terms: anti-spoofing, ASVspoof 2015, feature extrac-tion, countermeasures
(Show Context)

Citation Context

...s) which are not included in training. 3.2. Classifier Description In a different study with classifiers, we have shown that GMMbased technique yields reasonably good accuracy in ASVspoof 2015 corpus =-=[36]-=-. So, we choose this classifier for benchmarking of various features. We have also evaluated the performance with recently proposed SVM-based approach for detecting synthetic speech. GMM-ML: Two separ...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University