Multitaper Estimation of Frequency-Warped Cepstra with Application to Speaker Verification

Johan Sandberg, Maria Sandsten, Tomi Kinnunen, Rahim Saeidi, Patrick Flandrin, Pierre Borgnat

Research output: Contribution to journalArticlepeer-review

Abstract

Usually the mel-frequency cepstral coefficients are estimated either from a periodogram or from a windowed periodogram. We state a general estimator which also includes multitaper estimators. We propose approximations of the variance and bias of the estimate of each coefficient. By using Monte Carlo computations, we demonstrate that the approximations are accurate. Using the proposed formulas, the peak matched multitaper estimator is shown to have low mean square error (squared bias + variance) on speech-like processes. It is also shown to perform slightly better in the NIST 2006 speaker verification task as compared to the Hamming window conventionally used in this context.
Original languageEnglish
Pages (from-to)343-346
JournalIEEE Signal Processing Letters
Volume17
Issue number4
DOIs
Publication statusPublished - 2010

Subject classification (UKÄ)

  • Probability Theory and Statistics

Free keywords

  • Speech analysis
  • Multitapers
  • Speaker verification
  • Cepstral analysis
  • Multiple windows
  • MFCC

Fingerprint

Dive into the research topics of 'Multitaper Estimation of Frequency-Warped Cepstra with Application to Speaker Verification'. Together they form a unique fingerprint.

Cite this