Abstract
Usually the mel-frequency cepstral coefficients are estimated either from a periodogram or from a windowed periodogram. We state a general estimator which also includes multitaper estimators. We propose approximations of the variance and bias of the estimate of each coefficient. By using Monte Carlo computations, we demonstrate that the approximations are accurate. Using the proposed formulas, the peak matched multitaper estimator is shown to have low mean square error (squared bias + variance) on speech-like processes. It is also shown to perform slightly better in the NIST 2006 speaker verification task as compared to the Hamming window conventionally used in this context.
Original language | English |
---|---|
Pages (from-to) | 343-346 |
Journal | IEEE Signal Processing Letters |
Volume | 17 |
Issue number | 4 |
DOIs | |
Publication status | Published - 2010 |
Subject classification (UKÄ)
- Probability Theory and Statistics
Free keywords
- Speech analysis
- Multitapers
- Speaker verification
- Cepstral analysis
- Multiple windows
- MFCC