Multi-pitch estimation

Mads Christensen, Andreas Jakobsson

Research output: Book/ReportBookResearch

Abstract

Periodic signals can be decomposed into sets of sinusoids having frequencies that are integer multiples of a fundamental frequency. The problem of finding such fundamental frequencies from noisy observations is important in many speech and audio applications, where it is commonly referred to as pitch estimation. These applications include analysis, compression, separation, enhancement, automatic transcription and many more. In this book, an introduction to pitch estimation is given and a number of statistical methods for pitch estimation are presented. The basic signal models and associated estimation theoretical bounds are introduced, and the properties of speech and audio signals are discussed and illustrated. The presented methods include both single- and multi-pitch estimators based on statistical approaches, like maximum likelihood and maximum a posteriori methods, filtering methods based on both static and optimal adaptive designs, and subspace methods based on the principles of subspace orthogonality and shift-invariance. The application of these methods to analysis of speech and audio signals is demonstrated using both real and synthetic signals, and their performance is assessed under various conditions and their properties discussed. Finally, the estimators are compared in terms of computational and statistical efficiency, generalizability and robustness.
Original languageEnglish
PublisherMorgan & Claypool
Number of pages160
DOIs
Publication statusPublished - 2009

Publication series

NameSynthesis Lectures on Speech and Audio Processing
ISSN (Print)1932-121X
ISSN (Electronic)1932-1678

Subject classification (UKÄ)

  • Probability Theory and Statistics

Fingerprint

Dive into the research topics of 'Multi-pitch estimation'. Together they form a unique fingerprint.

Cite this