Packet voice rate adaptation through perceptual frame discarding

Steffen Praestholm, Hans-Peter Schwefel, Sören Vang Andersen

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

Abstract

We address the problem of rate adaptation at the Source, given a congested packet based voice carrying network. We propose and analyze a novel method for perceptually based frame discarding. Thus, we propose a Perceptually Based Classifier (PBC) to do the discarding and we combine the PBC with a method for state synchronization, which exploits the knowledge of frame discards to combat the error propagation normally following a frame loss for state dependent coders. We further propose to use a combination of a queue model and an empirical speech quality model to decide on the proper discard rate for a given congestion scenario. In this design, we particularly focus on the queue model's ability to capture the characteristics of bursty traffic from voice sources in discontinuous transmission mode. Frame discarding is evaluated in a network bottleneck scenario, where objective results, based on PESQ-LQO, show significant improvements due to perceptually based frame discarding. The improvement depends on the degree of congestion without rate adaptation, the number of incoming voice flows, and the bottleneck buffer.
Original languageEnglish
Title of host publicationGLOBECOM 2007: 2007 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-11
PublisherIEEE - Institute of Electrical and Electronics Engineers Inc.
Pages2497-2502
Publication statusPublished - 2007
Externally publishedYes
EventIEEE Global communications conference (Globecom), 2007 - Washington D.C., Washington D.C., United States
Duration: 2007 Nov 262007 Nov 30

Publication series

Name
ISSN (Print)1930-529X

Conference

ConferenceIEEE Global communications conference (Globecom), 2007
Country/TerritoryUnited States
CityWashington D.C.
Period2007/11/262007/11/30

Subject classification (UKÄ)

  • Mathematics

Fingerprint

Dive into the research topics of 'Packet voice rate adaptation through perceptual frame discarding'. Together they form a unique fingerprint.

Cite this