Show simple item record

dc.contributor.authorHARTE, NAOMIen
dc.contributor.authorKOKARAM, ANILen
dc.contributor.authorHINES, ANDREWen
dc.contributor.authorKOKARAM, ANIL CHRISTOPHERen
dc.contributor.authorHARTE, NAOMIen
dc.contributor.authorHINES, ANDREWen
dc.date.accessioned2015-12-09T12:41:36Z
dc.date.available2015-12-09T12:41:36Z
dc.date.issued2015en
dc.date.submitted2015en
dc.identifier.citationHines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, 1, 2015, 13-en
dc.identifier.otherYen
dc.descriptionPUBLISHEDen
dc.descriptionExport Date: 27 August 2015en
dc.description.abstractThis paper presents an objective speech quality model, ViSQOL, the Virtual Speech Quality Objective Listener. It is a signal-based, full-reference, intrusive metric that models human speech quality perception using a spectro-temporal measure of similarity between a reference and a test speech signal. The metric has been particularly designed to be robust for quality issues associated with Voice over IP (VoIP) transmission. This paper describes the algorithm and compares the quality predictions with the ITU-T standard metrics PESQ and POLQA for common problems in VoIP: clock drift, associated time warping, and playout delays. The results indicate that ViSQOL and POLQA significantly outperform PESQ, with ViSQOL competing well with POLQA. An extensive benchmarking against PESQ, POLQA, and simpler distance metrics using three speech corpora (NOIZEUS and E4 and the ITU-T P.Sup. 23 database) is also presented. These experiments benchmark the performance for a wide range of quality impairments, including VoIP degradations, a variety of background noise types, speech enhancement methods, and SNR levels. The results and subsequent analysis show that both ViSQOL and POLQA have some performance weaknesses and under-predict perceived quality in certain VoIP conditions. Both have a wider application and robustness to conditions than PESQ or more trivial distance metrics. ViSQOL is shown to offer a useful alternative to POLQA in predicting speech quality in VoIP scenarios.en
dc.description.sponsorshipAndrew Hines thanks Google, Inc. for support. Thanks also to Yi Hu for sharing the full listener test MOS results and enhanced test files for the NOIZEUS databaseen
dc.format.extent13en
dc.relation.ispartofseriesEurasip Journal on Audio, Speech, and Music Processingen
dc.relation.ispartofseries2015en
dc.relation.ispartofseries1en
dc.rightsYen
dc.subjectObjective speech quality; POLQA; P.853; PESQ; ViSQOL; NSIMen
dc.subject.lcshObjective speech quality; POLQA; P.853; PESQ; ViSQOL; NSIMen
dc.titleViSQOL: an objective speech quality modelen
dc.typeJournal Articleen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/nharteen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/akokaramen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/ahinesen
dc.identifier.rssinternalid105760en
dc.identifier.doihttp://dx.doi.org/10.1186/s13636-015-0054-9en
dc.rights.ecaccessrightsopenAccess
dc.identifier.rssurihttp://www.scopus.com/inward/record.url?eid=2-s2.0-84930212854&partnerID=40&md5=9e6b8eb966a3dd6e2a497dd672a26f54en
dc.identifier.urihttp://hdl.handle.net/2262/75238


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record