An image-inspired audio sharpness index

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We propose a new non-intrusive (reference-free) objective measure of speech intelligibility that is inspired from previous works on image sharpness. We define the audio Sharpness Index (aSI) as the sensitivity of the spectrogram sparsity to the convolution of the signal with a white noise, and we calculate a closed-form formula of the aSI. Experiments with various speakers, noise and reverberation conditions show a high correlation between the aSI and the well-established Speech Transmission Index (STI), which is intrusive (full-reference). Additionally, the aSI can be used as an intelligibility or clarity criterion to drive sound enhancement algorithms. Experimental results on stereo mixtures of two sounds show that blind source separation based on aSI maximization performs well for speech and for music.

Original languageEnglish
Title of host publication25th European Signal Processing Conference, EUSIPCO 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages683-687
Number of pages5
ISBN (Electronic)9780992862671
DOIs
Publication statusPublished - 23 Oct 2017
Externally publishedYes
Event25th European Signal Processing Conference, EUSIPCO 2017 - Kos, Greece
Duration: 28 Aug 20172 Sept 2017

Publication series

Name25th European Signal Processing Conference, EUSIPCO 2017
Volume2017-January

Conference

Conference25th European Signal Processing Conference, EUSIPCO 2017
Country/TerritoryGreece
CityKos
Period28/08/172/09/17

Fingerprint

Dive into the research topics of 'An image-inspired audio sharpness index'. Together they form a unique fingerprint.

Cite this