Automatic Alignment of Human Generated Transcripts to Speech Signals

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper introduces a novel, completely automatic, audio-subtitle synchronization algorithm designed to increase the accessibility and comprehension of the hearing-impaired people over the video documents. The major contribution of the paper involves the anchor words matching strategy that can reliably put in correspondence the subtitle document generated by a human transcriber and the automatic speech recognition (ASR) textual file. In addition, we propose a subtitle positioning strategy that automatically determines the optimal location for each phrase on the user screen, with respect to the video semantic content. The experimental evaluation validates the proposed method with average accuracy scores superior to 90%.

Original languageEnglish
Title of host publication2022 10th E-Health and Bioengineering Conference, EHB 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665485579
DOIs
Publication statusPublished - 1 Jan 2022
Event10th E-Health and Bioengineering Conference, EHB 2022 - Virtual, Online, Romania
Duration: 17 Nov 202218 Nov 2022

Publication series

Name2022 10th E-Health and Bioengineering Conference, EHB 2022

Conference

Conference10th E-Health and Bioengineering Conference, EHB 2022
Country/TerritoryRomania
CityVirtual, Online
Period17/11/2218/11/22

Keywords

  • anchor words
  • automatic speech recognition, dynamic subtitle positioning
  • subtitle synchronization

Fingerprint

Dive into the research topics of 'Automatic Alignment of Human Generated Transcripts to Speech Signals'. Together they form a unique fingerprint.

Cite this