Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams

Aditya Joglekar*, Ivan Lopez Espejo, John H. L. Hansen

*Corresponding author

Publication: Contribution to journal, Conference abstract in journal, Research, peer-reviewed

Abstract

Fearless Steps (FS) APOLLO is a 50,000+ hr audio resource established by CRSS-UTDallas capturing all communications between NASA-MCC personnel, backroom staff, and astronauts across manned Apollo missions. Such a massive unlabeled audio resource, without metadata, provides limited benefit for communities outside Speech-and-Language Technology (SLT). Supplementing this audio with rich metadata, developed using robust automated mechanisms to transcribe and highlight naturalistic communications, can facilitate open research opportunities for SLT, speech sciences, education, and historical archival communities. In this study, we focus on customizing keyword spotting (KWS) and topic detection systems as an initial step towards conversational understanding. Extensive research in automatic speech recognition (ASR), speech activity detection, and speaker diarization using the manually transcribed 125 h FS Challenge corpus has demonstrated the need for robust domain-specific model development. A major challenge in training KWS systems and topic detection models is the limited availability of word-level annotations. Forced alignment schemes evaluated using state-of-the-art ASR show significant degradation in segmentation performance. This study explores challenges in extracting accurate keyword segments using existing sentence-level transcriptions and proposes domain-specific KWS-based solutions to detect conversational topics in audio streams.
Original language: English
Article number: A173
Journal: The Journal of the Acoustical Society of America
Volume: 153
Issue number: Supplement 3
ISSN: 0001-4966
Status: Published - Mar 2023
Event: 184th Meeting of the Acoustical Society of America - Chicago, USA
Duration: 8 May 2023 to 12 May 2023
https://acousticalsociety.org/asa-meetings/

Conference

Conference: 184th Meeting of the Acoustical Society of America
Country/Territory: USA
City: Chicago
Period: 08/05/2023 to 12/05/2023
