Toward Real Time Word Based Prosody Recognition

Tilson, Alex and Foerster, Frank (2024) Toward Real Time Word Based Prosody Recognition. In: Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning. CLASP Conference Proceedings, 3. Association for Computational Linguistics, SWE, pp. 62-67. ISBN 979-8-89176-163-6

Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech and ground them in a robot’s sensorimotor data. However, the existing pipeline for word-based prosody recognition was semi-automatic and required substantial manual effort. We describe our efforts to automate this pipeline by incorporating real-time prosody recognition together with a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.
