Toward Real Time Word Based Prosody Recognition
Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in the context of robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech to ground them in a robot’s sensorimotor data. However, the pipeline for performing word-based prosody-recognition operated in a semi-automatic manner and required substantial manual effort. We describe our efforts to automate the existing pipeline by including real time prosody recognition, and a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.
Item Type | Book Section |
---|---|
Additional information | © 2024 Association for Computational Linguistics. This work is distributed under the terms of the Creative Commons Attribution License (CC BY), https://creativecommons.org/licenses/by/4.0/ |
Date Deposited | 15 May 2025 16:51 |
Last Modified | 30 May 2025 23:20 |
Explore Further
-
picture_as_pdf - Tilson_Foerster24-Toward_Real_Time_Word_Based_Prosody_Recognition_accpted.pdf
-
subject - Submitted Version
-
lock - Restricted to Repository staff only
-
copyright - Available under Unspecified