Toward Real Time Word Based Prosody Recognition
Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech and ground them in a robot’s sensorimotor data. However, the pipeline for word-based prosody recognition operated semi-automatically and required substantial manual effort. We describe our efforts to automate the existing pipeline by incorporating real-time prosody recognition and a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.
Item Type | Conference or Workshop Item (Other) |
---|---|
Additional information | © 2024 Association for Computational Linguistics. This work is distributed under the terms of the Creative Commons Attribution License (CC BY), https://creativecommons.org/licenses/by/4.0/ |
Date Deposited | 15 May 2025 16:51 |
Last Modified | 10 Jul 2025 23:46 |

- Tilson_Foerster24-Toward_Real_Time_Word_Based_Prosody_Recognition_accpted.pdf
- Submitted Version
- Restricted to Repository staff only
- Available under Unspecified