Show simple item record

dc.contributor.authorFujii, Koyo
dc.contributor.authorHolthaus, Patrick
dc.contributor.authorSamani, H
dc.contributor.authorPremachandra, Chinthaka
dc.contributor.authorAmirabdollahian, Farshid
dc.contributor.editorAl Ali, Abdulaziz
dc.contributor.editorCabibihan, John-John
dc.contributor.editorMeskin, Nader
dc.contributor.editorRossi, Silvia
dc.contributor.editorJiang, Wanyue
dc.contributor.editorHe, Hongsheng
dc.contributor.editorGe, Shuzhi Sam
dc.date.accessioned2023-12-18T09:30:01Z
dc.date.available2023-12-18T09:30:01Z
dc.date.issued2023-12-03
dc.identifier.citationFujii , K , Holthaus , P , Samani , H , Premachandra , C & Amirabdollahian , F 2023 , Two-Level Reinforcement Learning Framework for Self-Sustained Personal Robots . in A Al Ali , J-J Cabibihan , N Meskin , S Rossi , W Jiang , H He & S S Ge (eds) , 15th International Conference, ICSR 2023, Proceedings, Part II . 15 edn , vol. 14453 , Lecture Notes in Computer Science (LNCS, volume 14454) , vol. 14453 , Springer Nature Link , pp. 363–372 , 15th International Conference on Social Robotics (ICSR 2023) , Doha , Qatar , 3/12/23 . https://doi.org/10.1007/978-981-99-8715-3_30
dc.identifier.citationconference
dc.identifier.isbn978-981-99-8717-7
dc.identifier.isbn978-981-99-8715-3
dc.identifier.issn0302-9743
dc.identifier.otherORCID: /0000-0001-8450-9362/work/149287783
dc.identifier.otherORCID: /0000-0003-1494-2798/work/149287955
dc.identifier.urihttp://hdl.handle.net/2299/27301
dc.description© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. This is the accepted manuscript version of a conference proceeding which has been published in final form at https://link.springer.com/book/9789819987177
dc.description.abstractAs social robots become integral to daily life, effective battery management and personalized user interactions are crucial. We employed Q-learning with the Miro-E robot for balancing self-sustained energy management and personalized user engagement. Based on our approach, we anticipate that the robot will learn when to approach the charging dock and adapt interactions according to individual user preferences. For energy management, the robot underwent iterative training in a simulated environment, where it could opt to either “play” or “go to the charging dock”. The robot also adapts its interaction style to a specific individual, learning which of three actions would be preferred based on feedback it would receive during real-world human-robot interactions. From an initial analysis, we identified a specific point at which the Q values are inverted, indicating the robot’s potential establishment of a battery threshold that triggers its decision to head to the charging dock in the energy management scenario. Moreover, by monitoring the probability of the robot selecting specific behaviours during human-robot interactions over time, we expect to gather evidence that the robot can successfully tailor its interactions to individual users in the realm of personalized engagement.en
dc.format.extent10
dc.format.extent3384337
dc.language.isoeng
dc.publisherSpringer Nature Link
dc.relation.ispartof15th International Conference, ICSR 2023, Proceedings, Part II
dc.relation.ispartofseriesLecture Notes in Computer Science (LNCS, volume 14454)
dc.titleTwo-Level Reinforcement Learning Framework for Self-Sustained Personal Robotsen
dc.contributor.institutionCentre for Computer Science and Informatics Research
dc.contributor.institutionSchool of Physics, Engineering & Computer Science
dc.contributor.institutionDepartment of Computer Science
dc.contributor.institutionECS Engineering and Technology VLs
dc.contributor.institutionCommunications and Intelligent Systems
dc.contributor.institutionCentre for Engineering Research
dc.contributor.institutionDepartment of Engineering and Technology
dc.contributor.institutionCentre for Future Societies Research
dc.contributor.institutionRobotics Research Group
dc.date.embargoedUntil2025-01-13
dc.identifier.urlhttps://link.springer.com/book/9789819987177
rioxxterms.versionofrecord10.1007/978-981-99-8715-3_30
rioxxterms.typeOther
herts.preservation.rarelyaccessedtrue


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record