RealityTalk: Real-time Speech-driven Augmented Presentation for AR Live Storytelling

Jian Liao , Adnan Karim , Shivesh Jadon , Rubaiat Habib Kazi , Ryo Suzuki



We present RealityTalk, a system that augments real-time live presentations with speech-driven interactive virtual elements. Augmented presentations leverage embedded visuals and animation for engaging and expressive storytelling. However, existing tools for live presentations often lack interactivity and improvisation, while creating such effects in video editing tools require significant time and expertise. RealityTalk enables users to create live augmented presentations with real-time speech-driven interactions. The user can interactively prompt, move, and manipulate graphical elements through real-time speech and supporting modalities. Based on our analysis of 177 existing video-edited augmented presentations, we propose a novel set of interaction techniques and then incorporated them into RealityTalk. We evaluate our tool from a presenter’s perspective to demonstrate the effectiveness of our system.

Keywords:  Augmented RealityMixed RealityAugmented PresentationNatural Language ProcessingGestural And Speech InputVideo


Jian Liao, Adnan Karim, Shivesh Jadon, Rubaiat Habib Kazi, Ryo SuzukiRealityTalk: Real-time Speech-driven Augmented Presentation for AR Live StorytellingIn Proceedings of the Annual ACM Symposium on User Interface Software and Technology (UIST '22)ACM, New York, NY, USA  Page: 1-12.  DOI: