Alert icon
We're changing our privacy policy. This stuff matters.  Learn more  Dismiss

Multimodal mobile interaction - blending speech and GUI input - iphone demo

Loading...

Sign in or sign up now!
Alert icon
Upgrade to the latest Flash Player for improved playback performance. Upgrade now or more info.
1,389
Loading...
Alert icon
Sign in or sign up now!
Alert icon

Uploaded by on Oct 14, 2010

A siri like (personal assistant) interface developed as part of my PhD research (focus on mutlimodal interaction), circa 2009

Multimodal interfaces (interfaces that support more than 1 interaction modalities) offer a richer user experience; they are more flexible and robust (at the cost of greater design and implementation complexity).

This video is about a multimodal mobile interaction application demonstrating how to exploit speech and GUI (touch) modalities to enrich user experience. The application scenario is a travel reservation service. The user can use either GUI or speech input at each interaction turn, that is, selecting values from a list by touch or directly speaking, e.g. "I want to fly from Orlando to Chicago next Friday evening".

This specific demonstration showcases 4 different interactions modes, one unimodal (GUI only input) and 3 different multimodal ones:
-"Click-to-Talk": user clicks speech button to talk
-"Open-Mike": speech input using voice activity detection
-"Modality-selection": default input modality chosen on modality efficiency; the system switches between "Click-to-Talk" & "Open-Mike" depending on current context

Note that the same (and also the simpest possible, e.g. one way trip without car/hotel reservation) scenario (New-York to Chicago, etc.) is demonstrated for all different interaction modes (Of course everything you can do with GUI you can do with speech). This video was shot to showcase the porting to iphone platform (with the help of V Kouloumenta); the platform also runs on PCs and various PDAs (e.g. Zaurus), since 2006.

This demo is part of my PhD work at Electronics & Computer Engineering Dept, Technical University Crete under the supervision of A. Potamianos. For more info you may refer to:
M. Perakakis and A. Potamianos. A study in efficiency and modality usage in multimodal form filling systems. IEEE Transactions on Audio, Speech and Language Processing, 2008.

Links:
http://perak.wordpress.com/2010/10/15/multimodal-mobile-interaction-blending-...
http://gr.linkedin.com/in/manolisperakakis

Category:

Science & Technology

Tags:

License:

Standard YouTube License

  • likes, 0 dislikes

Link to this comment:

Share to:
see all

All Comments (1)

Sign In or Sign Up now to post a comment!
  • Ωραίος! Και παίζει και με ελληνική προφορά :)

Loading...

Alert icon
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more