TY - GEN
T1 - What if everyone could do it? A framework for easier spoken dialog system design
AU - Milhorat, Pierrick
AU - Schlögl, Stephan
AU - Boudy, Jérôme
AU - Chollet, Gérard
PY - 2013
Y1 - 2013/7/29
AB - While Graphical User Interfaces (GUI) still represent the most common way of operating modern computing technology, Spoken Dialog Systems (SDS) have the potential to offer a more natural and intuitive mode of interaction. Even though some may say that existing speech recognition is neither reliable nor practical, the success of recent product releases such as Apple's Siri or Nuance's Dragon Drive suggests that language-based interaction is increasingly gaining acceptance. Yet, unlike applications for building GUIs, tools and frameworks that support the design, construction and maintenance of dialog systems are rare. A particular challenge of SDS design is the often complex integration of technologies. Systems usually consist of several components (e.g. speech recognition, language understanding, output generation, etc.), all of which require expertise to deploy them in a given application domain. This paper presents work in progress that aims at supporting this integration process. We propose a framework of components and describe how it may be used to prototype and gradually implement a spoken dialog system without requiring extensive domain expertise.
KW - Language technology components
KW - SDS design
KW - WOZ
U2 - 10.1145/2480296.2480325
DO - 10.1145/2480296.2480325
M3 - Conference contribution
AN - SCOPUS:84880547052
SN - 9781450322133
T3 - EICS 2013 - Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems
SP - 217
EP - 222
BT - EICS 2013 - Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems
T2 - 5th ACM SIGCHI Symposium on Engineering Interactive Computing Systems, EICS 2013
Y2 - 24 June 2013 through 27 June 2013
ER -