Archive for the ‘Workshops’ Category

STKU-1: Introduction to Speech Technologies

Sunday, August 7th, 2011

PreConference – Sun, August 7 
1:30 p.m. – 4:30 p.m.

Learn how to design and build multimodal applications, including the principles of multimodal design, and how to maximize the usability of multimodal applications for a variety of different users. The emphasis is on mobile applications that combine speech and graphics, but other input modalities such as pen or stylus and sensors are also discussed. Major design issues are examined, including the components of a multimodal application, maintaining consistency between the voice and graphical parts of a multimodal application, pros and cons of avatars, initiating voice input, when to use audio output and other types of feedback, accommodating different types of users, available platforms and development tools. We walk attendees through a development process with a current development platform, discuss standards for multimodal development, and evaluate the various types of testing and evaluation, including focus groups and field-testing.

Presented by: James A. Larson

STKU-2: Introduction to Voice Interaction Design

Sunday, August 7th, 2011

PreConference – Sun, August 7 
1:30 p.m. – 4:30 p.m.

Jump-start your knowledge in the field of voice user interface design. This session is designed to quickly get those new to VUI design up-to-speed to make the most of the Voice Interaction Design track. This tutorial will illustrate why VUI design is the make-or-break factor for speech applications and how to make smart design decisions from Day 1. Learn how to encourage customers to accept and use speech automation by focusing on the perceptions and reactions of end users throughout the design process. This tutorial will cover the basics in VUI design — the current and future state of technology (including multimodality); speech project methodology; design principles; rules for efficient, no-nonsense call flows; and evaluation techniques — so you can learn what works and what real customers think.

Presented by: Jenni McKienzie

STKU-3: Voice-Enabling Mobile Apps for the Android and iOS

Thursday, August 11th, 2011

PostConference – Thu, August 11 
9:00 a.m. – 12:00 p.m.

This session examines recent industry trends in mobile speech applications, covering both speech recognition and text-to-speech and the differences among embedded, connected and hybrid deployment scenarios, and outline the associated trade-offs regarding availability, latency, accuracy and privacy The majority of the session is a hands-on exploration of implementing speech into mobile apps, including practical issues associated with this. We will walk you through building your very own speech-enabled application for Apple iOS or Google Android. To participate fully, please bring your Mac or Mac/Windows/Linux laptop with Eclipse loaded with additional software from Nuance Communications. You will need to sign a no-cost development license con tract with Nuance before attending this course. For details visit http://bit.ly/eDePmT.

Presented by: Aaron Masih, Anthony Gillet, Alex Kinney

STKU-4: Tuning Speech Recognition: How to Get the Best Result With Minimum Effort

Thursday, August 11th, 2011

PostConference – Thu, August 11 
9:00 a.m. – 12:00 p.m.

The creation of high-quality speech applications is an art, and, especially for ASR, entails the tuning of the speech recognition performance. This tuning task is not a trivial one and requires specialist knowledge. The goal is to present many use cases taken from a wide variety of different applications of speech recognition in today’s speech market. This tutorial starts off assessing what makes a correct and sound tuning methodology and also covers best practices, suggested tips, and a trick.

Presented by: Paolo Baggia

STKU-5: Introduction to Natural Language

Thursday, August 11th, 2011

PostConference – Thu, August 11 
9:00 a.m. – 12:00 p.m.

Natural language processing, particularly in the form of statistical models, is being used in more speech applications. This session introduces natural language processing and its role in speech applications. The key ideas are the following: what natural language is, the statistical language model (SLM) approach to natural language processing, when and how to use natural language processing techniques in an application, how to combine natural language processing techniques with grammars and directed dialogues to achieve optimal application performance, commercially available natural language tools, a brief discussion of research areas and newer technology such as the technology used in the IBM Watson Jeopardy system. This tutorial is aimed at an audience with a general technical background. Experience developing speech applications would be helpful. Attendees can experiment with a simple open-source system that illustrates the key concepts of SLMs.

Presented by: Deborah Dahl

Break

Thursday, August 11th, 2011

PostConference – Thu, August 11 
12:00 p.m. – 1:30 p.m.

STKU-6: Designing and Building Multimodal Applications

Thursday, August 11th, 2011

PostConference – Thu, August 11 
1:30 p.m. – 4:30 p.m.

Learn how to design and build multimodal applications, including the principles of multimodal design, and how to maximize the usability of multimodal applications for a variety of different users. The emphasis is on mobile applications that combine speech and graphics, but other input modalities such as pen or stylus and sensors are also discussed. Major design issues are examined, including the components of a multimodal application, maintaining consistency between the voice and graphical parts of a multimodal application, pros and cons of avatars, initiating voice input, when to use audio output and other types of feedback, accommodating different types of users, available platforms and development tools. We walk attendees through a development process with a current development platform, discuss standards for multimodal development, and evaluate the various types of testing and evaluation, including focus groups and field-testing.

Presented by: Deborah Dahl

STKU-7: Advanced Topics in Grammar and Lexicon Development

Thursday, August 11th, 2011

PostConference – Thu, August 11 
1:30 p.m. – 4:30 p.m.

Without well-constrained grammars and lexicons to support it, a great design isn’t worth the paper it’s printed on. What types of concerns should be taken into consideration when designing complex grammars? What are some characteristics of a complex grammar task? What grammar features can be leveraged to optimize recognition? This session begins with a very high-level refresher on GRXML grammar structure and then delves into advanced topics in how to optimize recognition for a variety of complex tasks. We discuss what makes these tasks complex and, in each case, some how-to methodologies for optimizing recognition given the task.

Presented by: Charles Galles, Judi Halperin

STKU-8: Everything Managers Need to Know About Design, But Didn’t Know to Ask

Thursday, August 11th, 2011

PostConference – Thu, August 11 
1:30 p.m. – 4:30 p.m.

One of the biggest challenges for companies deploying speech-enabled technologies is managing the design phase of the project. Design tasks are often ess familiar than technical tasks, thus are often scoped incompletely or inappropriately in project plans, leading to slipped deadlines and budgets. This course arms managers with the information they need to understand what design is, what resources are needed to produce good design, when and how o build design tasks into project plans, and how to ensure that design activties produce worthwhile results in a project. This course doesn’t teach you how to design, but rather, how to support and benefit from a user-centric design process.

Presented by: Susan L. Hura