Computer Science City
College of New York
CSc 59866 - CAPSTONE I
Spoken
Dialog Systems and Voice XML
Assignment 1
- Test
and analyze at least 2 spoken dialog applications:
- To
register: Go to http://www.cunyspeechlab.com, choose ‘Pain Diary”, and then
Enrollment.
- You
will fill a form, and will receive an email with your password, and
further instructions. You don’t have to enter your real personal
data (age and ethnicity). You can choose there any random entry.
Originally they were
required by NIH to guarantee that the study is
balanced.
- Place
at least 1-2 call a day
(more is better!) over a period of a week – 10 days.
Experiment with the system and document the dialogs.
- Write
a report describing your perceived call flow (as a diagram); how the
system behaves when it makes recognition errors; how it reacts to
silence; does it offer help? How?; what happens
if you make a mistake and confirm
negatively the right answer? Does it become easier/faster to complete a
session as you get more experienced? Does it allow the users to change
topic (mixed initiative)?
- Tall
free directory assistance.
- Call
1800-555-1212 after 5 pm.
- Place
at least 10 calls. Try to find numbers for big companies such as airlines or
banks, and small companies – like pizza shops near you. Try to find
the number for city college. Document the
dialogs.
- Write
a report describing your perceived call flow (as a diagram); how the
system behaves when it makes recognition errors; how it reacts to
silence; does it offer help? How?; what happens
if you make a mistake and confirm
negatively the right answer? Does it become easier/faster to complete a
session as you get more experienced? Does it allow the users to change
topic (mixed initiative)?
- Research
assignment (you can do it in groups). Explore two VoiceXML
depelopment environments:
a)
http://cafe.bevocal.com/index.html
b)
http://www.vxml.org/ or IBM Voice Toolkit http://www-306.ibm.com/software/pervasive/voice_toolkit/
Create an account in both sites, and run the Hello world example.
Compare the two development environments. Which one do you prefer and
why?