- Contents
CIC Text to Speech Engines Technical Reference
Supported Say-as Text Normalization
In verbal conversations, certain categories of speech, such as currency
and time, use a specific method to convey information. For example, when
people read $12,345
, it is usually spoken as "twelve-thousand-three-hundred-forty-five
dollars" as opposed to "dollar-symbol one-two (pause) three-four-five",
which is how a computer might interpret it.
In TTS, say-as text normalization directs the speech synthesizer to speak text in a specific manner so that the listener understands it. Without the say-as functionality, the synthesizer might speak a time of 10:30 AM as "one-zero-three-zero am". With say-as, the synthesizer can say the time as "ten-thirty a-m", which the listener understands easier.
The following table lists the say-as normalization types that are available and their support within Interaction Text to Speech.
Text normalization type |
Usage |
Supported |
Notes |
|
---|---|---|---|---|
address |
Processes mailing addresses |
Yes |
Processes US addresses, including military addresses, Post Office boxes, and rural routes. Abbreviations are expanded based on context: Abbreviation examples: 123 Main St. "St." spoken as "street" Springfield, IN "IN" spoken as "Indiana" PO Box "PO" spoken as "post office" RR 2 "RR" spoken as "rural route" Tips:
Supported languages:
|
|
alphanumeric |
Spells letters and numbers |
Yes |
Supported languages: |
|
|
|
|||
boolean |
Understands yes and no |
Yes |
This
type uses the VoiceXML 2.0 defined type for Boolean. ITTS supports
usage of Supported languages: |
|
|
|
|||
currency |
Processes currency amounts |
Yes |
In
specifying currency values, you can use either the monetary symbol,
such as ITTS processes values to the right of a decimal point for only four digits. ITTS ignores any additional digits. Supported languages: |
|
|
|
|||
date |
Processes chronological dates |
Yes |
ITTS supports the following date formats:
Example (en-US):
Output (en-US): "January first, nineteen eighty-four" Tip: You can use the following delimiters when specifying dates: You can also type single-digits for months and days, and two-digit years. Supported languages: |
|
|
|
|||
digits |
Reads strings digit-by-digit |
Yes |
Supported languages: |
|
|
|
|||
number |
Reads strings as a value (not digit-by-digit) |
Yes |
Supported languages: |
|
|
|
|||
ordinal |
Processes ordinal numbers, such as "first", "second", and so on. |
Yes |
You can specify only the digits or the digits and an ordinal ending for the supported language. Note: If the language uses gendered forms for ordinals, and the gender is not specified in the text, ITTS hypothesizes the most-likely gender for the ordinal. Supported languages: |
|
|
|
|||
spell |
Processes characters within a string, such as a word |
Yes |
The
synthesizer speaks each character individually. Any punctuation
characters in the string are read and named, such as ampersand
( Supported languages: |
|
|
|
|||
telephone |
Processes telephone numbers |
Yes |
The synthesizer reads telephone numbers, including SIP, according to local conventions in supported languages. |
|
|
|
|||
time |
Processes time statements, such as "12:45 AM" |
Yes |
Supports hours, minutes, seconds, and 12 or 24-hour clock. This normalization type does not support any This normalization type does not support durations, such as "60 minutes". For all languages, the separator for the hours and minutes of
the time is a colon ( Supported languages: |
|
|
|