|
|
From text to speech: The MITalk system
|
|
|
|
|
|
then to the root and leftmost suffix combined. The process continues with one
|
|
|
more suffix adjoined to the string under consideration before each cycle begins,
|
|
|
until the end of the word is reached. This cyclic phase is devoted solely to the
|
|
|
placement of primary stress. Unless otherwise noted, prefixes are considered part
|
|
|
of the root.
|
|
|
|
|
|
The second, noncyclic phase includes the application to the entire word of or-
|
|
|
dered rules and reduces all but one. of the primary stress marks to secondary or
|
|
|
ZETO Stress.
|
|
|
|
|
|
The stress marks used here are ’ for primary stress and " for secondary
|
|
|
stress.
|
|
|
|
|
|
In the following sections, stress placement rules will be given both as for-
|
|
|
mulas and in descriptive (nonsymbolic) form. Each rule which contains more than
|
|
|
one case is broken down into cases for which brief descriptions and examples are
|
|
|
given. It is important to note that a particular case applies only if the rules for pre-
|
|
|
vious cases have not applied, i.e., a maximum of one case per rule is applicable.
|
|
|
The subrules in each case are mutually exclusive. The rules are listed in the order
|
|
|
in which they apply and are marked either cyclic or noncyclic.
|
|
|
|
|
|
In this context, syllable means a vowel followed by any number of consonants
|
|
|
(including none). Weak syllable means a.short (or nontense) vowel followed by, at
|
|
|
most, one consonant before the next vowel. The words vowel and consonant them-
|
|
|
selves denote the vocalic and nonvocalic phonetic segment labels output from the
|
|
|
letter-to-sound conversion stage, rather than the letters in the original word. In the
|
|
|
examples, Klatt symbols are used to represent the segment labels. The short
|
|
|
vowels are: AA, EH, IH, AO, UH, AH, AX, AE, and 1X. Long vowels are: EY, IY,
|
|
|
AY, OW, UW, OY, and Aw. |
|
|
|
|
|
|
Each formula is a phonetic segment string pattern matching expression. The
|
|
|
symbols used in the formulas are defined as follows:
|
|
|
|
|
|
C matches a single consonant. Sub- and superscripts denote lower and upper
|
|
|
bounds, respectively, on the number of replications of the preceding term
|
|
|
(usually C). For example, Cy matches any number of consecutive con-
|
|
|
|
|
|
sonants (including none) while C% matches one or two consonants.
|
|
|
|
|
|
V matches a single vowel.
|
|
|
|
|
|
X and Y match segment strings of any length (including null, unless noted
|
|
|
otherwise).
|
|
|
|
|
|
Brackets [ ] denote the association of one or more features with a vowel. The fea-
|
|
|
|
|
|
62
|