You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

46 lines
2.0 KiB

This file contains ambiguous Unicode characters!

This file contains ambiguous Unicode characters that may be confused with others in your current locale. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to highlight these characters.

S
Morphophonemics and stress adjustment
5.1 Overview
It is not always possible to simply concatenate the pronunciations of the con-
stituent morphs of a word to get its pronunciation. There are sometimes changes in
pronunciation at morph boundaries. Module SOUNDI1 checks for contexts in
which such changes occur, and changes the pronunciation. It also adjusts the lex-
ical stress for compounds and for words having suffixes requiring special stress
rules. SOUNDI1 also performs letter-to-sound conversion for words which were
not segmented by DECOMP (this function will be described in the following
chapter). It accepts as input all the word and morph information given by
DECOMP and the additional phrase part-of-speech information produced by
PARSER. Output is a set of phonetic segment labels for each word along with the
phrase information from PARSER.
5.2 Input
Input to SOUNDI is the output stream from PARSER. The format of this stream
has been described in Chapter 4. It contains morph pronunciation information
from DECOMP and phrase and part-of-speech information from PARSER.
5.3 Output
The output stream from SOUNDI consists of a string of phonetic segment labels,
stress marks, and syllable and morph boundaries for each word. At the end of each
sentence, the phrase information for that sentence is placed in the output stream
(this is simply a duplicate of the phrase information from PARSER).
5.4 Morphophonemic rules
The pronunciation for each word which has been segmented by DECOMP is con-
structed by catenating the pronunciations of its component morphs. The following
rules are applied to modify the morph pronunciations when necessary.
5.4.1 Plurals, possessives, and contractions with “is”
Words which end in a fricative or affricate close in place of articulation to ss and
232, i.e., the set of segments Ss, 2z, SH, zH, CH, and JJ, form their plurals and
possessives by the concatenation of the segment string IB 22, or, in its vowel-
reduced form, Ix 2z2z (e.g. busses, churches, garages, marshs). After other
52