You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

88 lines
1.9 KiB

This file contains ambiguous Unicode characters!

This file contains ambiguous Unicode characters that may be confused with others in your current locale. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to highlight these characters.

From text to speech: The MITalk system
The old man sat in a rocker.
PHONO2: Function word: DH IY
PHONO2: Content word: 'OW LX DD
PHONO2: Content word: MM 'AE NN [End NOUN phrase]
PHONO2: Content word: SS '"AE DX
PHONO2: Function word: IH NN
PHONO2: Function word: AX
PHONQO2: Content word: RR AA KK * - ER
PHONO2: Punctuation:
PHONQ2: <EOF>
PROSOD: (Silence] 30ms. 133.4Hz.
PROSOD: Function word:
PROSOD: DH 50ms. 123.4Hz.
PROSOD: IY 105ms. 131.4Hz.
PROSOD: Content word:
PROSOD: "OW 170ms. 174.5Hz. Stressed
PROSOD: LX 75ms. 151.0Hz.
PROSOD : DD 50ms. 146.0Hz.
PROSOD: Content word:
PROSOD: MM 70ms. 151.0Hz. Stressed
PROSOD: 'AE 210ms. 157.0Hz. Stressed
PROSOD: NN 55ms. 117.9Hz.
PROSOD: [End NOUN phrase]
PROSOD: Content word:
PROSOD: SS 100ms. 122.9Hz. Stressed
PROSOD: 'AE 175ms. 153.9Hz. Stressed
PROSOD: DX 20ms. 140.1Hz.
PROSOD: Function word:
PROSQOD: IH 55ms. 148.1Hz.
PROSOD: NN 50ms. 142.5Hz.
PROSOD: Function word:
PROSOD: AX 60ms. 142.5Hz.
PROSOD: Content word:
PROSOD: RR 80ms. 140.2Hz. Stressed
PROSOD: "AA 160ms. 146.2Hz. Stressed
PROSOD: KK 65ms. 113.1Hz.
PROSOD: *
PROSOD: -
PROSOD: ER 170ms. 108.1Hz.
PROSOD: Punctuation: .
PROSOD: [Silence] 400ms. 111.2Hz.
PROSOD: [End sentence]
PROSOD: <EOF>
Figure 9-1: Example of the processing performed by PROSOD
DUR=((INHDUR-MINDUR)xPRCNT)/100+MINDUR (1)
where INHDUR is the inherent duration of a segment in msec, MINDUR is the
minimum duration of a segment in msec, and PRCNT is the percentage shortening
determined by applying rules 1 to 10 below. The program begins by obtaining
values for INHDUR and MINDUR for the current segment from Table 9-1, and by
setting PRCNT to 100. The inherent duration has no special status other than a
starting point for rule application; it is roughly the duration to be expected in non-
sense CVCs spoken in the carrier phrase “Say bVb again” or “Say Cab again”.
The following ten rules are then applied, where each rule modifies the PRCNT
94