As we have seen, Intonational Phrases frequently contain more than a single pitch accent. In previous sections we have seen four pitch accent labels: the two single-tone accents (H* and L*) and two bitonal accents (L+H* and L*+H). Theoretically, there are no constraints within the ToBI system on how these pitch accents may combine in an Intonational Phrase: any pitch accent label of the set seen so far may precede or follow any of the pitch accent labels in that same set. For example, H* may be followed or preceded by another H*, or by L* or L+H* or L*+H.
However, the !H* pitch accent label ("downstepped H star") is dependent on the immediately preceding labels. It indicates that there is a specific tonal relationship between the prominence labelled !H* and the preceding pitch accent. Specifically, a downstepped H pitch accent indicates that the tone of the prominent syllable is realized by a perceptually lower tone than the High tone target of an immediately preceding pitch accent: the tone has "stepped down" from the preceding High. While the !H* is realized by a lower pitch than the preceding High, it is distinct from the L* pitch accent, which is characterized by a pitch excursion down towards the bottom of the speaker's pitch range (for that utterance). The !H* does not necessarily approach the bottom of the speaker's range. For example, in <another-banana1>, the H* on the -noth- syllable of another is followed by !H* on the -nan- of banana. The phrase ends with a Low phrase tone and a Low boundary tone (L-L%), which are realized by additional lowering of the F0 in the final syllable (-na of banana). (Note that the pitch track shows what looks to be a rise in the F0 at the end of banana, but this is a pitch-tracking error.)
Play another-banana1.wav:
The example <clean_slate>, below shows another instance of the same tones (H* !H* L-L%), this time realized on two single-syllable words, "clean slate." Here, the pitch track shows a nice "step down" from the word clean, labelled with H*, to the word slate, labelled with !H*. (The audible fall in pitch into creaky voice at the end of the word slate, which is the realization of the L-L% phrase tone-boundary tone combination, is not captured by the pitch tracker.)
Play clean_slate.wav:
The !H* label may follow any of the pitch accents which contain an H target, such as H*, L+H* or L*+H. The !H* label will therefore never immediately follow a (single-tone) L* pitch accent, as that pitch accent has no High tone target. The !H* label will likewise never be the first pitch accent in a phrase.
In general, pitch accents labelled !H* are realized in the middle of the speaker's pitch range for that Intonational Phrase; however, they are not considered "mid" tones. Like L+H* and H*, the !H* is a pitch accent for which the prominence is considered to be signalled by a High tonal target, which is why the H symbol is used with the star symbol. As with the other pitch accents with H targets, the actual realization of the High tone can vary quite a lot (i.e. speakers can speak with a relatively low pitch or a high pitch, but still produce High tones). This is particularly apparent in cases where one !H* pitch accent is followed by another !H*. The second !H* means that the target of the this prominence is lower relative to the target of the first !H*.
For example, in Figure 2.6.3 <privateryan> ("Saving Private Ryan") the last prominent (pitch-accented) syllable of the phrase (the Ry- of Ryan) is produced with a lower F0 than that of the previous prominence (on the Pri- of Private) which is in turn lower than the F0 of the first prominent syllable of the phrase (the Sav- of Saving). This relationship in the pitch height is reflected in the use of the use of the !H* labels on the last two pitch accents of the phrase, giving the tone labels H* !H* !H* L-L%. By the way, notice that the pitch range (75-150 Hz, indicted on the right vertical axis in the F0 pane) for this speaker is much more compressed than the range seen in other examples. It is useful to adjust the range for each speaker in order to see the F0 excursions. Otherwise, in another display, such as one from 75-400 Hz, the F0 might appear nearly flat.
Play privateryan.wav:
The same tone labels (where the pitch accents labels are an H* followed by !H* followed by another !H*) can be seen in the example Figure 2.6.4 <fencedmeadow_150>. There are no theoretical limits within the ToBI system as to how many times such downstepping can occur within a phrase.
Play fencedmeadow_150.wav:
On the other hand, there is no special term or diacritic for a second High pitch accent that is realized with a higher f0 than its immediate predecessor. In this case, the second, higher pitch accent would be simply labeled as another H*, as in the following examples:
Play really1.wav:
In this example, the first rendition of That's really illuminating has two pitch accents, an H* on real- and a !H* on -lum- similar to the examples of downstep shown above. However, the second rendition has an H* on real- followed by a higher H* on -lum-. The differences in heights of H*'s in an intonational phrase is not distinctive, that is, medium High H*'s don't convey different information than higher H*'s. However, the term "upstep" is sometimes used casually or in other contexts (see box: What About "Upstep"?).
Figure 2.6.6 : Two more examples of H* followed by another H* realized with a higher f0
Play thought.wav:
Play tree1house.wav:
In this last example, note the middle Intonational Phrase (who lives in a treehouse) where the H* on treehouse is higher than the H* on lives.