Physical Modelling of the vocal tract of
a Sygyt singer
Chen-Gia Tsai
Source theory vs. Resonance theory
Two
types of overtone-singing should be distinguished: Sygyt and Kargyraa. In Sygyt
performances, the rising tongue divides the vocal tract into two cavities,
which are connected by a narrow channel, whereas the tongue does not rise in
Kargyraa performances.
Up
until now, two major theories have been proposed on the production of the melody
pitch: (1) The 'double-source' theory (Chernov & Maslov 1987), which
asserts the existence of a second sound source such as a whistle-like mechanism
formed by the narrowing of the false vocal folds (ventricular folds) in
addition to the true vocal fold vibration; and (2) the 'resonance' theory,
which asserts that only a glottal sound source exists, but that an upper
harmonic is so emphasized by an extreme resonance of the vocal tract that it is
segregated from the other components and heard as another pitch. The fact that
the melody pitches producible by the singer are limited to the harmonic series
of the drone supports the resonance theory (Adachi & Yamada 1999).
Physical
modelling of the resonance of the vocal tract of Sygyt singers includes: (1)
rear cavity theory, (2) front cavity theory, and (3) resonance-matching theory.
The glottal sound source of Sygyt voices is rich in harmonics. This has been
attributed to the short open duration of the glottis (Bloothooft et al. 1992,
Adachi & Yamada 1999).
Rear cavity theory
Based
on vocal tract shape measurements by MRI, Adachi and Yamada (1999) reported
that the resonance of the rear cavity, that was, from the glottis to the
narrowing of the tongue, produced the sharp formant Fk. The resonance of the
front cavity, that was, from the articulation by the tongue to the mouth exit,
was not critical to the production of the melody pitch. The length of the rear
cavity decreases as fk increases.
Adachi
and Yamada (1999) synthesized tones from transfer functions calculated with and
without the front cavity, finding that the front cavity did not affect the
formant frequencies, although the magnitude of Fk decreased due to the lack of
the front cavity resonance. It is important to note that Adachi and Yamada
calculated the transfer functions of a Sygyt singer's vocal tract using a
one-dimensional model, in which the tract shape was approximated as a
succession of cones. While such models are widely used in speech research, I
argue that the change in the tract shape at the articulation point is so abrupt
that the assumption of planar-wave fronts clearly breaks down. Theoretically,
one-dimensional models are unsuitable for a Sygyt singer's vocal tract.
In
practice, the rear cavity theory is not supported by a non-traditional
technique of overtone-singing used by Tran Quang Hai, who calls it 'one-cavity
technique' because the tongue does not rise to divide the vocal tract into two
cavities. However, there is an articulation point at the soft palate, as to
pronounce the velar /ng/. The melody of fk is produced by manipulating the
opening of the front cavity, while the rear cavity, that is, from the glottis
to the soft palate, remains unchanged. This technique suggests that the front
cavity may be more important for the production of fk.
Front cavity theory
Based
on preliminary impedance measurements of vocal tract by a Jew's harp, Tsai
(2001) reported that the resonance of the front cavity determined fk. The
author modelled the front cavity as a Helmholtz resonator driven by a flow
source U1 at the articulation point. The transfer function can be calculated
according to Eq. (6.65) in [Fletcher & Rossing 1991].
Owing
to the tract shape at the articulation point, the flow U1 is presumed to be
incompressible. It is known that in regions of fast change in pipe geometry,
such as a tone hole or the pipe termination, the Helmholtz number He<<1
implies that the wave equation can locally be approximated by the Laplace
equation, which describes an incompressible potential flow (Hirschberg &
Kergomard 1995). In overtone-singing, the acoustic flow at the articulation
point is therefore incompressible (compact region). This is not true for normal
phonations.
The
front cavity theory failed to explain the small bandwidth of Fk. Fig. 2 compares
the matched theoretical spectral envelops and recorded spectra of a Sygyt voice
and a Jew's harp tone, which were produced by me with the same front cavity. It
can be seen that the Fk bandwidth of the voice is smaller than that of the
Jew's harp tone. The latter was produced without the rear cavity because the
rising tongue completely closed the channel between the front and the rear
cavities. This discrepancy suggests that the rear cavity may play a role in
sharpening Fk.


Figure 2: Spectra of a Sygyt voice (left) and a Jew's harp tone (right)
produced with the same front cavity.
Resonance-matching theory
The resonance-matching theory
takes into account the contributions of both the front and the rear cavities,
whose resonances are more or less matched to produce a sharp Fk. Kob (2002),
reported that an improvement of the second resonance by about 15 dB was
achieved by matching two resonance frequencies, which was fulfilled by
manipulating the mouth opening. Although this theory appears to 'unified' the
theories of rear/front cavity, it should be noted that according to Table 6.1
in [Kob 2002], the resonance of the front cavity was just close to the second
resonance of the rear cavity; Fk could be sharp enough for pitch production
without an exact resonance-matching.
Discussion
Kob (2002) calculated the
transfer functions of a Sygyt singer's vocal tract using an improved method of
continuous-time interpolated multiconvolution (Barjau et al. 1999), which was
originally developed to calculate the impulse response of wind instruments with
tone-hole discontinuities. However, this approach does not predict the flow
field at the articulation point. Fig. 3 displays the shape of a Sygyt singer's
vocal tract and the potential field at the articulation point. As can be seen
from the isobar (equal-potential) lines, the acoustic flow has a higher
velocity near the tongue. This contradicts the assumption of planar-wave fronts
in Kob's calculation.

Figure 3: Shape of a Sygyt singer's vocal tract (left) and the isobar lines at
the articulation point (right).
The
limitations of one-dimensional models of the vocal tract or the bore of wind
instruments should be borne in mind: even at low frequencies evanescent
cross-modes will be excited in the rapidly flaring bell section because of
strong mode coupling (e.g., Pagneux et al. 1996). In a Sygyt singer's vocal
tract, one-dimensional models are suitable only for the rear cavity.
The
vocal tract sould be divided into four regions, in which the wave equations
have different forms for approximation. In light of Matched Asymptotic
Expansions, the global solution can be obtained by 'gluing' the local solutions
together (Hirschberg & Kergomardh 1995). The four regions are (1) the rear
cavity, (2) the compact region at the articulation point, (3) the front cavity
as a Helmholtz resonator, and (4) the compact region at the mouth opening. The
rear cavity is approximated as a succession of cones, where the acoustic field
is governed by the Webster equation for He<<1. At the articulation point
and at the mouth opening, the incompressible air is approximated as a piston.
The front cavity is a Helmholtz resonator with a short neck.
If
the transfer function of a Sygyt singer's vocal tract does not predict the
small bandwidth of the second formant, one should consider the possible effect
of acoustic feedback to the glottal source (Levin and Edgerton 1999). This may
be related to the nonlinear effect of the adducted ventricular folds.
Return to Mongolian Khoomii Singing main Page