Mel Scale - History and Other Formulas

History and Other Formulas

There is no single mel-scale formula. The popular formula from O'Shaugnessy's book can be expressed with different log bases:

The corresponding inverse expressions are:


There were published curves and tables on psychophysical pitch scales since Steinberg's 1937 curves based on just-noticeable differences of pitch. More curves soon followed in Fletcher and Munson's 1937 and Fletcher's 1938 and Steven's 1937 and Stevens and Volkmann's 1940 papers using a variety of experimental methods and analysis approaches.

In 1949 Koenig published an approximation based on separate linear and logarithmic segments, with a break at 1000 Hz.

Gunnar Fant proposed the current popular linear/log formula in 1949, but with the 1000 Hz corner frequency.

An alternate expression of the formula, not depending on choice of log base, is noted in Fant (1968):

In 1976, Makhoul and Cosell published the now-popular version with the 700 Hz corner frequency. As Ganchev et al. have observed, "The formulae, when compared to, provide a closer approximation of the Mel scale for frequencies below 1000 Hz, at the price of higher inaccuracy for frequencies higher than 1000 Hz." Above 7 kHz, however, the situation is reversed, and the 700 Hz version again fits better.

Data by which some of these formulas are motivated are tabulated in Beranek (1949), as measured from the curves of Stevens and Volkman:

Beranek 1949 mel scale data from Stevens and Volkman 1940
Hz 20 160 394 670 1000 1420 1900 2450 3120 4000 5100 6600 9000 14000
mel 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250

A formula with a break frequency of 625 Hz is given by Lindsay & Norman (1977); the formula doesn't appear in their 1972 first edition:

Most mel-scale formulas give exactly 1000 mels at 1000 Hz. The break frequency (e.g. 700 Hz, 1000 Hz, or 625 Hz) is the only free parameter in the usual form of the formula. Some non-mel auditory-frequency-scale formulas use the same form but with much lower break frequency, not necessarily mapping to 1000 at 1000 Hz; for example the ERB-rate scale of Glasberg & Moore (1990) uses a break point of 228.8 Hz, and the cochlear frequency–place map of Greenwood (1990) uses 165.3 Hz.

Other functional forms for the mel scale have been explored by Umesh et al.; they point out that the traditional formulas with a logarithmic region and a linear region do not fit the data from Stevens and Volkman's curves as well as some other forms, based on the following data table of measurements that they made from those curves:

Umesh et al. 1999 mel scale data from Stevens and Volkman 1940
Hz 40 161 200 404 693 867 1000 2022 3000 3393 4109 5526 6500 7743 12000
mel 43 257 300 514 771 928 1000 1542 2000 2142 2314 2600 2771 2914 3228

Read more about this topic:  Mel Scale

Famous quotes containing the words history and, history and/or formulas:

    We don’t know when our name came into being or how some distant ancestor acquired it. We don’t understand our name at all, we don’t know its history and yet we bear it with exalted fidelity, we merge with it, we like it, we are ridiculously proud of it as if we had thought it up ourselves in a moment of brilliant inspiration.
    Milan Kundera (b. 1929)

    No matter how vital experience might be while you lived it, no sooner was it ended and dead than it became as lifeless as the piles of dry dust in a school history book.
    Ellen Glasgow (1874–1945)

    That’s the great danger of sectarian opinions, they always accept the formulas of past events as useful for the measurement of future events and they never are, if you have high standards of accuracy.
    John Dos Passos (1896–1970)