Archive for the ‘Fast Fourier Transform’ tag
Dialect Detectives

Pedro Torres-Carrasquillo working on identifying different dialects using an automated computer program.
We live in such a diverse world today. With over 6 billion people of different ethnic backgrounds, spread across six continents, it’s sometimes hard to fathom the immensity and beauty of it all. However, one of the drawbacks of this diversity is the barriers it can create, in particular one of language. However, even within a single language, there may exist several different dialects that even native speakers can’t decipher. That’s where Pedro Torres-Carrasquillo and his colleagues at Lincoln Laboratory have come in.
While there exists language-detecting software currently on the market, no such identification system has been made for the different dialects that exist within a certain language. Torres-Carrasquillo, an electrical engineer specializing in speech processing, believes that by analyzing the frequency spectra of several short bursts of sound (using tools such as the Fast Fourier Transform), he will be able to pinpoint the key differences between different dialects.
Previously, Torres-Carrasquillo says, the approach was to “get a lot of examples, and then build a model that looks like your examples.” But he is tackling the problem in a different way. “Our group’s idea is that we don’t need a model that looks like our data – we need a model that can classify our data,” he explains. “We take very small pieces – snippets of speech – turn them into frequencies, add up all these contributions, and make a model that can tell them apart. We’re looking for patterns from just milliseconds of speech.”
According to Torres-Carrasquillo and his team, their technique will allow them to quantify the linguistic differences between dialects, such as the different pronunciations of vowel sounds between Cuban and Puerto Rican Spanish. So far, they have already been able to discriminate against American English and Indian-accented English with an error rate of only about 7 percent.
To me, this breakthrough illustrates how technology can not only simplify our lives, but can also break down barriers between people of different ethnicities and backgrounds. While it may be a stretch now, maybe one day this will pave the way for speech translators or digital language tutors. Vacation in Spain anyone?
It’s been a Hard Days Night
How can technology aid scientific discovery? We’ve covered on Bench Press examples such as providing the computing power needed to simulate particle physics and using cosmic muons to scan Mayan temples. But what about something closer to home – like figuring out what the “infamous” opening chord in the Beatles classic “A Hard Days Night” is?
As you probably guessed from the fact that I’m writing this, the answer is yes.
Professor Jason Brown of Dalhousie University applied a technique called Fourier analysis to the problem at hand. Fourier analysis is useful for decomposing a particular waveform into the fundamental frequencies which make it up. Or, to put it more commonly, it lets you take a waveform (like a particular sound) and figure out what all the underlying frequencies are which make it up.
The concept of Fourier analysis has been around at least as early as the 1800s when Jean Baptiste Joseph Fourier used the concept to explore heat propagation. However, the lack of computers made the technique unwieldy for exploring real world analog data. This problem was addressed with the development of programs and integrated circuits adept at deploying a numerical approximation of Fourier analysis called the Fast Fourier Transform (oftentimes abbreviated FFT) which has made an entire realm of sophisticated analyses possible and relatively simple to do.
What Brown did was very elementary. Using a digital recording of the Beatles classic and the widely available program Mathematica, he broke down the opening “twang” into the 29375 frequencies that made it up. He then applied a filter (to filter out harmonics and background noise) to pick out the 48 most important frequencies and compared them to popular “estimates” of what was played.
What Brown found was that most popular guesses of the opening were wrong as they assumed the Beatles had played the G2 note (where C4 is “middle C”) which didn’t show up at all in his Fourier analysis. But, using information about what instruments the Beatles played (which gives you information like knowing that pianos make 3 distinct frequencies due to the hammer hitting 3 strings simultaneously), Brown deduced what countless enthusiasts have guessed at for over 40 years. Without further ado, the opening sound to “It’s a Hard Day’s Night”:
Awesome.