Who needs a time machine? Scientists reconstruct ancient languages with software

February 15, 2013

The new software has already accurately reconstructed the Proto-Austronesian language, which was spoken by the ancient inhabitants of Easter Island (Photo: Shutterstock)


Imagine the wealth of knowledge we could uncover if it were possible to travel back in time and reconstruct ancient languages. While that’s impossible right now, scientists at UC Berkeley and the University of British Columbia reckon they’ve managed the next-best thing, by developing new software which recovers surviving fragments of “proto-languages” from languages still in use.

Proto-languages are linguistic ancestors which gave rise to modern languages. These forebears include Proto-Indo-European, Proto-Afroasiatic and Proto-Austronesian. Typically, their reconstruction is a painstaking process which can take linguists many years.

The new software uses probabilistic reasoning, which draws on logic and statistics, to perform its reconstructive work. It focused on 637 modern Austronesian languages, and analyzed a database of over 140,000 words to provide a reconstruction of Proto-Austronesian which matched the work of human linguists with an accuracy of 85 percent – though far more quickly.

Indeed, the researchers posit that a large-scale reconstruction could be performed in a matter of days or even hours in this way.

The computer program is based upon the linguistic theory that words evolve in a way which can be thought of as similar to a family tree. That is, traces of proto-languages remain in the “roots” of languages even as they evolve over time.

Using a type of algorithm known as a Markov chain Monte Carlo sampler, the software sorted through sets of words in the modern Austronesian languages which share a common sound, history and origin. From there, it determined whether the words shared a common mother language – in this case, Proto-Austronesian.
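To give a rough feel for what a Markov chain Monte Carlo sampler does, here is a minimal toy sketch in Python. It is not the researchers' model, which is far richer. Everything in it is an assumption made for illustration: the function names, the `beta` weight, the use of simple letter-by-letter mismatches as a score, and the small list of words for "eye" from Malay, Tagalog, Maori, Hawaiian and Fijian. The sampler proposes one-letter changes to a candidate ancestor word and keeps them with a probability that favors candidates closer to the modern words.

```python
import math
import random

# Toy input (assumption): words for "eye" in Malay, Tagalog, Maori,
# Hawaiian and Fijian, all the same length to keep the sketch simple.
COGNATES = ["mata", "mata", "mata", "maka", "mata"]


def hamming(a, b):
    """Count positions where two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def log_score(candidate, words, beta):
    """Unnormalized log-probability: fewer mismatches scores higher."""
    return -beta * sum(hamming(candidate, w) for w in words)


def metropolis(words, steps=5000, beta=2.0, seed=0):
    """Metropolis sampler over candidate ancestor forms.

    Returns a dict mapping each visited form to its visit count.
    """
    rng = random.Random(seed)
    alphabet = sorted(set("".join(words)))
    current = list(rng.choice(words))  # start from one of the modern words
    counts = {}
    for _ in range(steps):
        # Symmetric proposal: overwrite one random position with a random letter.
        proposal = current[:]
        proposal[rng.randrange(len(proposal))] = rng.choice(alphabet)
        log_ratio = log_score(proposal, words, beta) - log_score(current, words, beta)
        # Accept improvements always, worse candidates only sometimes.
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            current = proposal
        form = "".join(current)
        counts[form] = counts.get(form, 0) + 1
    return counts


counts = metropolis(COGNATES)
print(max(counts, key=counts.get))  # the most frequently visited candidate
```

The most frequently visited form is the sampler's best guess under this toy score. A real reconstruction would also have to model how sounds change along each branch of the language family tree, which this sketch ignores entirely.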

“What excites me about this system is that it takes so many of the great ideas that linguists have had about historical reconstruction, and it automates them at a new scale: more data, more words, more languages, but less time,” said Dan Klein, an associate professor of computer science at UC Berkeley and co-author of a paper on the subject which was published in the journal Proceedings of the National Academy of Sciences.

In addition to reaching into the past, the researchers note their software can also predict the future evolution of words, providing clues as to how languages will change over time.

Source: UC Berkeley

3 comments

Bruce H. Anderson February 18, 2013 01:45 PM

Perhaps this could be used to decode the Mayan calendar. The first attempt did not turn out well.

Joris Hines February 18, 2013 03:21 PM

They should work on the ancient and mysterious language of the Basque people of the Pyrenees Mountains. I believe it dates back to the earliest civilization and legend has it that they were in Atlantis. They were where they still are now (Pyrenees Mountains) long before Spain was called Spain, or France was called France. Even before Europe was called Europe. Their language is completely unique and unassociated with any other language in Europe.

Wolfhoundpax February 21, 2013 10:25 PM

Great thought Joris, hope they take the suggestion. Too bad the Voynich Manuscript can't be read.
