Speaking, Talking, Telling

Spoken Language and Text Corpora

Languages



Nafsan (South Efate), Efate, Vanuatu

The Nafsan language, also known as South Efate, is a Southern Oceanic language spoken on the island of Efate in central Vanuatu. As of 2005, there were approximately 6,000 speakers living in coastal villages from Pango to Eton.

Dalabon, Australia

Dalabon is spoken in central Arnhem Land by a dwindling population, now reduced to fewer than half a dozen fluent speakers, although many middle-aged people and young adults understand the language to varying extents. It belongs to the Gunwinyguan language family.

Matukar Panau, Papua New Guinea

Matukar Panau is an Oceanic language spoken near Madang in Madang Province, Papua New Guinea. Since 2010, the language has been documented in an ongoing project led by Danielle Barth and community members. Linguistic work can be found under resources.

Anindilyakwa

Bininj Gun-Wok

Gurindji

Kaytetye

Mudburra

Kriol

Murrinhpatha

Warlpiri

Wubuy

Yolngu

Gurindji Kriol

Nen

Marind

Ku Waru

Abui

Kalam

Cook Islands Māori, Southern Cook Islands and diaspora populations (mainly New Zealand and Australia)

Warumungu

Mawng

Ngaanyatjarra

Warnman

Kayardild

Wutung

Bislama, Vanuatu

Bislama, also known by its earlier French name bichelamar, is a creole language and one of the official languages of Vanuatu. It is the first language of many of the "Urban ni-Vanuatu" (those who live in Port Vila and Luganville) and the second language of much of the rest of the country's residents.

Nmbo, Papua New Guinea





ANNIS Corpus Viewer

ANNIS is an open-source, cross-platform (Linux, Mac, Windows), web browser-based search and visualization architecture for complex multi-layer linguistic corpora with diverse types of annotation. ANNIS, which stands for ANNotation of Information Structure, was originally designed to provide access to the data of SFB 632, “Information Structure: The Linguistic Means for Structuring Utterances, Sentences and Texts”.
