Talk:TIMIT

	Linguistics portal This article is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of linguistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
	This article has not yet received a rating on the project's importance scale.
	This article is supported by Applied Linguistics Task Force.

[Untitled]

I am faced with the task of translating TIMIT to IPA. Is there some resource that might help me?

Draft for updating the article

used as a benchmark corpus; telephone-channel versions such as STC-TIMIT are derived from it [1]
TIMIT was used to train a speech recognizer for a Blizzard Challenge 2016 system [2]
full name is: DARPA-TIMIT Acoustic-Phonetic Continuous Speech Corpus [3]
the acronym stands for: Texas Instruments/Massachusetts Institute of Technology (TIMIT) [4]
first CD-ROM version was released in 1988 [4]
630 different speakers each read 10 sentences, roughly 30 seconds of speech per speaker [4]
creating the TIMIT dataset cost about 1.5 million US$ [5]

History

The TIMIT corpus was an early attempt to create and distribute a database of speech samples. It was first released on CD-ROM in 1988. Each of its 630 speakers read 10 sentences, which amounts to roughly 30 seconds of speech per speaker. The overall project cost around 1.5 million US$.

The acronym stands for Texas Instruments/Massachusetts Institute of Technology, and the project was sponsored by DARPA. The corpus was created mainly to support the development and evaluation of speech recognition software, and it has served as a standardized benchmark for that purpose. Telephone-channel versions such as STC-TIMIT were later derived from it. TIMIT was also used to train a speech recognizer for a text-to-speech system entered in the Blizzard Challenge 2016.

Literature

[1] Morales, Nicolás, et al. "STC-TIMIT: Generation of a single-channel telephone corpus." Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08). 2008. 391-395.
[2] Sawada, Kei, et al. "The NITech text-to-speech system for the Blizzard Challenge 2016." Blizzard Challenge 2016 Workshop. 2016.
[3] Bauer, Patrick, David Scheler, and Tim Fingscheidt. "WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network." LREC. 2010.
[4] Garofolo, John S., Lori F. Lamel, and William M. Fisher. "DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CD-ROM." NISTIR 4930. 1993.
[5] Chanchaochai, Nattanun, et al. "GlobalTIMIT: Acoustic-Phonetic Datasets for the World's Languages." Interspeech. 2018.

greetings --ManuelRodriguez (talk) 08:00, 29 February 2020 (UTC)