Talk:TIMIT

From Wikipedia, the free encyclopedia

[Untitled]

I am faced with the task of translating TIMIT to IPA. Is there some resource that might help me?

Draft for updating the article

  • telephone corpus used as a benchmark [1]
  • TIMIT is used to train a speech recognizer during the Blizzard Challenge [2]
  • full name is: DARPA-TIMIT Acoustic-Phonetic Continuous Speech Corpus [3]
  • another full name: Texas Instruments/Massachusetts Institute of Technology (TIMIT) [4]
  • first CD-ROM version was released in 1988 [4]
  • has 630 different speakers, each reading 10 sentences, about 30 seconds of speech per speaker in total [4]
  • creating the TIMIT dataset cost about US$1.5 million [5]
History

The TIMIT corpus was an early attempt to create a database of speech samples. It was published on CD-ROM in 1988 and contains recordings of 630 different speakers, each reading 10 sentences, about 30 seconds of speech per speaker in total. It was the first notable attempt at creating and distributing a speech corpus, and the overall project cost around US$1.5 million.

The acronym stands for Texas Instruments/Massachusetts Institute of Technology, and the project was initiated by DARPA. The main reason for creating a corpus of telephone speech was to train speech recognition software. In the Blizzard Challenge, the participating software has to convert audio recordings into text, and the TIMIT corpus was used as a standardized baseline.

Literature
  • [1] Morales, Nicolás, et al. "STC-TIMIT: Generation of a single-channel telephone corpus." Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) (2008): 391-395.
  • [2] Sawada, Kei, et al. "The NITech text-to-speech system for the Blizzard Challenge 2016." Blizzard Challenge 2016 Workshop. 2016.
  • [3] Bauer, Patrick, David Scheler, and Tim Fingscheidt. "WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network." LREC. 2010.
  • [4] Garofolo, John S., Lori F. Lamel, and William M. Fisher. "DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CD-ROM." NISTIR 4930. 1993.
  • [5] Chanchaochai, Nattanun, et al. "GlobalTIMIT: Acoustic-Phonetic Datasets for the World's Languages." Interspeech. 2018.

greetings --ManuelRodriguez (talk) 08:00, 29 February 2020 (UTC)