The LaMIT database consists of recordings of 100 Italian sentences. The sentences were designed to include all phonemes of the Italian language and to reflect the typical frequency of each phoneme in written Italian. Four adult native speakers of Standard Italian (two female and two male), raised and living in Rome, Italy, pronounced the sentences in two separate recording sessions; two repetitions of each sentence were therefore collected per speaker, for a total of 800 recordings. The database was created specifically for the LaMIT project, which focuses on applying to the Italian language the Lexical Access model proposed by Ken Stevens for American English. The model relies on the detection of specific acoustic discontinuities, called landmarks, and of other acoustic cues to the features that characterize each phoneme. Each recording was thus processed to generate a set of labeling files that identify both predicted and actual landmarks and other cues. The labeling files, compiled according to the labeling syntax used in the Praat speech processing software, are also made available as part of the LaMIT database.
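The corpus size stated above follows from 100 sentences × 4 speakers × 2 repetitions = 800 recordings. As a minimal sketch of that count, the Python snippet below enumerates one entry per recording. It assumes one repetition per recording session; the speaker identifiers (`F1`, `F2`, `M1`, `M2`) and the tuple layout are hypothetical and are not the naming scheme used in the database.

```python
from itertools import product

# Figures taken from the abstract.
NUM_SENTENCES = 100
SESSIONS = (1, 2)  # assumption: one repetition of each sentence per session

# Hypothetical speaker identifiers (two female, two male);
# the actual database may label speakers differently.
SPEAKERS = ("F1", "F2", "M1", "M2")


def enumerate_recordings():
    """Return one (speaker, session, sentence_number) tuple per recording."""
    sentences = range(1, NUM_SENTENCES + 1)
    return list(product(SPEAKERS, SESSIONS, sentences))


recordings = enumerate_recordings()
per_speaker = sum(1 for spk, _, _ in recordings if spk == "F1")

print(len(recordings))  # total number of recordings
print(per_speaker)      # recordings contributed by a single speaker
```

An enumeration of this kind can serve as a checklist when verifying that a local copy of the corpus is complete, once the tuples are mapped to the real file names.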

The LaMIT database: a read speech corpus for acoustic studies of the Italian language toward lexical access based on the detection of landmarks and other acoustic cues to features / Di Benedetto, Maria-Gabriella; Shattuck-Hufnagel, Stefanie; Choi, Jeung-Yoon; De Nardis, Luca; Arango, Javier; Chan, Ian; Decaprio, Alec; Budoni, Sara. - In: DATA IN BRIEF. - ISSN 2352-3409. - 42:(2022). [10.1016/j.dib.2022.108275]

The LaMIT database: a read speech corpus for acoustic studies of the Italian language toward lexical access based on the detection of landmarks and other acoustic cues to features

Di Benedetto, Maria-Gabriella; De Nardis, Luca
2022

Speech processing; lexical gemination; syntactic gemination; Italian
01 Journal publication::01a Journal article
Files attached to this record
File: DeNardis_The-LaMIT-database_2022.pdf
Access: open access
Notes: Full article
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 1.01 MB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1640075