EUROM1 collection

Collection eurom-000741

CAMPIONE, E.; VERONIS, J. A Multilingual Prosodic Database. Proceedings of ICSLP, 1998.
We present a prosodic corpus in five languages (French, English, Italian, German and Spanish) comprising 4 hours and 20 minutes of speech and involving 50 different speakers (5 male and 5 female per language). The recordings on which the corpus is based are extracted from the EUROM 1 speech database and consist of passages of about five sentences. The corpus was stylized automatically by an algorithm which factors out microprosodic effects and represents the intonation contour of utterances by a series of target points. Once interpolated by a smooth curve (spline), these points produce a contour indistinguishable from the original when re-synthesized, apart from a few detection errors. A symbolic coding of the 50,000 pitch movements of the corpus is also provided, along with the time-alignment of the orthographic transcription to the signal at word level. The entire corpus was verified and manually corrected by experts for each language. It will be made available at production cost for research through the European Language Resources Association (ELRA).
CHAN, D.; FOURCIN, A.; GIBBON, D.; et al. EUROM - A Spoken Language Resource For The EU. ESCA. EUROSPEECH'95. 4th European Conference on Speech Communication and Technology. Madrid, September 1995. ISSN 1018-4074.
A summary is given of the progress of development and the current realisation of a CD-ROM-based spoken language resource for 11 languages of the European Union; the physical conditions basic to its acquisition are defined, and the criteria guiding its poly-language structures are briefly outlined.