C-ORAL-ROM: corpora

Corpora
Contents

The corpora which are the basis for sampling within the C-ORAL-ROM Project are:

Corpora of Spontaneous Spoken Italian LABLITA
(since the beginning of the 70’s)

Spoken French Corpus
(GARS/DELIC Corpus, since 1978)

Corpus of spoken Portuguese FUL.CLUL
(since the beginning of the 70's)

The UAM corpus of spoken Spanish
(under development since 1991)

A multilingual corpus of man-machine dialogues
(under development since 1991)