Corpora

Contents

The corpora which are the basis for sampling within the C-ORAL-ROM Project are:

  1. Corpora of Spontaneous Spoken Italian LABLITA
    (since the beginning of the 70’s)
  2. Spoken French Corpus
    (GARS/DELIC Corpus, since 1978)
  3. Corpus of spoken Portuguese FUL.CLUL
    (since the beginning of the 70's)
  4. The UAM corpus of spoken Spanish
    (under development since 1991)
  5. A multilingual corpus of man-machine dialogues
    (under development since 1991)