W0024 : PAROLE Portuguese CorpusThe parole Portuguese corpus contains approximately 3 million running words of European Portuguese distributed by Medium, as follows:
The corpus was classified and encoded according to the common core parole encoding standard. The file format of this corpus is SGML. A subcorpus of the PAROLE Portuguese Corpus, which reproduces approximately the whole Corpus distribution by Medium (Newspaper: about 65%, Book: ab. 20%, Periodical: ab. 5%, Miscellaneous: ab. 10%) is also available. It has about 250,000 words morpho-syntactically tagged accordingly to the parole common tagset and morpho-syntactic annotation standards. Disambiguation was manually checked. Click here to view the prices and browse other ressources belonging to this category |