Summary of the paper

Title Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata
Authors Claude Audebert, Christian Gaubert and André Jaccarini
Abstract We present scenarios showing the interactive construction of operators. Three grammars and their progressive refinement through the "feedback" method are given as examples: a kernel of grammars for retrieving quotations, a grammar reflecting a set of common syntactic operations, and a grammar dealing with morphology. They are designed as finite-state automata, some of them made deterministic for better performance, using the Sarfiyya software, which was developed for this purpose and supports many operations on FST. Purely algorithmic, this approach uses minimal resources, is largely independent of lexicons, gives tool words a prominent place, and bases parsing on surface structures. On the theoretical level, it aims to highlight the specificity of the Arabic language, whose high level of grammaticalization makes it possible to work without a lexicon (as a limiting case). This work is thus of interest to the linguist looking for the right balance between lexicon and grammar, as well as to the specialist in cognitive science (duality between data and programs). On the practical level, it aims to establish a coherent methodology for creating multipurpose search operators.
Topics Exploitation of LRs in different types of applications (information extraction, information retrieval, speech dictation, translation, summarisation, web services, semantic web, etc.),
Evaluation methodologies, protocols and measures,
Taggers and Parsers
Full paper Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata
Bibtex @InProceedings{AUDEBERT09.37,
  author = {Claude Audebert and Christian Gaubert and André Jaccarini},
  title = {Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata},
  booktitle = {Proceedings of the Second International Conference on Arabic Language Resources and Tools},
  year = {2009},
  month = {April},
  date = {22-23},
  address = {Cairo, Egypt},
  editor = {Khalid Choukri and Bente Maegaard},
  publisher = {The MEDAR Consortium},
  isbn = {2-9517408-5-9},
  language = {english}
  }
