User Tools

Site Tools

Agence Nationale de la Recherche

wp2

This is an old revision of the document!


Work Package 2: MWE Lexicon

  • Partners in charge: LI (Agata Savary) and ATILF (Mathieu Constant)
  • Partners involved: LI, LIF, ATILF, LIGM
  • Objectives: Build a unified and enriched MWE lexicons, including morphological, distributional, syntactic and semantic information. Multiword NEs will get special treatment as they will be associated with pragmatic information (i.e. linking with the LOD). The encoded features will be of varying nature - either symbolic or numeric.
  • Final products:
    • FP.2.1: A new lexical resource, distributed under an open license, in a standard format,
    • FP.2.2: A tool to project an MWE lexicon on treebanks
  • Subtasks:
    • WP 2.1: Compilation and analysis of existing lexicons
    • WP 2.2: Construction of a unified framework;
    • WP 2.3: Enrichment of the lexicon
    • WP 2.4: Interlinking of MWEs with the Linked Open Data
    • WP 2.5: Converting the lexicon to a standard export format
    • WP 2.6: Projection on treebanks

Preparatory work

Before the actual construction of the unified MWE lexicon, some preliminary studies have been performed:

  • a state-of-the-art of the different formats of MWE lexicons by Agata Savary in the framework of the PARSEME COST Action.
  • experiments for extracting linguistic information from various existing MWE lexicons (training period at LIGM in 2016 by Manolo Iborra, supervised by Mathieu Constant)
  • inventory and documentation of the properties in the lexicon-grammar tables of frozen expressions, as well as selection of lexical entries based on WP1 criteria (training period at LIGM, by Fabrice Beltran, supervised by Eric Laporte).

Work in progress

wp2.1505747784.txt.gz · Last modified: 2017/09/18 17:16 by matthieu.constant

Page Tools