Agence Nationale de la Recherche

wp1

Differences

This shows you the differences between two versions of the page.

wp1 [2016/02/17 19:13]
agata.savary
wp1 [2017/09/18 15:46]
matthieu.constant some words on WP1
Line 1: Line 1:
 __Work Package 1__: **MWE representation and annotation** __Work Package 1__: **MWE representation and annotation**
-  * **Partners in charge**: ALPAGE (Marie Candito) and LIGM (Mathieu Constant) +  * **Partners in charge**: LLF (Marie Candito) and ATILF (Mathieu Constant) 
-  * **Partners involved**: ALPAGE, LI, LIF, LIFO, LIGM +  * **Partners involved**: LLF, LI, LIF, LIFO, ATILF 
-  * **Objectives**: select the set of criteria to be used in the project for MWE identification,  classification, properties, produce a gold standard corpus.+  * **Objectives**: Select the set of criteria to be used in the project for MWE identification,  classification, properties. Produce a gold standard corpus.
   * **Final products**:    * **Final products**: 
-    * **FP.1.1**: state-of-the-art report on MWE representation, +    * **FP.1.1**: state-of-the-art report on MWE representation, 
-    * **FP.1.2**: guidelines indicating the criteria to identify and classify MWEs, as well as the list of properties to be encoded in the lexicon and an annotation scheme +    * **FP.1.2**: Guidelines indicating the criteria to identify and classify MWEs, as well as the list of properties to be encoded in the lexicon and an annotation scheme 
-    * **FP.1.3**: gold standard corpus manually annotated by experts, including deep MWE annotation, together with the annotation guidelines +    * **FP.1.3**: gold standard corpus manually annotated by experts, including deep MWE annotation, together with the annotation guidelines
   * **Subtasks**:    * **Subtasks**: 
     * **WP 1.1**: State-of-the-art on MWE in language resources     * **WP 1.1**: State-of-the-art on MWE in language resources
     * **WP 1.2**: Setup of formal criteria for MWE identification and classification      * **WP 1.2**: Setup of formal criteria for MWE identification and classification 
     * **WP 1.3**: A gold standard     * **WP 1.3**: A gold standard
 +
 +----
 +**Results**
 +
 +In the framework of the [[https://typo.uni-konstanz.de/parseme/index.php/2-general/142-parseme-shared-task-on-automatic-detection-of-verbal-mwes|PARSEME Shared Task on identification of verbal MWEs]], Agata Savary, Carlos Ramisch and Marie Candito contributed to writing the annotation guidelines (Savary et al. MWE 2017). Marie Candito, Mathieu Constant, Carlos Ramisch, Agata Savary, Yannick Parmentier, Caroline Pasquer and Jean-Yves Antoine produced the French dataset (Candito et al. TALN 2017). This dataset, composed of the Sequoia corpus and the French UD treebank (about 19,000 sentences), includes 5,000 annotated verbal MWEs.
 +
 +
 +**Work in progress**
 +
 +The annotation of the Sequoia corpus is now being extended to all MWEs, using annotation guidelines that are still under development. The release of the data is planned for the end of 2017.
wp1.txt · Last modified: 2017/09/18 15:46 by matthieu.constant