Agence Nationale de la Recherche

wp1

Differences

This shows you the differences between two versions of the page.

wp1 [2016/02/17 19:14] agata.savary
wp1 [2017/09/18 15:46] (current) matthieu.constant: some words on WP1
Line 1: Line 1:
 __Work Package 1__: **MWE representation and annotation**
-  * **Partners in charge**: ALPAGE (Marie Candito) and LIGM (Mathieu Constant)
+  * **Partners in charge**: LLF (Marie Candito) and ATILF (Mathieu Constant)
-  * **Partners involved**: ALPAGE, LI, LIF, LIFO, LIGM
+  * **Partners involved**: LLF, LI, LIF, LIFO, ATILF
   * **Objectives**: Select the set of criteria to be used in the project for MWE identification, classification, properties. Produce a gold standard corpus.
   * **Final products**:
Line 11: Line 11:
     * **WP 1.2**: Setup of formal criteria for MWE identification and classification
     * **WP 1.3**: A gold standard
 +
 +----
 +**Results**
 +
 +In the framework of the [[https://typo.uni-konstanz.de/parseme/index.php/2-general/142-parseme-shared-task-on-automatic-detection-of-verbal-mwes|PARSEME Shared Task on identification of verbal MWEs]], Agata Savary, Carlos Ramisch and Marie Candito helped write the annotation guidelines (Savary et al. MWE 2017). Marie Candito, Mathieu Constant, Carlos Ramisch, Agata Savary, Yannick Parmentier, Caroline Pasquer and Jean-Yves Antoine produced the French dataset (Candito et al. TALN 2017). This dataset, composed of the Sequoia corpus and the French UD treebank (about 19,000 sentences), includes 5,000 annotated verbal MWEs.
 +
 +
 +**Work in progress**
 +
 +The annotation of the Sequoia corpus is now being extended to all MWEs, using annotation guidelines that are still being written. The release of the data is planned for the end of 2017.
wp1.txt · Last modified: 2017/09/18 15:46 by matthieu.constant