User Tools

Site Tools

Agence Nationale de la Recherche

outcomes

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
outcomes [2019/11/05 12:59]
matthieu.constant [Full-MWE annotated Sequoia treebank]
outcomes [2021/01/27 15:16]
carlos
Line 30: Line 30:
    * Corpus edition 1.0 (2017) on [[http://hdl.handle.net/11372/LRT-2282|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2017/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.0, 2017). [[https://www.ortolang.fr/market/corpora/parseme-fr|You can also download the French data only from the ORTOLANG platform]].     * Corpus edition 1.0 (2017) on [[http://hdl.handle.net/11372/LRT-2282|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2017/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.0, 2017). [[https://www.ortolang.fr/market/corpora/parseme-fr|You can also download the French data only from the ORTOLANG platform]]. 
    * Corpus edition 1.1 (2018) on [[http://hdl.handle.net/11372/LRT-2842|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2018/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.1, 2018).    * Corpus edition 1.1 (2018) on [[http://hdl.handle.net/11372/LRT-2842|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2018/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.1, 2018).
-   * [[http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/|Annotation guidelines (editons 1.0 and 1.1)]]+   * Corpus edition 1.2 (2020) on [[https://gitlab.com/parseme/sharedtask-data/tree/master/1.2|gitlab (temporary)]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2020/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.2, 2020). 
 +   * [[http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/|Annotation guidelines]]
  
  
 ==== Full-MWE annotated Sequoia treebank ==== ==== Full-MWE annotated Sequoia treebank ====
  
-   * Released as part of [[https://deep-sequoia.inria.fr/|Deep sequoia 9.0]]+   * Released on ]]http://hdl.handle.net/11234/1-3429|LINDAT/CLARIN]] as part of the [[https://deep-sequoia.inria.fr/|Deep sequoia corpus]]
    * [[https://gitlab.lis-lab.fr/PARSEME-FR/PARSEME-FR-public/wikis/Guide-annotation-PARSEME_FR-chapeau|Annotation guidelines]]    * [[https://gitlab.lis-lab.fr/PARSEME-FR/PARSEME-FR-public/wikis/Guide-annotation-PARSEME_FR-chapeau|Annotation guidelines]]
 +
 +==== Manually annotated web sample ====
 +
 +   * [[https://gitlab.com/cpasquer/websample|4618 sentences]] with positive and negative examples of 90 selected verbal MWEs in French. The sentences stem from Wikipedia and webcrawling and were taken from the [[http://hdl.handle.net/11234/1-1989|CoNLL shared task corpus]].
 +   * Paper describing the construction of this dataset (see Section 6.3) - TBA
  
 ==== Project-internal resources ==== ==== Project-internal resources ====
  
    * For project members: [[https://gitlab.lif.univ-mrs.fr/PARSEME-FR/PARSEME-FR|PARSEME-FR GitLab]]    * For project members: [[https://gitlab.lif.univ-mrs.fr/PARSEME-FR/PARSEME-FR|PARSEME-FR GitLab]]
outcomes.txt · Last modified: 2021/12/16 17:14 by agata