Agence Nationale de la Recherche

outcomes

Differences

This shows you the differences between two versions of the page.

outcomes [2019/03/07 17:59]
agata.savary [Other software]
outcomes [2021/12/16 17:14] (current)
agata
Line 1: Line 1:
 ====== Project outcomes ======
  
-The goal of our project is to develop linguistic resources (lexicons, corpora, annotation guidelines) and software (parsers, MWE identifiers and linkers). They are currently under development and will be published here when they are ready.
 +The goal of our project is to develop linguistic resources (lexicons, corpora, annotation guidelines) and software (parsers, MWE identifiers and linkers). Some of them are currently under development and will be published here when they are ready.
  
 ===== Software =====
Line 7: Line 7:
 ==== MWE identification software ====
  
-   * Transition
-   * VarIDE
-   * Veyn
-   * LGTagger (?)
-   * mwetoolkit (?)
 +Tools which annotate multiword expressions automatically in running text, developed within the project or in close collaboration with PARSEME-FR project members.
  
-==== Other software ====
 +   * [[https://github.com/hazemalsaied/IdenSys|ATILF-LLF system]] ("Transition" in shared task 2017)
 +   * [[https://gitlab.com/cpasquer/SharedTask2018_varIDE|VarIDE]] 
 +   * [[https://github.com/zamp13/Veyn|Veyn]] 
 +   * [[http://igm.univ-mlv.fr/~mconstan/index.php?section=software|LGTagger]] 
 +   * [[http://mwetoolkit.sourceforge.net/|mwetoolkit]] 
 +   * [[https://gitlab.com/cpasquer/st_2020|Seen2Seen and Seen2Unseen]] (cf. PARSEME shared task 1.2)
  
-   PARSEME shared task tools (?)
-   * Lexicon tools (?)
-   [[https://mwedemonstrator.atilf.fr/|Demonstrator]] (contains MWE identifiers and a corpus-lexicon browser)
 +Some of these tools can be tested online on the PARSEME-FR [[https://mwedemonstrator.atilf.fr/|demonstrator]]
  
 +==== Other software ====
 +
 +   * [[https://mwedemonstrator.atilf.fr/|Demonstrator]] of MWE identifiers and a corpus-lexicon browser
 +   * On-line PARSEME-FR corpus browser on [[http://match.grew.fr/?corpus=PARSEME-FR@1.2|Grew-match]]
 +   * [[https://gitlab.com/parseme/utilities/|PARSEME shared task utilities]]
 +   
 ===== Language resources and datasets =====
  
Line 27: Line 32:
    * Corpus edition 1.0 (2017) on [[http://hdl.handle.net/11372/LRT-2282|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2017/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.0, 2017). [[https://www.ortolang.fr/market/corpora/parseme-fr|You can also download the French data only from the ORTOLANG platform]].
    * Corpus edition 1.1 (2018) on [[http://hdl.handle.net/11372/LRT-2842|LINDAT/CLARIN]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2018/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.1, 2018).
-   * [[http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/|Annotation guidelines (editons 1.0 and 1.1)]]
 +   * Corpus edition 1.2 (2020) on [[https://gitlab.com/parseme/sharedtask-data/tree/master/1.2|gitlab (temporary)]]. The French dataset was used in the [[http://multiword.sourceforge.net/sharedtask2020/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.2, 2020).
 +   * [[http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/|Annotation guidelines]]
  
  
 ==== Full-MWE annotated Sequoia treebank ====
  
-Coming soon...
 +   * Released on [[http://hdl.handle.net/11234/1-3429|LINDAT/CLARIN]] as part of the [[https://deep-sequoia.inria.fr/|Deep sequoia corpus]]
 +   * [[https://gitlab.lis-lab.fr/PARSEME-FR/PARSEME-FR-public/wikis/Guide-annotation-PARSEME_FR-chapeau|Annotation guidelines]] 
 + 
 +==== MWE and coreference corpus ==== 
 + 
 +   * [[https://github.com/anaelle-p/MWE_coref|French corpus]] annotated both for MWEs and for coreference - data, annotation tools, manual validation results, statistics and reports. 
 + 
 +==== Manually annotated web sample ==== 
 + 
 +   * [[https://gitlab.com/cpasquer/websample|4618 sentences]] with positive and negative examples of 90 selected verbal MWEs in French. The sentences stem from Wikipedia and webcrawling and were taken from the [[http://hdl.handle.net/11234/1-1989|CoNLL shared task corpus]]. 
 +   * [[https://arxiv.org/pdf/2007.11381.pdf|Paper]] describing the construction of this dataset 
 + 
 +==== Multilingual corpus of literal occurrences of multiword expressions ==== 
 + 
 +   * [[http://hdl.handle.net/11372/LRT-2966|Corpus]] in Basque, German, Greek, Polish and Portuguese  
 +   * [[https://ufal.mff.cuni.cz/pbml/112/art-savary-et-al.pdf|Paper]] describing the construction and study of the corpus 
 + 
 +==== French metagrammar with verbal MWEs ==== 
 +   * [[https://gitlab.com/agata.savary/mwe-xmg/|FrenchTAG metagrammar]] with encoded verbal MWEs 
 +   * [[https://lexique.univ-lille.fr/object-oriented-lexical-encoding-of-multiword-expressions-short-and-sweet.html|Paper]] describing the encoding process and the evaluation 
  
 ==== Project-internal resources ====
  
-   *For project members*: [[https://gitlab.lif.univ-mrs.fr/PARSEME-FR/PARSEME-FR|PARSEME-FR GitLab]]
 +   * For project members: [[https://gitlab.lif.univ-mrs.fr/PARSEME-FR/PARSEME-FR|PARSEME-FR GitLab]]
outcomes.1551977953.txt.gz · Last modified: 2019/03/07 17:59 by agata.savary