This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
outcomes [2018/10/25 12:53] carlos.ramisch created |
outcomes [2021/12/16 17:14] (current) agata |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== | + | ====== |
- | The goal of our project is to develop linguistic resources (lexicons, corpora, annotation guidelines) and software (parsers, MWE identifiers and linkers). | + | The goal of our project is to develop linguistic resources (lexicons, corpora, annotation guidelines) and software (parsers, MWE identifiers and linkers). |
+ | ===== Software ===== | ||
- | ===== MWE-annotated corpus ===== | + | ==== MWE identification software |
+ | Tools which annotate multiword expressions automatically in running text, developed within the project or in close collaboration with PARSEME-FR project members. | ||
- | The first release of our MWE-annotated corpus corresponds to the French dataset of the [[http://multiword.sourceforge.net/sharedtask2017/|PARSEME Shared Task on identification of verbal multiword expressions]] (edition 1.0). [[https://www.ortolang.fr/market/corpora/parseme-fr|You can freely download it here from the ORTOLANG platform]]. | + | |
+ | * [[https:// | ||
+ | | ||
+ | * [[http://igm.univ-mlv.fr/ | ||
+ | * [[http:// | ||
+ | * [[https:// | ||
- | The full dataset | + | Some of these tools can be tested online on the PARSEME-FR |
+ | ==== Other software ==== | ||
+ | * [[https:// | ||
+ | * On-line PARSEME-FR corpus browser on [[http:// | ||
+ | * [[https:// | ||
+ | |||
+ | ===== Language resources and datasets ===== | ||
- | *For project members*: [[https:// | + | ==== Verbal MWE-annotated corpora of the PARSEME shared tasks ==== |
+ | |||
+ | The datasets of the PARSEME shared task contain 18-20 languages, including French, and can be downloaded from: | ||
+ | |||
+ | * Corpus edition 1.0 (2017) on [[http:// | ||
+ | * Corpus edition 1.1 (2018) on [[http:// | ||
+ | * Corpus edition 1.2 (2020) on [[https:// | ||
+ | * [[http:// | ||
+ | |||
+ | |||
+ | ==== Full-MWE annotated Sequoia treebank ==== | ||
+ | |||
+ | * Released on [[http:// | ||
+ | * [[https:// | ||
+ | |||
+ | ==== MWE and coreference corpus ==== | ||
+ | |||
+ | * [[https:// | ||
+ | |||
+ | ==== Manually annotated web sample ==== | ||
+ | |||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | |||
+ | ==== Multilingual corpus of literal occurrences of multiword expressions ==== | ||
+ | |||
+ | * [[http:// | ||
+ | * [[https:// | ||
+ | |||
+ | ==== French metagrammar with verbal MWEs ==== | ||
+ | * [[https:// | ||
+ | * [[https:// | ||
+ | |||
+ | |||
+ | ==== Project-internal resources ==== | ||
+ | |||
+ | * For project members: [[https:// |