This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | |||
job-2017-lif-postdoc [2017/09/10 14:43] carlos.ramisch |
job-2017-lif-postdoc [2017/09/15 10:31] (current) carlos.ramisch |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | Aix Marseille University offers a 2-years post-doc position in computational linguistics. | ||
- | |||
===== Identification of multiword expressions off the beaten path : towards MWE-aware semantic parsing ===== | ===== Identification of multiword expressions off the beaten path : towards MWE-aware semantic parsing ===== | ||
Line 11: | Line 9: | ||
* **Supervisors**: | * **Supervisors**: | ||
* **Duration: | * **Duration: | ||
- | * **Remuneration: | + | * **Remuneration: |
* **Funding: | * **Funding: | ||
* **Keywords**: | * **Keywords**: | ||
Line 18: | Line 16: | ||
==== Context ==== | ==== Context ==== | ||
- | We are hiring a post-doc researcher in the domain of natural language processing. This position is funded by the [[http:// | + | We are hiring a post-doc researcher in the domain of natural language processing. This position is funded by the [[http:// |
+ | |||
+ | The post-doc will be supervised by Carlos Ramisch and Alexis Nasr (Aix Marseille University) and will become a member of the TALEP team of LIF, specialised in computational linguistics. | ||
+ | |||
+ | The metropolitan area of Aix-Marseilles is a the second largest in France. It offers a vibrant environment conveniently situated on the south coast of France, in the Provence-Alpes-Côte d'Azur region. Aix-Marseilles is a cosmopolite and well connected urban area with Mediterranean climate and surrounded by stunning landscapes such as the Calanques natural park and the Provence region. | ||
+ | |||
+ | Aix Marseille University is the largest university in France and of the francophone world by its number of students and staff. It provides a lively and diverse research environment which attracts scientists carrying out research in many areas, including computational linguistics, | ||
- | The post-doc will be supervised by Carlos Ramisch and Alexis Nasr (Aix-Marseille University) and will become a member of the TALEP team, specialised in computational linguistics. The TALEP team is part of LIF, a computer science lab affiliated to CNRS and Aix Marseille University, located on the Luminy campus in Marseilles, the second-largest city in France. Marseilles is conveniently situated on the south coast of France, in the Provence-Alpes-Côte d'Azur region. It is a cosmopolite and well connected city with Mediterranean climate and surrounded by stunning landscapes such as the Calanques natural park, where the Luminy campus is located. | ||
-------------------------- | -------------------------- | ||
Line 27: | Line 30: | ||
One of the main goals of natural language processing (NLP) systems is to automatically find the underlying structure in running text. Many tools and techniques for **text analysis** have been developed to transform sequences of characters into increasingly abstract representations. This process is usually based on a pipeline of modules which subsequently perform operations such as text segmentation, | One of the main goals of natural language processing (NLP) systems is to automatically find the underlying structure in running text. Many tools and techniques for **text analysis** have been developed to transform sequences of characters into increasingly abstract representations. This process is usually based on a pipeline of modules which subsequently perform operations such as text segmentation, | ||
- | Multiword expressions (MWEs) such as //French fries, take a break, do one's best// and //spill the beans// often pose problems for text analysis chains because of their idiosyncratic nature (Baldwin and Kim 2009, Sag et al. 2001). Among their notably challenging characteristics, | + | Multiword expressions (MWEs) such as //French fries, take a break, do one's best// and //spill the beans// often pose problems for text analysis chains because of their idiosyncratic nature (Baldwin and Kim 2009, Sag et al. 2001). Among their notably challenging characteristics, |
Considerable progress has been made in the last years to understand and model the interactions between MWE identification and syntactic parsing. It has been shown that the automatic discovery of new MWEs can greatly benefit from parsed data (Seretan, 2008). The use of sequence models, such as CRFs and structured perceptrons, | Considerable progress has been made in the last years to understand and model the interactions between MWE identification and syntactic parsing. It has been shown that the automatic discovery of new MWEs can greatly benefit from parsed data (Seretan, 2008). The use of sequence models, such as CRFs and structured perceptrons, | ||
Line 37: | Line 40: | ||
The first phase will consist in adapting existing sequence models to tag syntactic trees instead of flat word sequences. The first experiments will focus on verbal MWEs using the corpora of the [[http:// | The first phase will consist in adapting existing sequence models to tag syntactic trees instead of flat word sequences. The first experiments will focus on verbal MWEs using the corpora of the [[http:// | ||
- | The second phase will focus on the use of word embeddings and deep learning models to perform the classification | + | The second phase will focus on the use of word embeddings and deep learning models to perform the classification |
Once the MWE identification system is both precise and robust enough, the third phase consists in applying tree transformation operations on the resulting tagged trees so that they become closer to semantic predicate-argument structures. Verbal MWEs are particularly relevant here, since phenomena such as light-verb constructions, | Once the MWE identification system is both precise and robust enough, the third phase consists in applying tree transformation operations on the resulting tagged trees so that they become closer to semantic predicate-argument structures. Verbal MWEs are particularly relevant here, since phenomena such as light-verb constructions, | ||
Line 48: | Line 51: | ||
* Interest in linguistics and familiarity with language technology | * Interest in linguistics and familiarity with language technology | ||
* Capacity to work independently and as part of a team | * Capacity to work independently and as part of a team | ||
+ | |||
------------------------- | ------------------------- | ||
Line 53: | Line 57: | ||
* **Application deadline: October 15, 2017 (or until fulfilled)** | * **Application deadline: October 15, 2017 (or until fulfilled)** | ||
- | * Position starts: January 2018 (or earlier if possible) | + | * Position starts: |
* Duration: 2 years, that is, 1 year renewable once | * Duration: 2 years, that is, 1 year renewable once | ||
+ | |||
--------------------- | --------------------- | ||
==== Application ==== | ==== Application ==== | ||
- | Candidates should send the following documents as a single attached document named **LASTNAME-Firstname.pdf**, | + | **Applications should be sent before October 15, 2017**. |
* a CV, including a list of publications | * a CV, including a list of publications | ||
- | * a cover letter explaining how this position | + | * a cover letter explaining how the offer matches your interests and experience |
- | * and the names and emails of 2 referees to be contacted | + | * a copy of their PhD degree or a document indicating the expected defense date |
+ | * and the names and emails of 1 or 2 referees to be contacted | ||
+ | |||
+ | Candidates applying to both positions should indicate and motivate this in their cover letter, but send a single application file. | ||
-------------------- | -------------------- | ||
Line 77: | Line 86: | ||
* [[ http:// | * [[ http:// | ||
* [[ http:// | * [[ http:// | ||
- | |||
- | |||