lipn

Laboratoire d'Informatique de Paris Nord

UMR 7030, Université Paris 13, 99 avenue Jean-Baptiste Clément, 93430 Villetaneuse

up13 cnrs

Publications RCLN

In the course of update...


Chapitres de livres

[1] Etude de la pénétration des anglicismes de type N ou ADJ(-)Ving \`a partir d'un corpus contemporain journalistique : les exemples de bashing et shaming en fran\c cais contemporain
Cartier, Emmanuel and Julie, Viaux
EMPRUNTS NEOLOGIQUES ET EQUIVALENTS AUTOCHTONES. MESURE DE LEURS CIRCULATIONS RESPECTIVES, Folia Litteraria Romanica, 2017

Edition d'ouvrages collectifs

[2] Grammar of genres and styles: New Approaches.
Dominique Legallois and Thierry Charnois and Meri Larjavaara
to appear, to appear, De Gruyter Mouton, Trends in Lingustics book series, 2017

Articles dans des revues internationales avec comité de lecture

[3] Event-Based Knowledge Reconciliation using Frame Embeddings and Frame Similarity.
Mehwish Alam, Diego Reforgiato Recupero, Misael Mongiovi, Aldo Gangemi, Petar Ristoski.
Knowledge Based Systems, 2017

[4] Prefix-projection global constraint and top-k approach for sequential pattern mining
Kemmar, Amina and Lebbah, Yahia and Loudni, Samir and Boizumault, Patrice and Charnois, Thierry
2, Sequential pattern mining (SPM) is an important data mining problem with broad applications. SPM is a hard problem due to the huge number of intermediate subsequences to be considered. State of the art approaches for SPM (e.g., PrefixSpan Pei et al. 2001) are largely based on the pattern-growth approach, where for each frequent prefix subsequence, only its related suffix subsequences need to be considered, and the database is recursively projected into smaller ones. Many authors have promoted the use of constraints to focus on the most promising patterns according to the interests of the end user. The top-k SPM problem is also used to cope with the difficulty of thresholding and to control the number of solutions. State of the art methods developed for SPM and top-k SPM, though efficient, are locked into a rather rigid search strategy, and suffer from the lack of declarativity and flexibility. Indeed, adding new constraints usually amounts to changing the data-structures used in the core of the algorithm, and combining these new constraints often require new developments. Recent works (e.g. Kemmar et al. 2014; Négrevergne and Guns 2015) have investigated the use of Constraint Programming (CP) for SPM. However, despite their nice declarative aspects, all these modelings have scaling problems, due to the huge size of their constraint networks. To address this issue, we propose the Prefix-Projection global constraint, which encapsulates both the subsequence relation as well as the frequency constraint. Its filtering algorithm relies on the principle of projected databases which allows to keep in the variables domain, only values leading to a frequent pattern in the database. Prefix-Projection filtering algorithm enforces domain consistency on the variable succeeding the current frequent prefix in polynomial time. This global constraint also allows for a straightforward implementation of additional constraints such as size, item membership, regular expressions and any combination of them. Experimental results show that our approach clearly outperforms existing CP approaches and competes well with the state-of-the-art methods on large datasets for mining frequent sequential patterns, sequential patterns under various constraints, and top-k sequential patterns. Unlike existing CP methods, our approach achieves a better scalability., 265--306 , 22, 10.1007/s10601-016-9252-z, Constraints, Avril 2017

Communications dans des conférences internationales avec comité de lecture

[5] Néoveille, a Web Platform for Neologism Tracking
Cartier, Emmanuel and Gabor, Kata and Lejeune, Gaël and Charnois, Thierry
13, Supplementary Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2017), Avril 2017

[6] Néoveille, a Web Platform for Neologism Tracking : the Semantic Neologisms Module
Cartier, Emmanuel
XX-XX, Proceedings of the 5th Electronic Lexicography in the 21st Century Conference, Leiden, 19-21 sept. 2017), 2017

[7] Sémantique lexicale et distributionnalisme : éléments pour le repérage automatique du sens en corpus
Cartier, Emmanuel
XX-XX, Proceedings of the 7th Représentations du sens linguistique, Sherbrooke, 25-27 oct. 2017), 2017

[8] Semantic Change Tracking Through the Prism of Distributionnalism and Construction Grammars : an experiment in Contemporary French
Cartier, Emmanuel
XX-XX, Proceedings of the 14th International Cognitive Linguistics Conference, Tartu, Estonia, 10-14 July 2017), 2017

pres

w3c-xhtml