Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns - Sorbonne Université Accéder directement au contenu
Article Dans Une Revue International Journal of Computational Linguistics and Applications Année : 2015

Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns

Résumé

In this contribution, we present a computational stylistic study of the French classic literature texts based on a data-driven approach where discovering interesting linguistic patterns is done without any prior knowledge. We propose an objective interestingness measure to extract meaningful stylistic syntactic patterns from a given author’s work. Our hypothesis is based on the fact that the most characterising linguistic patterns should significantly reflect the author’s stylistic choice in that the positions of theirs occurrences are controlled by the author’s purpose, while the irrelevant linguistic patterns are distributed randomly in the text. Since it does not rely on the counts of occurrences of the syntactic patterns in texts, this measure can work reasonably well with both large and small text samples. The analysed results show the effectiveness in extracting interesting syntactic patterns from a single text, and this seems particularly promising for the analyses of such texts that, for their characteristics or for historical reasons, cannot support a comparative study.
Fichier non déposé

Dates et versions

hal-01396361 , version 1 (14-11-2016)

Identifiants

  • HAL Id : hal-01396361 , version 1

Citer

Mohamed Amine Boukhaled, Francesca Frontini, Gauvain Bourgne, Jean-Gabriel Ganascia. Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns. International Journal of Computational Linguistics and Applications, 2015, 6 (1). ⟨hal-01396361⟩
108 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More