Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns

In this contribution, we present a computational stylistic study of the French classic literature texts based on a data-driven approach where discovering interesting linguistic patterns is done without any prior knowledge. We propose an objective interestingness measure to extract meaningful stylistic syntactic patterns from a given author’s work. Our hypothesis is based on the fact that the most characterising linguistic patterns should significantly reflect the author’s stylistic choice in that the positions of theirs occurrences are controlled by the author’s purpose, while the irrelevant linguistic patterns are distributed randomly in the text. Since it does not rely on the counts of occurrences of the syntactic patterns in texts, this measure can work reasonably well with both large and small text samples. The analysed results show the effectiveness in extracting interesting syntactic patterns from a single text, and this seems particularly promising for the analyses of such texts that, for their characteristics or for historical reasons, cannot support a comparative study.

Mots clés

Computational stylistics interestingness measure sequential pattern mining syntactic style

Domaines

Traitement du texte et du document Intelligence artificielle [cs.AI] Apprentissage [cs.LG] Machine Learning [stat.ML]

Mohamed Amine Boukhaled : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-01396361

Soumis le : lundi 14 novembre 2016-13:09:14

Dernière modification le : vendredi 16 juin 2023-17:18:06

Dates et versions

hal-01396361 , version 1 (14-11-2016)

Identifiants

HAL Id : hal-01396361 , version 1

Citer

Mohamed Amine Boukhaled, Francesca Frontini, Gauvain Bourgne, Jean-Gabriel Ganascia. Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns. International Journal of Computational Linguistics and Applications, 2015, 6 (1). ⟨hal-01396361⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES ANR

143 Consultations

0 Téléchargements