Université Paris-Saclay (Bâtiment Bréguet, 3 Rue Joliot Curie 2e ét, 91190 Gif-sur-Yvette - France)
Abstract : In this paper, we present an unsupervised
pipeline approach for clustering news articles
based on identified event instances in
their content. We leverage press agency
newswire and monolingual word alignment
techniques to build meaningful and
linguistically varied clusters of articles
from the Web in the perspective of a
broader event type detection task. We validate
our approach on a manually annotated
corpus of Web articles.
https://hal-cea.archives-ouvertes.fr/cea-01857885 Contributor : Olivier FerretConnect in order to contact the contributor Submitted on : Friday, August 17, 2018 - 3:31:08 PM Last modification on : Saturday, June 25, 2022 - 10:32:58 PM