Affordable Access

Textual Article Clustering in Newspaper Pages

Authors
Publication Date
Keywords
  • Qa076.7 Programming Languages - Semantics
Disciplines
  • Computer Science

Abstract

In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and experimentation on actual pages from the Italian newspaper L’Adige, we select one of the algorithms as the preferred choice to solve the textual clustering problem.

There are no comments yet on this publication. Be the first to share your thoughts.