Affordable Access

Data Clustering Analysis - From Simple Groupings to Scalable Clustering With Constraints

University of Alberta
Publication Date
  • Data Clustering
  • Database Systems
  • Computer Science


Technical report TR02-03. Clustering is the problem of grouping data based on similarity and consists of maximizing the intra-group similarity while minimizing the iter-group similarity. While this problem has attracted the attention of many researchers for many years, we are witnessing a resurgence of interest in new clustering techniques in the data mining community. In this paper we discuss some very recent clustering approaches and recount our experience with some of these algorithms. We also present the problem of clustering in the presence of constraints and discuss the issue of cluster validation.

There are no comments yet on this publication. Be the first to share your thoughts.