Affordable Access

Data Clustering Analysis - From Simple Groupings to Scalable Clustering With Constraints

Authors
Publisher
University of Alberta
Publication Date
Keywords
  • Data Clustering
  • Database Systems
Disciplines
  • Computer Science

Abstract

Technical report TR02-03. Clustering is the problem of grouping data based on similarity and consists of maximizing the intra-group similarity while minimizing the iter-group similarity. While this problem has attracted the attention of many researchers for many years, we are witnessing a resurgence of interest in new clustering techniques in the data mining community. In this paper we discuss some very recent clustering approaches and recount our experience with some of these algorithms. We also present the problem of clustering in the presence of constraints and discuss the issue of cluster validation.

There are no comments yet on this publication. Be the first to share your thoughts.