Data distribution and task scheduling for distributed computing of all-to-all comparison problems

Source
Queensland University of Technology ePrints Archive

Abstract

This research studied distributed computing of all-to-all comparison problems on big data sets. The thesis formalised the problem and developed a high-performance, scalable computing framework comprising a programming model, data distribution strategies and task scheduling policies. To improve performance, the study considered storage usage, data locality and load balancing. The research outcomes can be applied in bioinformatics, biometrics, data mining and other domains in which all-to-all comparison is a typical computing pattern.
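
As a rough illustration of the computing pattern the abstract describes, the sketch below enumerates every pairwise comparison among n data items and assigns each comparison to a worker. It is not the framework or the strategies developed in the thesis: the function names `all_pairs` and `assign_pairs` and the capped greedy heuristic are assumptions made here only to show how storage usage, data locality and load balancing pull against each other.

```python
import math
from itertools import combinations


def all_pairs(n_items):
    """Return every unordered pair (i, j) with i < j: the comparison tasks."""
    return list(combinations(range(n_items), 2))


def assign_pairs(n_items, n_workers):
    """Assign each comparison task to one worker.

    Hypothetical heuristic for illustration only (not the thesis's method):
    - load balancing: no worker receives more than ceil(total / n_workers) tasks;
    - data locality and storage: among workers still below that cap, choose the
      one that needs the fewest new data items copied to it.
    """
    pairs = all_pairs(n_items)
    cap = math.ceil(len(pairs) / n_workers)
    stored = [set() for _ in range(n_workers)]  # data items held by each worker
    tasks = [[] for _ in range(n_workers)]      # comparison tasks per worker

    for i, j in pairs:
        # Only workers below the cap are eligible for another task.
        candidates = [w for w in range(n_workers) if len(tasks[w]) < cap]
        # Prefer fewest missing items, then lightest load, then lowest index.
        w = min(
            candidates,
            key=lambda w: (len({i, j} - stored[w]), len(tasks[w]), w),
        )
        stored[w].update((i, j))
        tasks[w].append((i, j))
    return stored, tasks


if __name__ == "__main__":
    stored, tasks = assign_pairs(n_items=6, n_workers=3)
    for w, (items, work) in enumerate(zip(stored, tasks)):
        print(f"worker {w}: stores {sorted(items)}, runs {len(work)} comparisons")
```

Each worker ends up holding only the items its own comparisons need, so the size of each `stored` set is a simple proxy for storage usage, while the cap keeps the number of comparisons per worker even.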
