Performing Initiative Data Prefetching in Distributed File Systems for Cloud Computing

Research output: Contribution to journal, Article, peer-review

Abstract

This paper presents an initiative data prefetching scheme that runs on the storage servers of distributed file systems for cloud computing. In this technique, client machines are largely uninvolved in prefetching: the storage servers analyze the history of disk I/O access events, prefetch the data directly, and then proactively send it to the relevant client machines. To make this work, information about client nodes is piggybacked onto real client I/O requests and forwarded to the relevant storage server. Two prediction algorithms are then proposed to forecast future block access operations and so determine which data the storage servers should fetch in advance. Finally, the prefetched data is pushed from the storage server to the relevant client machine. A series of evaluation experiments with a collection of application benchmarks demonstrates that the presented initiative prefetching technique helps distributed file systems in cloud environments achieve better I/O performance. In particular, client machines with limited configurations are not responsible for predicting I/O access operations, which contributes to better system performance on those machines.
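
The flow described in the abstract (piggybacked client information, server-side history analysis, prediction, proactive push) can be sketched roughly as follows. This is only an illustrative sketch, not the paper's implementation: the abstract does not specify the two prediction algorithms, so a simple constant-stride predictor stands in for them, and the names `IORequest`, `StorageServer`, `predict_next`, and `pushed` are hypothetical.

```python
from collections import defaultdict, deque
from dataclasses import dataclass


@dataclass(frozen=True)
class IORequest:
    """A client read request; client_id plays the role of the piggybacked client info."""
    client_id: str
    block: int


class StorageServer:
    """Toy storage server that predicts and pushes the next block per client."""

    def __init__(self, history_len=4):
        # Per-client sliding window of recently requested block numbers.
        self.history = defaultdict(lambda: deque(maxlen=history_len))
        # Stand-in for the network: (client_id, block) pairs pushed proactively.
        self.pushed = []

    def predict_next(self, client_id):
        """Return the predicted next block, or None if no constant stride is seen.

        Assumption: a constant-stride heuristic replaces the paper's
        (unspecified here) prediction algorithms.
        """
        blocks = list(self.history[client_id])
        if len(blocks) < 3:
            return None
        strides = {b - a for a, b in zip(blocks, blocks[1:])}
        if len(strides) != 1:
            return None
        stride = strides.pop()
        if stride == 0:
            return None
        return blocks[-1] + stride

    def handle(self, request):
        """Record the access, serve it, and push a predicted block if there is one."""
        self.history[request.client_id].append(request.block)
        predicted = self.predict_next(request.client_id)
        if predicted is not None:
            # The client does no prediction; the server pushes on its own initiative.
            self.pushed.append((request.client_id, predicted))
        return request.block


server = StorageServer()
for block in (10, 12, 14, 16):
    server.handle(IORequest("client-a", block))
print(server.pushed)
```

Here the client only issues ordinary requests; all history tracking and prediction stay on the server, which matches the division of labor the abstract describes.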

Original language: English
Article number: 7070699
Pages (from-to): 550-562
Number of pages: 13
Journal: IEEE Transactions on Cloud Computing
Volume: 5
Issue number: 3
Publication status: Published - 1 Jul 2017

Keywords

  • Mobile cloud computing
  • distributed file systems
  • initiative data prefetching
  • server-side prediction
  • time series
