TY - GEN
T1 - Extending standard MapReduce algorithms
AU - Hashem, Hadi
AU - Ranc, Daniel
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/8/16
Y1 - 2016/8/16
N2 - The information rate nowadays is expanding very quickly and contains complex and heterogeneous data types (text, images, videos, GPS data, purchase transactions) that require powerful computing engines, able to easily store and process such complex structures. Gartner's definition of the 3Vs (volume, velocity, variety) describing this expansion of data then leads to extracting the unnamed fourth V (value) from BigData. This added value addresses the need for valuation of enterprise data. In this paper, we discuss the existing MapReduce implementation techniques and the need for a different approach based on the pre-processing of the data. The goal is to show interesting results in terms of data processing costs, performance and green computing.
AB - The information rate nowadays is expanding very quickly and contains complex and heterogeneous data types (text, images, videos, GPS data, purchase transactions) that require powerful computing engines, able to easily store and process such complex structures. Gartner's definition of the 3Vs (volume, velocity, variety) describing this expansion of data then leads to extracting the unnamed fourth V (value) from BigData. This added value addresses the need for valuation of enterprise data. In this paper, we discuss the existing MapReduce implementation techniques and the need for a different approach based on the pre-processing of the data. The goal is to show interesting results in terms of data processing costs, performance and green computing.
KW - BigData Tools
KW - Data Modeling
KW - MapReduce Algorithms
KW - Pre-Processing
U2 - 10.1109/BigMM.2016.49
DO - 10.1109/BigMM.2016.49
M3 - Conference contribution
AN - SCOPUS:84987606135
T3 - Proceedings - 2016 IEEE 2nd International Conference on Multimedia Big Data, BigMM 2016
SP - 159
EP - 165
BT - Proceedings - 2016 IEEE 2nd International Conference on Multimedia Big Data, BigMM 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Multimedia Big Data, BigMM 2016
Y2 - 20 April 2016 through 22 April 2016
ER -