International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 79 - Number 5
Year of Publication: 2013
Authors: Kiran M., Saikat Mukherjee, Ravi Prakash G.
DOI: 10.5120/13741-1551
Kiran M., Saikat Mukherjee, Ravi Prakash G. Characterization of Randomized Shuffle and Sort Quantifiability in MapReduce Model. International Journal of Computer Applications. 79, 5 (October 2013), 51-59. DOI=10.5120/13741-1551
Quantifiability is a concept in MapReduce analytics based on the following two conditions: (a) a mapper should be cautious, that is, it should not exclude any reducer's shuffle and sort strategy from consideration; and (b) a mapper should respect the reducers' shuffle and sort preferences, that is, it should deem a reducer's shuffle and sort strategy k_i infinitely more likely than k'_i whenever it presumes that the reducer prefers k_i to k'_i. A shuffle and sort strategy is quantifiable if it can be chosen optimally under common shuffle and sort conjecture in the events (a) and (b). In this paper we present an algorithm that, for every finite MapReduce operation, computes the set of all quantifiable shuffle and sort strategies. The algorithm is based on the new idea of a key-value preference limitation, which is a pair (k_i, V_i) consisting of a shuffle and sort strategy k_i and a subset of shuffle and sort strategies V_i for mapper i. The interpretation is that mapper i prefers some shuffle and sort strategy in V_i to k_i. The algorithm proceeds by successively adding key-value preference limitations to the MapReduce operation.
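The abstract does not state the rule by which new key-value preference limitations are obtained, so the following is only a minimal sketch of the overall shape it describes: a limitation stored as a pair of one strategy and a set of strategies for a mapper, and a loop that keeps adding limitations until none are new. The names `PreferenceLimitation`, `accumulate_limitations`, `derive`, and `toy_rule` are hypothetical and do not come from the paper. The `derive` callback is a placeholder for the paper's actual rule, and `toy_rule` exists only to make the sketch runnable.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Iterable, Set


@dataclass(frozen=True)
class PreferenceLimitation:
    """One key-value preference limitation for a mapper.

    Read as: the mapper prefers some strategy in `preferred` to `strategy`.
    """
    mapper: int
    strategy: str
    preferred: FrozenSet[str]


# Hypothetical signature: given the limitations found so far, return candidates.
DeriveRule = Callable[[FrozenSet[PreferenceLimitation]], Iterable[PreferenceLimitation]]


def accumulate_limitations(
    initial: Iterable[PreferenceLimitation],
    derive: DeriveRule,
) -> Set[PreferenceLimitation]:
    """Successively add limitations until a pass produces nothing new.

    Terminates whenever `derive` draws from a finite universe of limitations,
    which is the case for a finite set of mappers and strategies.
    """
    limitations: Set[PreferenceLimitation] = set(initial)
    while True:
        new = set(derive(frozenset(limitations))) - limitations
        if not new:
            return limitations
        limitations |= new


def toy_rule(found: FrozenSet[PreferenceLimitation]) -> Set[PreferenceLimitation]:
    """Illustrative rule only (an assumption, not the paper's rule):
    if strategy "a" is limited for a mapper, also limit "b" in favour of "c"."""
    return {
        PreferenceLimitation(lim.mapper, "b", frozenset({"c"}))
        for lim in found
        if lim.strategy == "a"
    }


start = {PreferenceLimitation(mapper=1, strategy="a", preferred=frozenset({"b"}))}
result = accumulate_limitations(start, toy_rule)
```

Using frozen dataclasses and frozensets keeps each limitation hashable, so the accumulated collection can be an ordinary set and the stopping test reduces to a set difference.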