Parallel K-Means Algorithm based on Hadoop-MapReduce for Mining

About The Book

This work investigates the use of a parallel K-Means clustering algorithm, based on the MapReduce programming model, to improve the response time of data mining. The algorithm's performance was evaluated in terms of SpeedUp and ScaleUp. To this end, experiments were performed on a Hadoop cluster consisting of six computers with standard hardware. The clustered data are measurements from flux towers in agricultural regions and belong to the AmeriFlux network. The experiments were performed using 3, 4, and 6 machines, respectively. The results showed that performance improved as the number of machines increased, with the best time obtained using six machines, reaching a SpeedUp of 3.25. The application was also found to scale well: when the data size and the number of machines in the cluster were increased in the same proportion, the tests achieved similar performance.
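
For readers unfamiliar with how K-Means maps onto MapReduce, the following is a minimal single-process sketch of one iteration, not the book's implementation. It assumes the usual decomposition: the map step assigns each point to its nearest centroid, and the reduce step averages the points of each cluster. All function names are hypothetical, and `speedup` assumes the common definition of a baseline run time divided by the parallel run time; the blurb does not state which baseline the book uses.

```python
from collections import defaultdict


def nearest(point, centroids):
    """Return the index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])),
    )


def map_phase(points, centroids):
    """Map: emit (cluster index, (point, 1)) for every input point."""
    for point in points:
        yield nearest(point, centroids), (point, 1)


def reduce_phase(pairs, centroids):
    """Reduce: sum the points of each cluster and average them into new centroids."""
    sums = {}
    counts = defaultdict(int)
    for idx, (point, n) in pairs:
        if idx in sums:
            sums[idx] = [s + p for s, p in zip(sums[idx], point)]
        else:
            sums[idx] = list(point)
        counts[idx] += n
    # Assumption: a cluster that received no points keeps its previous centroid.
    return [
        tuple(s / counts[i] for s in sums[i]) if i in sums else centroids[i]
        for i in range(len(centroids))
    ]


def kmeans_iteration(points, centroids):
    """Run one map step followed by one reduce step."""
    return reduce_phase(map_phase(points, centroids), centroids)


def speedup(time_baseline, time_parallel):
    """SpeedUp as baseline run time divided by parallel run time (assumed definition)."""
    return time_baseline / time_parallel
```

On a real Hadoop cluster the map and reduce steps would run as distributed tasks over partitions of the data, and the iteration would be repeated until the centroids stop changing; here both steps run in one process purely to show the data flow.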
