Data Analytics Using Open-Source Tools


LOOKING TO PLACE A BULK ORDER?CLICK HERE

Piracy-free
Piracy-free
Assured Quality
Assured Quality
Secure Transactions
Secure Transactions
Fast Delivery
Fast Delivery
Sustainably Printed
Sustainably Printed
Delivery Options
Please enter pincode to check delivery time.
*COD & Shipping Charges may apply on certain items.
Review final details at checkout.

About The Book

This book is about Data Analytics. In that respect it is like others. What distinguishes it from the rest is the variety of open-source tool applications. This book incorporates the use of R Studio Python SAS Studio (University Edition) and KNIME. This book is also about manipulating Big Data. Apache Hadoop on Hortonworks Sandbox is introduced and we manage move handle and transform data using Apache Hive Apache Spark MapReduce and TEZ with terminal shell commands and Ambari. We show you how to set up a virtual machine in Microsoft Azure. We then use the data in later chapters for modeling. We cover Descriptive Modeling and Predictive. The content includes Support Vector Machines Decision Tree learning Random Forests Naive and Empirical Bayes Gradient Boosting Cluster Modeling Generalized Linear Models Logistic Regression and Artificial Neural Networks. Every chapter includes completely worked examples using one or more open-source tools.
downArrow

Details