R and its usage with hadoop

What is R and why it is used? R is a language and environment for statistical computing and graphics. It is a GNU project which provides a wide variety of statistical and graphical techniques, and is highly extensible. It includes an effective data handling and storage facility, a suite of operators for calculations on arrays, … Continue reading R and its usage with hadoop

Advertisements

MADLib installation and integration with HAWQ

Make sure that you have completed the following tasks before running the installation script: Make sure you have rpm, gpssh and gpscp in your PATH. Make sure that you have HAWQ binaries installed properly on all master and segment nodes in your cluster (also new segment nodes when adding new nodes). Add hawq_install.sh to your PATH  from https://github.com/madlib/madlib/blob/master/deploy/hawq_install.sh Make sure the HOSTFILE lists all the new segment nodes. … Continue reading MADLib installation and integration with HAWQ