Spark Cluster Installation
Prerequisites Create static ip addresses for each node in the cluster Create meaningful hostnames for each node in the cluster Standard update on each node sudo apt update Install Java ...
Prerequisites Create static ip addresses for each node in the cluster Create meaningful hostnames for each node in the cluster Standard update on each node sudo apt update Install Java ...
Prerequisites Please make sure to have a working hadoop cluster installed hadoop tutorial login to the master node as the ‘hadoop’ user created in the previous turorial Install Hive Downloa...
Prerequisites At least two ubuntu servers One ubuntu server for the master node At least one ubuntu server for the slave node (potentially multiple) hostnames Set mea...
Background The term “data lake” is credited to James Dixon, the former CTO of Pentaho. The term has been around since 2010. In 2012, The Harvard Business Review described the data scientist role...
Intended Audience: data professionals, just getting started with linux Objectives: cover main linux topics necessary to stand up an on-prem data lake There are many flavors of Linux. I te...