tag : hadoop

Hadoop Serialization Framework

Below are some mentioned in Hadoop In Practice and MapReduce Cookbooks Where to use serialization In order to be used as a value data type of a MapReduce computation, a data type must implement the or

Rsync for HBase/Hadoop Cluster Deployment

Create a simple rsync script to do HBase/Hadoop deployment Create a cluster-deploy.sh script, shown as follows: $ vi cluster-deploy.sh #!/bin/bash # Sync HBASE_HOME across the cluster. Must run on ma

Little About MapReduce Combiner

Combiner is used to reduce the number of split shuffling to reducer. It will improve the overall performance obviously. There are following two points to be attention of using it. Your map and reduc