tag : hive

Apache Hive RowID Generation

It is quite often that we need a unique identifier for each single rows in the Apache Hive tables. This is quite useful when you need such columns as surrogate keys in data warehouse, as the primary k

Happy New Year 2016

It is the end of 2015 and HAPPY NEW YEAR - 2016. It is time to wrap up my writing calendar with some summary on Sparkera, myself, and Big Data ecosystem. In past 2015, I have published 21 articles in

Build Big Data Warehouse With Apache Hive

Ten tools for ten big data areas 04_Apache Hive from Will Du Above presentation is the fourth topic I have covered for series of talks about the Ten Tools for Ten Big Data Areas. Apache Hive is

Apache Hive Essentials Published

Finally, I made it. I got it published after working for 6 monthes. Apache Hive Essentials My very first book Also the first book on Apache Hive 1.0.0 in the world Check it out here

Hive and Hadoop Exceptions

I installed Hive 1.0.0 on Hadoop 1.2.1. When I try to enter the Hive CLI, it reports following exceptions 1org.apache.hadoop.hive.ql.metadata.HiveException:java.io.IOException:Filesystem closed Accor

Hive Get the Max/Min Value Rows

Most of time, we need to find the max or min value of particular columns as well as other columns. For example, we have following employee table. 1234567891011> SELECT name,sex_age.sex AS sex,sex_

Hive and Hadoop Exceptions

I installed Hive 1.0.0 on Hadoop 1.2.1. When I try to enter the Hive CLI, it reports following exceptions 1org.apache.hadoop.hive.ql.metadata.HiveException:java.io.IOException:Filesystem closed Accor

Hive Composite Data Type

For now, hive supports following composite data type: map: (key1, value1, key2, value2, …). Creates a map with the given key/value pairs struct: (val1, val2, val3, …). Creates a struct with the give

Hive vs. Pig

Both projects are top Apache projects to process data in Hadoop. Here, I try to compare the difference. Below is picture I found (I cannot find the original link, but there is mirror here In addition,

Hive Sorting and Ordering

There are following key words used in Hive to sort data with following difference: ORDER BY (ASC|DESC) : This is similar to the traditional SQL operator. Sorted order is maintained across all of the