tag : bigdata

Apache HAWQ is Landing on HDP

News: Last week, Hortonworks announced that it will expand its strategic relationship with Pivotal. This will bring together Hortonworks’ expertise and support for data management and processing with Pivotal’s top

Happy New Year 2016

It is the end of 2015, so Happy New Year 2016! It is time to wrap up my writing calendar with a summary of Sparkera, myself, and the Big Data ecosystem. In 2015, I published 21 articles in

Build Big Data Warehouse With Apache Hive

Ten tools for ten big data areas 04_Apache Hive from Will Du. The presentation above is the fourth topic I have covered in a series of talks about the Ten Tools for Ten Big Data Areas. Apache Hive is

Light Big Data With Apache Spark

Ten tools for ten big data areas 03_Apache Spark from Will Du. The presentation above is the third topic I have covered in a series of talks about the Ten Tools for Ten Big Data Areas. Apache Spark is

Apache Spark Memo

Here I am collecting notes as I learn Spark, so that people like me can benefit from this collection. 1. Spark Core: The stage creation rule is based on the idea of pipelining as many narr

Ten Tools for Ten Big Data Areas

In ancient China, it is said, there were ten legendary weapons. Each of them had its own special magic and power. Anyone who owned one of these weapons could become a master or leader who is not undefeat

Data Analysis

A friend of mine asked me what data analysis is. This is a simple but difficult question. It is simple because we talk about data analysis all the time and everywhere. It is difficult because there a

Data Lake Stages

Edd recently published a very impressive blog post about how the Hadoop ecosystem influences the data lake in the enterprise. It discusses the following four stages in the evolution of an enterprise’s data to the dr

Moving to the Spark

It has been a while since the blog was last updated, as 2014 was a really busy year. Now that I have almost completed my first book, I think it is the right time to start a new journey in big data for rea

Big Data Platform

Here I compare the best-known vendors that offer Hadoop platforms for the enterprise. Below is a typical vision of a big data analytics architecture