category : Blog

Light Big Data With Apache Spark

Ten tools for ten big data areas 03_Apache Spark from Will Du Above presentation is the third topic I have covered for series of talks about the Ten Tools for Ten Big Data Areas. Apache Spark is

Apache Spark Memo

Here, I am collecting the memo while I am learning the spark so that people like me can benefit fot this collection. 1. Spark Core The stage creation rule is based on the idea to pipeline as many narr

Tableau Your Big Data

Ten tools for ten big data areas 02_Tableau from Will Du Above presentation is the second topic I have covered for series of talks about the Ten Tools for Ten Big Data Areas. Tableau is one of f

Ten Tools for Ten Big Data Areas

In the ancient of China, it is said there are ten legend weapons. Each of them has special magic and power. Anyone who can own one of these weapons could become a master or leader who is not undefeat

Informatica in Big Data

Ten tools for ten big data areas 01 informatica from Will Du Above presentation is the first topic I’ll cover for series of talks about the Ten Tools for Ten Big Data Areas. I came up this seri

Hadoop Streaming

1. Streaming OverviewHadoop Streaming is a generic API which allows writing Mappers and Reduces in any language. Develop MapReduce jobs in practically any language Uses Unix Streams as communicatio

Apache Hive Essentials Published

Finally, I made it. I got it published after working for 6 monthes. Apache Hive Essentials My very first book Also the first book on Apache Hive 1.0.0 in the world Check it out here