Monday, 11 July 2016

Apache Hadoop : Hive Partitioning and Bucketing Example on Twitter Data


Hive Partitioning and Bucketing Example on Twitter Data

Overview on Hive Partitioning :


Hive organizes tables into partitions. It is a way of dividing a table into related parts based on the values of partitioned columns such as date, city, and department. Using partition, it is easy to query a portion of the data.
Overview on Hive Bucketing :
The Hive Partition can be further subdivided into Clusters or Buckets.Hive Buckets is nothing but another technique of decomposing data or decreasing the data into more manageable parts or equal parts.

Check it out above Examples in below weblink:


Hive Partitioning and Bucketing Example on Twitter Data

I hope this tutorial will surely help you. If you have any questions or problems please let me know.


Happy Hadooping with Patrick..

No comments:

Post a Comment