Tuesday, June 18, 2013

Hive Essentials (Part 1)

This post is dedicated to Hive.

What is Hive?
Hive is a data warehouse system that sits on top of Hadoop and lets you query data stored in HDFS with a SQL-like language (HiveQL).

Online Resources
http://en.wikipedia.org/wiki/Apache_Hive

Hive queries run as MapReduce jobs under the hood.
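
One way to see this for yourself is to ask Hive for the execution plan of a query. The sketch below assumes the sample table created further down in this post; the exact plan output depends on your Hive version, so none is shown here.

```sql
-- Print the plan Hive would use for this query instead of running it.
-- An aggregation like this one is typically planned as a map/reduce stage.
EXPLAIN
SELECT lastName, COUNT(*)
FROM sample
GROUP BY lastName;
```

Running the same query without EXPLAIN should launch the job itself, and the console shows its map and reduce progress.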

Hive Components:
Hive Metastore
a) Stores table definitions, indexes, and partitions
b) By default, stored on the local machine in an embedded Derby database
Hive Query Engine

Basic Syntax:
SHOW TABLES; -- shows all tables
DESCRIBE <table_name>; -- describes the schema of the table


CREATE TABLE sample(firstName STRING, lastName STRING, id INT)
ROW FORMAT
DELIMITED FIELDS
TERMINATED BY ' '
LINES TERMINATED BY '\n'
STORED AS TEXTFILE;
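
Once the table exists, a next step might look like the sketch below: load a space-delimited text file into it and query it. The file path /tmp/sample.txt and its contents are made up for illustration; substitute your own file.

```sql
-- Hypothetical input file /tmp/sample.txt, one row per line,
-- fields separated by a single space to match the table definition:
--   John Smith 1
--   Jane Doe 2

-- Copy the local file into the table's storage location.
LOAD DATA LOCAL INPATH '/tmp/sample.txt' INTO TABLE sample;

-- Confirm the schema, then read a row back.
DESCRIBE sample;
SELECT firstName, lastName FROM sample WHERE id = 1;
```

Note that Hive does not validate the file against the schema at load time; a field that cannot be parsed as the declared type (for example, text in the id column) comes back as NULL when queried.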

