Saturday, June 22, 2013

Hortonworks Certified Apache Hadoop Developer (HCAHD)

What a feeling i cleared Hortonworks Certified Apache Hadoop Developer (HCAHD).
Preparation Tips:
Buy the Definitive Guide
Read it!
Make sure you have sufficient hands on and take the practice exam on the website.
There were some very good questions asked make sure you have your fundas clear.
With luck i am sure you will ace it! 

Wednesday, June 19, 2013

Sri Sri Yoga for Kids








Hive - Part 2

.


Hive was developed by facebook .
  • It is a data warehouse system built on Hadoop.
  • Helps you to define a structure for you massive unstructured data
  • Simplify analysis by using SQL like scripting language.(Hive QL)
Hive is NOT : 
  1. A Relational Database
  2. Designed for OLTP.
  3. Suited for real time updates  
Use Cases:
Ad hoc queries
Analysis
Summarization  

Hive components:
Hive Clients (Thrift,JDBC,ODBC)
Hive Services (CLI(Command Line Interface), HWI(Hive web Interface))
Hive Metastore

The metastore is the central repository of Hive metadata. The metastore is divided into two pieces: a service and the backing store for the data. By default, the metastore service runs in the same JVM as the Hive service and contains an embedded Derby database instance backed by the local disk. This is called the embedded metastore configuration.

In Hive Insert statement is used to populate data.
We can load data to the table from local disk or HDFS.

Teaching Children

I have always been inspired by children. Every time i see a child, i see a ray of hope. If my present sucks, i can work harder to make sure kids get better future. Well in fact this is what every parent wants. Some thing is very wrong in our system. Children are being robbed of their innocence and are stressed out. I have recently started taking youth courses for Juvenile Kids. They are no different then other kids.
Today was my first day.
While packing up one of the kids i had never spoken to helped me pack my yoga mat. I was touched by his gesture. They asked me if i would come back again to teach them yoga the next day. :) :)
On weekdays i go for an hour to teach theses kids yoga.
If you would like to contribute in any capacity please let me know.

Tuesday, June 18, 2013

Hive Essentials (Part 1)

This Post will be dedicated to Hive:

What is Hive?.
Hive is a data warehouse that sits on top of Hadoop .

Online Resources
http://en.wikipedia.org/wiki/Apache_Hive

Hive Queries run as Mapreduce Jobs under the hood.

Hive Components:
Hive MetaStore
a) Table Definition, Indexes and partitions
b) Stored on the Local Machine on Derby Database
Hive Qurey Engine

Basic Sytnax:
Show Tables;//Shows All tables
Describe
; //Describes the schema of the table


CREATE TABLE sample(firstName STRING, lastName STRING, id INT)
ROW FORMAT
DELIMITED FIELDS
TERMINATED BY ‘ ‘
LINES TERMINATED BY ‘\n’
STORED AS TEXTFILE


Hive - Which stung me during (Hortonworks) my exam


Whohoo i failed the Hortonworks practice exam as i had mostly concentrated on Mapreduce , Hbase and Pig. I have a big over confidence Problem. :)
This exam is totally free you can give it 2 times.

I would say your test would mostly be :

30% Mapreduce
30% Pig
30% Hive
10% Hbase + Hcatlog

Prepare well and best of luck dont make the mistakes i made :).

Friday, June 14, 2013

My Technical Side


Been busy with work. I am planning to take a Hadoop Certification Exam.

Here are the details:

Hortonworks Certified Apache Hadoop Developer (HCAHD)The Certified Apache Hadoop Developer certification is intended for developers who design, develop and architect Hadoop-based solutions, consultants who create Hadoop project proposals and Hadoop development instructors.  Those certified are recognized as having a high level of skill in Apache Hadoop development.

Link:
http://www.webassessor.com/hortonworks
Test Duration: 90 Minutes
Fee when converted to INR : 9200 approx.

Will share more ...