Tuesday, November 26, 2013

HBase : Filters

All filters are applied on the server side, so only the matching data is sent back to the client.
This is called predicate pushdown.

You can create a filter instance from one of the following groups:

a) Comparison Filters

Row Filter:
Gives the ability to filter data based on rowkeys.

Family Filter:
Used to filter on column families. Data is retrieved at the column family level.

Qualifier Filter:
Used to filter on a specific column qualifier.

Value Filter:
Used to filter on cell values. Only columns whose value matches the comparison are returned.

Dependent Column Filter:
It lets you specify a dependent (reference) column. The timestamp of that column is used as the reference, and all other columns with the same timestamp are included.
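
To make the idea of comparison filters a bit more concrete, here is a minimal toy sketch in Python. It is not the HBase client API: the cell layout, the `CELLS` data, and the `compare_filter` helper are all made up for illustration. It only shows that each comparison filter boils down to "pick one part of the cell, compare it against a target with an operator".

```python
# Toy model only: this is NOT the HBase client API.
# A "cell" is a (row, family, qualifier, value) tuple; all names are hypothetical.
from operator import eq, le, ne  # comparison operators to plug into the filter

CELLS = [
    ("row1", "info", "name", "alice"),
    ("row1", "info", "city", "pune"),
    ("row2", "info", "name", "bob"),
    ("row2", "stats", "visits", "7"),
]


def compare_filter(cells, field, op, target):
    """Keep cells whose chosen field satisfies op(field_value, target)."""
    index = {"row": 0, "family": 1, "qualifier": 2, "value": 3}[field]
    return [cell for cell in cells if op(cell[index], target)]


# Row keys less than or equal to "row1" (row-filter style)
row_hits = compare_filter(CELLS, "row", le, "row1")
# Only the "stats" column family (family-filter style)
family_hits = compare_filter(CELLS, "family", eq, "stats")
# Only the "name" qualifier (qualifier-filter style)
qualifier_hits = compare_filter(CELLS, "qualifier", eq, "name")
# Every cell whose value is not "bob" (value-filter style)
value_hits = compare_filter(CELLS, "value", ne, "bob")
```

In real HBase the comparison runs on the region server, which is the whole point of pushing the predicate down; the sketch only mimics the selection logic.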

b) Dedicated Filters

SingleColumnValue Filter:
This filter is used when you have exactly one column that decides if an entire row should be returned or not.

SingleColumnValueExclude Filter:
This is an exclude filter. It works like the SingleColumnValue Filter, but you will not get the reference column as part of your result.

Prefix Filter:
All rows whose row key starts with this prefix are returned to the client.

Page Filter:
You specify the pagesize for your filter. This controls how many rows per page should be returned.
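
Here is a small toy sketch of how the dedicated filters above behave, again in plain Python and not the HBase client API. The `TABLE` data and the helper names (`single_column_value`, `prefix`, `page`) are assumptions made up for this example; paging by "start after the last key seen" is one simple way to model pages, not necessarily how a real client does it.

```python
# Toy model only: this is NOT the HBase client API.
# A table is a dict of row key -> {column name: value}; all names are hypothetical.
TABLE = {
    "user1": {"name": "alice", "status": "active"},
    "user2": {"name": "bob", "status": "inactive"},
    "user3": {"name": "carol", "status": "active"},
    "visit1": {"page": "home", "status": "active"},
}


def single_column_value(table, column, value, exclude=False):
    """Return rows where `column` equals `value`; optionally drop that column."""
    result = {}
    for key, columns in table.items():
        if columns.get(column) == value:
            result[key] = {
                c: v for c, v in columns.items() if not (exclude and c == column)
            }
    return result


def prefix(table, key_prefix):
    """Return rows whose row key starts with `key_prefix`."""
    return {k: v for k, v in table.items() if k.startswith(key_prefix)}


def page(table, page_size, start_after=None):
    """Return up to `page_size` rows in key order, after `start_after`."""
    keys = sorted(k for k in table if start_after is None or k > start_after)
    return {k: table[k] for k in keys[:page_size]}


active = single_column_value(TABLE, "status", "active")
active_without_status = single_column_value(TABLE, "status", "active", exclude=True)
users = prefix(TABLE, "user")
first_page = page(TABLE, 2)
second_page = page(TABLE, 2, start_after="user2")
```

The `exclude=True` call returns the same rows as the plain call, just without the `status` column that was used to make the decision.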

More Later..

Saturday, October 19, 2013

Putty : Network Error Permission Denied

Well, after googling for more than 3 hours, I figured out why I was not able to SSH to my VM.

Error: Network Error : Permission Denied.

Solution : Open Putty from any drive other than C:\ (DON'T INSTALL PUTTY IN C:)

I copied Putty to my D:\ drive and was then able to SSH to my VM.

(My Session on Putty)

Friday, September 20, 2013

Simple R Scatter Plot

plot(y~x,pch=19,col="blue",xlab="X axis",ylab="y axis",main="Scatter Plot Diagram")

Data Science

My technical Goals:
a) Learn Stats
b) Figure Out R
c) Learn Python
d) Figure out HBase / Hadoop
e) Learn the art of Visualization


I am sure that with a bit of luck and effort I will get there.

Now In Focus : R Statistics
Expect more posts on Math :P

GitHub 2

I recently helped some friends learn Git!! Here are the commands to do stuff on GitHub...

# Install Git on Ubuntu
sudo apt-get install git-core
# Check the Git version
git --version
# Add colors to the console output
git config --global color.ui auto
# Show the status of the repo
git status
# Show the structure of the .git directory
tree .git
# List all branches
git branch -a
# List the remotes (origins)
git remote -v
# Show the log
git log
# Show the recent changes
git whatchanged
# Delete the remote named origin
git remote rm origin

1st Creating Identity
# Global user name
git config --global user.name "Example Surname"
# Global email
git config --global user.email "your.email@gmail.com"

2nd Creating A Repo
# Initialize Git
git init
# Make changes to the repo
# Stage your changes
git add README
# Commit the staging area
git commit -m 'my first commit'
# Create a remote named origin that links to your repo (replace the placeholder URL)
git remote add origin <repository-url>
# Push your commit
git push origin master
# Make changes in the repo
# See the difference between the local master and the remote master
git fetch origin
git diff master origin/master
# Optionally merge the fetched changes
git merge origin/master
# Pull from origin
git pull origin master

3rd Forking
# Fork first, then copy the address and use the URL to clone (replace the placeholder)
git clone <fork-url>
# Create a branch
git branch <branch-name>
# Check out the branch
git checkout <branch-name>
# Make changes on that branch
gedit …..
# Maybe do a status
# Add and commit
# Push the branch to origin
git push -u origin <branch-name>
# Check out master to update the file system
git checkout master

Tuesday, September 3, 2013


Lately I have been experimenting with Git. It's a whole new way to code. I was really impressed with their philosophy of Social Coding. That means more sharing :).
My new GitHub repo.

If you want to learn Git, follow the links below.

Tuesday, August 20, 2013

Installing Julia in CentOS

Julia is a high-level dynamic programming language designed to address the requirements of high-performance numerical and scientific computing while also being effective for general purpose programming. (Wikipedia)

I am getting my hands dirty installing Julia. After spending more than 4 hours, I have finally installed Julia on my 32-bit CentOS.
The steps are as follows:
1) Install Git
2) Install the dependencies
3) Install Julia
4) Build Julia using the make command.
Basic Steps:

yum install git
git clone git://github.com/JuliaLang/julia.git
cd julia/
make -C deps getall
su root
yum install gcc-gfortran
yum -y install java-1.6.0-openjdk.x86_64 libXp openmotif openmotif22 python.x86_64 numpy* ncurses-devel gcc gcc-c++ autoconf automake
yum -y groupinstall "Development Tools"

Thursday, August 15, 2013

Food Anyone

Well, I admit it: I splurge once in a while. After splurging I keep promising myself I won't do it again. One perk of living alone is that you can eat whatever the hell you like. Unlike most people, I dislike a lot of things. (HATE is a strong word!! I won't use it :p).
a) CURD
b) MILK
c) KARELA
d) BUTTER
e) ONIONS
My list goes on......
I recently went to BBQ NATION. There were more than 40 kinds of things to eat. All I did was nibble here and nibble there. Horrible experience. Most of the stuff was either one of the things listed above or NON-VEG.

All told, I regret paying them 800 Rs. ....

Wednesday, August 14, 2013

Word Count In Pig

Still figuring out GitHub. Will post the code there once it's done. :)

A = LOAD 'Nameipfile' USING TextLoader() AS (line:chararray);
B = FOREACH A GENERATE FLATTEN(TOKENIZE(line)) AS word;
C = FOREACH (GROUP B BY word) GENERATE group AS word, COUNT($1) AS ct;
DUMP C;
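
For comparison, the same load, group, and count idea can be sketched in a few lines of plain Python. This is only a rough in-memory equivalent: the `word_count` helper and the `sample_lines` input are made up for this example, and splitting on whitespace is a simplification, not a claim about how Pig tokenizes.

```python
from collections import Counter


def word_count(lines):
    """Split each line on whitespace and count how often each word occurs."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    return counts


# Hypothetical input standing in for the lines of the input file
sample_lines = ["pig counts words", "pig groups words"]
counts = word_count(sample_lines)

# Print one "word count" pair per line, sorted by word
for word, ct in sorted(counts.items()):
    print(word, ct)
```

Pig does the grouping across the cluster, while this version keeps everything in one process, so it is only useful for checking the logic on small inputs.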

Monday, July 15, 2013

WordCloud with Tag @Srisrispeaks

I have created a Twitter Word Cloud with all tweets having the tag @srisrispeaks.

Saturday, June 22, 2013

Hortonworks Certified Apache Hadoop Developer (HCAHD)

What a feeling! I cleared the Hortonworks Certified Apache Hadoop Developer (HCAHD) exam.
Preparation Tips:
Buy the Definitive Guide
Read it!
Make sure you have sufficient hands-on practice, and take the practice exam on the website.
There were some very good questions asked, so make sure your fundamentals are clear.
With luck I am sure you will ace it!