Thursday, 31 October 2013

Introduction to this Blog

This blog is a follow-up to my R self study series ( http://flydokyun.blogspot.kr/ ).

I have decided to open a new blog under a different title.

The previous R self study focused on statistical concepts, but this blog will be more practical because I am going to analyse real data. I am also going to introduce parts of the IT environment, such as DBMS, Linux, and Hadoop.

I think data analysis in an enterprise is carried out on large IT systems that hold huge amounts of data. It is therefore also important to understand the IT environment and programming languages such as Python or Java.

I hope you will enjoy my blog.


The following is the development environment on my PC.
Each of these components will be covered in more detail in the next post.

1. OS : Ubuntu 12.04.2 LTS
2. DBMS : Server version: 5.5.32-0ubuntu0.12.04.1 (Ubuntu)
3. R : R version 2.14.1 (2011-12-22)
4. JAVA : java version "1.7.0_40"
5. HADOOP : hadoop-1.2.1
6. Python : Python 2.7
7. Test Data source : http://www.nbastuffer.com , www.nba.com