Monday, 27 January 2014

Data Handling (1/4) -unstructured data manipulation

Before I begin to talk about data mining, I would like to share my idea about data handling. When it comes to the data mining, a lot of data should be handled in order to find out the data insight in a high performance system. If your result is deduced by computing small data in your PC, I don't think,  this is a real data mining. The larger statistic data analysis, the higher accurate confidence interval is guaranteed.

I think, to become a practical data analyst, you have to be familiar with data manipulation. From now on, I will introduce a data handling skill by  next several posts.

Today, I am going to show you how to change unstructured data into structured data format using interpreter language like python.

Let me start by clicking the below web page.

 http://www.nbastuffer.com/2013-2014_NBA_Regular_Season_Player_Stats.html

This web URL  provides statistical NBA data which is a  '2013-2014 NBA Regular Season Player Stats'.