该项目收到13 来自天才威客的竞标，平均竞标价格为$196 USD。为像这样的项目获取免费报价
项目预算$30 - $250 USD
Deadline: Thursday 10/ Nov/2016
Using python, pandas, numpy and scikit learn.
For visualizations, you will not need anything more complex than scatter-plots, histograms or line plots. You will provide a single ipython notebook that contains the code for all the answers. Use a separate tab for each question. For each task, also write your appropriate answers in a .txt, .doc or .pdf and submit this along with your code.
1. I have provided you with a dataset called data1. It contains a train and test dataset. Use a suitable method to predict the “Value” given the features (there are 100 features) (there are a number of redundancies in the features). Evaluate and present your results using an appropriate error measure.
2. I have provided you with two datasets in data2.zip. For each dataset:
a. Analyze the data using an appropriate visualization
b. Use an appropriate method to cluster similar data-points together. Justify why you
picked the specific method for each dataset.
c. Output the clustered points using an appropriate visualization.
- The New York Times
- Wall Street Journal
- Times Online