关闭

Data analysis

该项目收到15 来自天才威客的竞标,平均竞标价格为$193 USD

为像这样的项目获取免费报价
雇主工作
项目预算
$30 - $250 USD
全部竞标
15
项目描述

Deadline: Thursday 10/ Nov/2016

Using python, pandas, numpy and scikit learn.

For visualizations, you will not need anything more complex than scatter-plots, histograms or line plots. You will provide a single ipython notebook that contains the code for all the answers. Use a separate tab for each question. For each task, also write your appropriate answers in a .txt, .doc or .pdf and submit this along with your code.

1. I have provided you with a dataset called data1. It contains a train and test dataset. Use a suitable method to predict the “Value” given the features (there are 100 features) (there are a number of redundancies in the features). Evaluate and present your results using an appropriate error measure.

2. I have provided you with two datasets in data2.zip. For each dataset:

a. Analyze the data using an appropriate visualization

b. Use an appropriate method to cluster similar data-points together. Justify why you

picked the specific method for each dataset.

c. Output the clustered points using an appropriate visualization.

在寻找赚取金钱的机会?

  • 设定您的预算和时间框架
  • 大致描述您的建议方案
  • 为您的工作领取工资

雇用同样在该项目上竞标的威客

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online