Big data project
$10-30 USD
Paid on delivery
In this assignment, you are given a dataset of approximately 20,000 news documents collected from a set of newsgroups (mailing lists). The documents (email messages) are partitioned almost evenly across 20 different topics, such as sport, electronics, and politics. The documents of each newsgroup are stored in one directory, and each news document is stored in a text file in a semi-structured format.
Here is a sample document:
I have attached the document below.
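To make the directory layout described above concrete, here is a minimal Python sketch that counts the documents in each newsgroup directory and splits one message into headers and body. It is only an illustration under stated assumptions: the function names `count_documents` and `split_document` are hypothetical, and because the sample document is not reproduced here, the sketch assumes the usual email-style layout of `Name: value` header lines, a blank line, and then the message body. The actual files may differ.

```python
from collections import Counter
from pathlib import Path


def count_documents(root):
    """Map each newsgroup directory name under `root` to its number of files.

    Assumes one sub-directory per newsgroup, as the description states.
    """
    counts = Counter()
    for group_dir in Path(root).iterdir():
        if group_dir.is_dir():
            counts[group_dir.name] = sum(1 for p in group_dir.iterdir() if p.is_file())
    return counts


def split_document(text):
    """Split one message into (headers, body).

    Assumption: headers are 'Name: value' lines, separated from the body
    by the first blank line. Header names are lower-cased.
    """
    head, _, body = text.partition("\n\n")
    headers = {}
    for line in head.splitlines():
        name, sep, value = line.partition(":")
        if sep:  # skip lines that do not look like a header
            headers[name.strip().lower()] = value.strip()
    return headers, body
```

Calling `count_documents` on the dataset root should show whether the roughly even split across the 20 topics holds, and `split_document` gives a starting point for pulling fields such as the subject out of each file before any MapReduce-style processing.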
Project ID: #11519477
About the project
4 freelancers are bidding on this job, with an average bid of $220/hour
This is my first Freelancer project, and not a difficult one to crack. I have worked on a BPM tool (Informatica ActiveVOS), database tools (Oracle, MS SQL Server), and ETL and big data development tools: HDFS, MapReduce, Hive