Data Processing/Scraping from Standard Format txt Files

In progress, posted Oct 8, 2013, paid on delivery

Hi, we are looking to hire someone to convert existing data files (we will provide a web link) from a standard .txt format, containing numeric and text entries, into a format suitable for computing.

1) We would like you to start by taking 100 entries (selected at random with a random number generator) from one of the 30 files we will give you.

2) We would like you to transform these 100 entries into a matrix in .csv form, based on pre-specified categories that we will provide. Two of the columns are word and word count. Another is entry ID.

3) We would also like a sparse representation of the word and word count columns: a new matrix in which rows are entry numbers, columns are word labels, and each cell holds the count. The exact form depends on the size of the file. We can talk about this.

4) The deliverable should be split into csv files of manageable size, which won't be a problem for this data...
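
Steps 1 to 3 above could be sketched roughly as follows. The posting does not describe the .txt layout, so this sketch ASSUMES that entries are separated by blank lines and that words are whitespace-separated tokens; the function names, column names, and seed are illustrative, not part of the brief.

```python
import csv
import io
import random
from collections import Counter


def split_entries(text):
    """Split raw text into entries. ASSUMPTION: blank lines separate entries."""
    return [block.strip() for block in text.split("\n\n") if block.strip()]


def sample_entries(entries, k=100, seed=0):
    """Step 1: pick k entries at random without replacement.

    Returns (entry_id, text) pairs, where entry_id is the position in the file.
    """
    rng = random.Random(seed)
    k = min(k, len(entries))
    ids = sorted(rng.sample(range(len(entries)), k))
    return [(i, entries[i]) for i in ids]


def long_rows(sampled):
    """Step 2: one row per (entry_id, word, count)."""
    rows = []
    for entry_id, text in sampled:
        counts = Counter(text.lower().split())  # ASSUMPTION: whitespace tokens
        for word, count in sorted(counts.items()):
            rows.append((entry_id, word, count))
    return rows


def wide_matrix(rows):
    """Step 3: rows are entries, columns are words, cells hold the counts."""
    vocab = sorted({word for _, word, _ in rows})
    column = {word: j for j, word in enumerate(vocab)}
    entry_ids = sorted({entry_id for entry_id, _, _ in rows})
    matrix = {entry_id: [0] * len(vocab) for entry_id in entry_ids}
    for entry_id, word, count in rows:
        matrix[entry_id][column[word]] = count
    return vocab, [(entry_id, matrix[entry_id]) for entry_id in entry_ids]


def to_csv(header, rows):
    """Render a header and rows as CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()
```

The long table can be written with `to_csv(["entry_id", "word", "count"], rows)` and the wide one with `to_csv(["entry_id"] + vocab, [[eid] + counts for eid, counts in matrix])`. The long table only stores non-zero counts, so for a large vocabulary it may be the more practical form to exchange, with the wide matrix built on demand.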

We will definitely have more work if this is done successfully (covering all files and more entries), so scalable routines are highly encouraged. We are thinking about a million entries, with a higher budget, if this goes well.
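
For the million-entry scale mentioned here, one possible approach (not something the posting prescribes) is reservoir sampling, which draws a uniform random sample in a single pass without holding the whole file in memory. The sketch below assumes entries arrive from some iterator; `reservoir_sample` is an illustrative name.

```python
import random


def reservoir_sample(entries, k, seed=0):
    """Algorithm R: uniform sample of k items from an iterable of unknown length.

    Returns (position, entry) pairs; if the input has fewer than k items,
    all of them are returned.
    """
    rng = random.Random(seed)
    reservoir = []
    for i, entry in enumerate(entries):
        if i < k:
            # Fill the reservoir with the first k entries.
            reservoir.append((i, entry))
        else:
            # Keep entry i with probability k / (i + 1).
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = (i, entry)
    return reservoir
```

Because only `k` entries are held at a time, the same routine works whether the source is a 100-entry pilot file or a stream of a million entries.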

Thank you very much.

Please note that we will only hire someone who can do this automatically, since we are primarily looking for FUTURE work. This is just a pilot.
Once we go from 100 entries to 1 million, manual typing will not work. We realize that file size will be an issue depending on the matrix, so if the output eventually needs to be broken apart into, say, 1000 files of 1000 entries each, we will then use these with parallel computing routines for our computations. Thank you so much, and we look forward to working with you.
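
The split into fixed-size files described above could look something like this. It is a sketch under assumptions: each entry is taken to be an `(entry_id, {word: count})` pair, and the `part_0000.csv` naming scheme and the column names are hypothetical.

```python
import csv
from itertools import islice
from pathlib import Path


def chunked(iterable, size):
    """Yield successive lists of at most `size` items."""
    it = iter(iterable)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def write_chunks(entries, out_dir, size=1000):
    """Write long-format CSV parts, `size` entries per file; return the paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for n, batch in enumerate(chunked(entries, size)):
        path = out_dir / f"part_{n:04d}.csv"
        with path.open("w", newline="", encoding="utf-8") as fh:
            writer = csv.writer(fh)
            writer.writerow(["entry_id", "word", "count"])
            for entry_id, counts in batch:
                for word, count in sorted(counts.items()):
                    writer.writerow([entry_id, word, count])
        paths.append(path)
    return paths
```

Each part file is self-describing (it carries its own header), so separate workers in a parallel job can read the parts independently.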

Big Data Sales, Data Entry, Data Mining, Data Processing, Web Scraping

Project ID: #5006785

About the project

40 proposals, remote project, active Oct 9, 2013

40 freelancers are bidding on this job, at an average of $141/hour

jaylancer43

Hello, I am an expert techno-functional analyst with extensive experience in many areas of the IT industry, including Excel macros. I am an engineering graduate with an MBA degree. If you see, I am among the niche bid …

$111 USD in 3 days
(414 reviews)
8.0
Toperfection

Dear "statsphd", I hope you are doing well. I have reviewed the project details and would like to offer our services. We have completed many research, data collection, product add, and data mining assignments on [login to view URL] …

$151 USD in 3 days
(168 reviews)
7.8
uumairkhalid

Hi. Expert web scraper/data miner here. Interested in your project. I assure you of 100% accurate, good-quality work. Regards

$105 USD in 3 days
(189 reviews)
7.1
tjawad17

Hello Sir, we are a professional company specializing in data mining and web scraping. We have our own server, team, and tools for mining and scraping data efficiently and accurately. We can parse your given text …

$155 USD in 4 days
(165 reviews)
6.9
happy2helpp

Respected sir, we have read the project description and understand the project fully. We are experts in Big Data, Data Entry, Data Mining, Data Processing, and Web Scraping. We have worked on many similar tasks before and …

$231 USD in 4 days
(84 reviews)
6.9
diamond247

Hello Sir, we are a large company with highly skilled operators who have a lot of experience in this segment. Our employees have completed more than 300 similar jobs. I have gone through your project specification, I …

$144 USD in 3 days
(243 reviews)
7.1
ashok7925

Hi, I am very interested in this work. Please share more details, along with a sample text file, and describe what you would like done. I can automate the whole process once I understand your requirements. Please sha …

$100 USD in 3 days
(33 reviews)
5.3
elMancha

Hello there. I have strong Excel and Visual Basic skills and work with great professionalism. I study electronics and computer engineering at Oporto university and I'm looking for work to fill the gaps in my schedule. I' …

$60 USD in 3 days
(40 reviews)
5.0
arvt

Hi, I'm interested and would like to know more details about your project so I can bid accordingly. I have experience writing programs and scripts for projects here and on other freelancer sites. I have Skype, Gtalk, MS …

$35 USD in 3 days
(12 reviews)
4.9
mohanlg

Hi, I am interested in doing this project. I am an expert in data conversion work. Please send me more details so I can start. Thanks, sunny

$35 USD in 2 days
(25 reviews)
4.3
RajakScripts

Hi, please attach the .txt file AND a matrix in .csv form based on your pre-specified categories for review, so I can adjust my bid and delivery time precisely. Yes, I am aware that you want this to be perform …

$88 USD in 3 days
(7 reviews)
4.3
gokhanonal

Dear Sir / Madam, I'm a computer engineer (with a BS degree), working freelance in Istanbul, Turkey. I can complete your project quickly and accurately. Please let me know. Looking forward to hearing from you soon, …

$35 USD in 1 day
(13 reviews)
3.6
signo

Hello, I am experienced in working with large files and back-end processing in general. I will definitely finish this project in the next 24 hours. I still need some clarifications before getting started, regardi …

$133 USD in 1 day
(32 reviews)
4.2
thanhhungqb

Dear sir, I have read your requirements carefully and am interested. I am an expert in data entry, data scraping, and data processing. I usually do this automatically. For your project, I think I can automate it with a prog …

$126 USD in 3 days
(15 reviews)
3.6
sunil440

Good day! I would like to submit my application as Data Collector. I would be pleased if you would consider me a qualified applicant. I believe my qualifications would make me an outstanding asset to your organization. I woul …

$100 USD in 3 days
(16 reviews)
3.4
GurpreetSngh220

Hi, I am very much interested in your project. I would like to discuss the project with you further. You can rely on me because I am serious about my work and am not here to waste time (for either of us). you …

$188 USD in 5 days
(7 reviews)
3.2
FernandoCanizo

Hello, I'm interested and I'd like to give it a try. Can you provide a sample file so I can send you my attempt? No compromises. Also send me any other information I would need to build a proper processing script, I'm t …

$30 USD in 2 days
(2 reviews)
3.4
inoussakabore

Hi, I have already done this kind of job. You can see that in my profile. I am ready to start. I can do it in about one week.

$250 USD in 7 days
(3 reviews)
3.3
igors233

Greetings, I'm a professional software developer with 15+ years of experience in similar tasks. I will produce a standalone exe (no dependencies) that takes the given txt file as input (it could be downloaded automatical …

$147 USD in 10 days
(4 reviews)
3.5
szymszteinsl

Hi! I am a professional C/C++/C#/Java programmer. I can do this project with the highest quality. Best regards, Szymszteinsl

$144 USD in 3 days
(2 reviews)
3.3