Find Jobs
Hire Freelancers

Multithreaded External Website Source Parser (RegEx Bot)

$30-250 USD

已取消
已发布大约 12 年前

$30-250 USD

货到付款
Title: Multithreaded External Website Source Parser (Regular Expressions) Project Type: Programmed Software, Windows (x64) Application, (Including Source Code) Language: C#.NET, C, OR VB.NET (Visual Basic 2010) Or Other Programming Language for Executable Application Interface: Screenshots of the interface design are attached to the Project Resources, and should be followed similarly Budget: $150 total, with (1) proposed 'milestone' payment of $75 for a completed single threaded version that is limited to saving only the first 100 results of each type of information collected, and only use a single regular expression input file. The first milestone payment will be made after the demonstration application is reviewed. The speed however, even in the first milestone demonstration application, must meet certain 'speed' standards and programmer must have the knowledge of how the speed/accuracy will inncrease/decrease in comparison to the full multithreaded version, given 25~ mbps internet connection and a 5000~ passmark score PC running Win x64 and 8GB RAM. Project Summary: This project description is for a program with the purpose of parsing external website source codes. The user will import a list of regular expressions, one per line named [login to view URL] in the following format: beginning text##!##ending text Whatever text in the source code is between the 'beginning text' and 'ending text' (where '##!##' is in the input file) will be appended to the file [login to view URL] The application will also read a second file with a list of URLs (one per line) named [login to view URL] If multiple matches of the regular expression are found within the same page's source code, each one will be appended to the output file ([login to view URL]) Getting Started: This project is best suited for someone who has already developed this application in part or wholly, though it is quite straight forward to anyone familiar with the scraping, data mining, crawling, etc of websites. Speed, accuracy, and scalability: This software will be run on an approximate 25~ mbps internet connection (mega BIT), and 5000~ passmark score cpu running Winx64 with 8GB of RAM. The acceptable accuracy requirement is 95%, meaning that for a list of 100 URLs where 100 regular expressions are available in the corresponding source, at least 95 (or better) should be found and appended to 95 lines in the [login to view URL] file. The software will make use of large flat files with several million entries in [login to view URL], so should not have any issues either reading large [login to view URL] and appending to largely growing [login to view URL] files. The desired speed of the software, taking into account the 95% accuracy requirement as well as the internet and hardware specifications of its machine is approximately 1800 URLs/minute under typical web server speed conditions. The only difficulty in developing this software should be the treatment of slowly responding websites, unfound urls, and your discretion with how they are handled. Please take a moment to review the attached project resources that contain screen shots with the recommended GUI (user interface) of the software. For any questions regarding the project, feel free to PM me any time (will check them often) and I can provide additional contact information or simply answer any inquiries you have there. Thank you and good luck.
项目 ID: 1488942

关于此项目

9提案
远程项目
活跃12 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
9威客以平均价$152 USD来参与此工作竞价
用户头像
I am ready to deliver the project
$180 USD 在5天之内
4.9 (174条评论)
7.5
7.5
用户头像
I can do this for you. See PM for details.
$150 USD 在3天之内
5.0 (366条评论)
7.1
7.1
用户头像
Expert si here, Lets do it.
$150 USD 在5天之内
4.8 (66条评论)
6.8
6.8
用户头像
please check your pm.
$150 USD 在3天之内
5.0 (20条评论)
4.8
4.8
用户头像
Hi, I can do this. Plz check PMB. regards
$150 USD 在5天之内
5.0 (6条评论)
4.1
4.1
用户头像
Hello sir, I have already created similar software. Please reply if interested. Thank you.
$150 USD 在3天之内
5.0 (3条评论)
3.8
3.8
用户头像
Hi! I can make your project in 1 days, because i have similar application...
$150 USD 在1天之内
0.0 (0条评论)
0.0
0.0

关于客户

UNITED STATES的国旗
niles, United States
5.0
29
付款方式已验证
会员自2月 13, 2012起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。