Find Jobs
Hire Freelancers

Back-end scraper for a price comparison website

$250-750 USD

已完成
已发布大约 11 年前

$250-750 USD

货到付款
1) I have already developed a desktop based application using Scrapy/ Python that is hard coded to crawl to three separate sites (using three "spiders") that can pull out product details such as Product ID, Title, Price, Vendor and Stock Position. At present, these details are used to generate .sql files that need to be uploaded to the web server to update the Product Table in the database. 2) The current requirement is to develop a Server version of the scraper. The expected features are as under:- a) The Products Table in the server database to be automatically populated by the scraper. The required fields are Product ID, Title, Price, Vendor, Stock Position, Payment Options, Delivery Time b) Easy extensibility (with some python coding) to add more sites in future. c) To meet the above, the scraper to be implemented as two modules. The "Scraper Module" and the "Parameter Module". d) The "Scraper Module" would do the actual scraping of multiple sites (based on parameters read from the Parameters Module), and also automatically populate the Products Table in the database server. For sites with content rendered in JavaScript, Scrapy to be used with Selenium for effective scraping. e) The "Parameters Module" would include a Form through which scrape parameters such as the primary URL, scraping rules for each field to be scraped, format of data to be extracted, and whether to use simple crawl (for sites without JavaScript) or complex crawl (for sites with content rendered in JavaScript). These parameters would be stored in a table, and accessed by the "Scraper Module" at run time. f) The scraped URLs (referred by the primary URL) to be saved in a Database Table with "processed flag", so that these can be skipped if scraping needs to be resumed after interruption. g) Primary URLs also to be saved with the date of last successful scraping, to enable scheduling of periodic repeat scrapings. h) While executing scraping, only those fields that have changed since last scrape are to be extracted and the original table entry for the product to be "updated", as required. In case of new products, the details to be "inserted" as a new row in the Products Table. i) Scrapy to be used with Selenium for effective scraping of sites with heavy JavaScript content. j) Performance must be adequate to enable scraping of the sites in order to generate the Products database Expected Skills: Web Scraping, Scrapy, Selenium, Python, Data Mining, Javascript, MySQL Budget: USD 200 to USD 300
项目 ID: 4223516

关于此项目

7提案
远程项目
活跃11 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
Hi, I have written many previous python scrapers, and I know how to do this job.
$250 USD 在5天之内
4.9 (6条评论)
4.3
4.3
7威客以平均价$336 USD来参与此工作竞价
用户头像
Hi sir, please check PM, thx Kimi.
$250 USD 在5天之内
4.9 (92条评论)
6.3
6.3
用户头像
Scraping Experts Here. Check the message and contact us. Scraping samples are also attached.
$300 USD 在14天之内
5.0 (5条评论)
5.9
5.9
用户头像
Hi, Ready to start your work. Eagerly awaiting for your positive reply. Please check your inbox for further details. Thanks, Shaik.
$250 USD 在5天之内
5.0 (26条评论)
5.1
5.1
用户头像
Hi, you can connect scrapy with django, the other things are really easy, i can do it!
$600 USD 在7天之内
5.0 (1条评论)
3.0
3.0
用户头像
I'd already done such type of work before for US parks & recreation. Please let me know If I can get the oppertunity further. Thanks Vikas Choudhary
$500 USD 在2天之内
5.0 (1条评论)
2.2
2.2
用户头像
work will be successful
$250 USD 在5天之内
4.7 (2条评论)
0.4
0.4
用户头像
Best work guaranteed. Please check PM.
$250 USD 在10天之内
0.0 (0条评论)
0.0
0.0

关于客户

INDIA的国旗
Trivandrum, India
5.0
1
付款方式已验证
会员自2月 8, 2013起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。