Find Jobs
Hire Freelancers

Python automated process to get content from URL's and put into XML or JSON

£20-250 GBP

已完成
已发布超过 10 年前

£20-250 GBP

货到付款
I have a big list of URL's which I need to run through so i can get content from the pages. This content is in different tags within the page, not all have classes or ids. I am looking for someone to create an automated service to do this so I can leave it running to complete. I want the automated process to load each page and get the content from particular html tags, then process some this content to ensure the html is correct, e.g. replace table content with paragraph tags making sure all text styling (italic, bold, etc) is kept. If possible I want to generate one or multiple batch XML or JSON files so I can import it into another system. I want this to be able to run off my local mac or a linux server, if worse comes to worse you can run it and provide me with the files. The list of URL's are to an external site. Ideally I would be looking for someone to create this with Python using BeautifulSoup and urllib.request. Only bid if you have experience with this sort of thing. Thanks
项目 ID: 5320937

关于此项目

5提案
远程项目
活跃10 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
I've done several scraping jobs with python and can probably do this project without a problem. I usually use python with lxml, but beautifulsoup might be better if the html is too corrupt. Please let me know the website and the details of the scraping job and I will update the bid/time accordingly.
£200 GBP 在3天之内
4.8 (5条评论)
3.0
3.0
5威客以平均价£275 GBP来参与此工作竞价
用户头像
Hello,I'm a PHP and Python developer. I've worked in scrappers before for freelancer.com employers. I've experience working with this kind of application. In my case I like to use content scrapper using XPATH to analyse the site, but in your case I can do it using Beautiful Soap. I've also create it using PHP. I've a lot of them then in PHP if you wish I can adapt one of them to work like you need, this option will be faster than create one from the scratch with python. But is up to you. The output can be on XML file, JSON file, or even I can upload the information to some mysql DB if you wish. I'm online and ready to chat about it or start to work. To start to work I will need some of the URL to check the content. Thanks
£111 GBP 在5天之内
4.5 (6条评论)
2.7
2.7
用户头像
A proposal has not yet been provided
£142 GBP 在3天之内
0.0 (0条评论)
0.0
0.0

关于客户

UNITED KINGDOM的国旗
Croydon, United Kingdom
5.0
33
付款方式已验证
会员自2月 8, 2011起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。