Find Jobs
Hire Freelancers

Bot Needed to Extract Email & Save as CSV File w/Existing Data

$250-750 USD

已取消
已发布超过 11 年前

$250-750 USD

货到付款
Read everything below to fully understand the project. Do not bid until you have read in full. Personal messages along with the bid will be given extra attention. If a requested feature drastically increases the price, mention how much it is with and without it so that I can correctly compare your bids to the others... During the process, it is very important that we stay in contact with one another. Thanks, Steve OUTLINE I need a program that I can run on Windows to extract email addresses from URL in an existing CSV file and save the results into the same file which contains other data. CSV has this column structure: A- URL B- Email C- Company D- Contact E- Address F- Phone EXAMPLE DATABASE [login to view URL] FEATURES - I need these; .com, .co, .net, .biz, .us - Use comma if more than one email found. - Nulti-threading which can be adjusted by the user (1-30) - Must load data into database (ie: sqlite) for scraping. There are times where I will use this for 100 URL’s and times where I will want to use it for 100k URLs. So it is important that the results be saved either in the CSV or DB in case of a loss of internet or PC restart. - Must be able to read URLs in this format; http, www, and [login to view URL] - Scrape email in source code and screen scrape (for email that is output with JavaScript). If this increases price, let me know. FUNCTION The program will pull the URL (which I can always make column A), scrape the website for email and post the results into the Email column (column B). The program needs to have three scraping modes to help with speed. Do not scrape external URL’s or redirects. 1) Slow - Full scan of entire website (50 URL max) 2) Medium - Scrape only the links found on the initial landing page and stop scraping after 30 URL's 3) Fastest - Scrape only these pages; landing page, contact-us, contact, contactus, about, about-us, aboutus, staff. If these pages have extensions (php, jsp, htm, apx, html, etc), that means that case does matter. So we also have to have Contact-us, Contact, Contactus, About, About-us, Aboutus, Staff, ContactUs, Contact-Us, About-Us. And sometimes, the "contact" page is a folder such as [login to view URL] (max 15 domains) I will use as many threads as I can, and run all URL’s in ‘Fastest’ mode. Then, if there are domains that do not have URL’s, I will run it in Slow or Medium (since it will take longer). One GUI where I will select the file, watch the process, and if possible, specify the URL/time limit for each option (Slow, Medium, Fastest). If that increases the price, let me know. I may later decide that it is better to have a time limit instead of URL limit and will want the ability to change this without rewriting the program. The program will save the results into a new CSV file which defaults to the original file name with the word RESULTS added to the end of it. If it cannot default to the original file name, it should call itself [login to view URL] Since many websites have forms, it would be nice to know this so that I do not continue trying to process those. Maybe the program can detect the <form> code and put FORM in column B so that I can skip those and keep it for my records. DEMO I will want to test this along the way. The demo you provide will need the ability to test at least 50-100 URL’s. It’s much harder to get a good idea of performance with a smaller list. SOURCE CODE I want the source code once the project has been completed. As long as you are available, I will continue to work with you if changes are needed, but if you are unable to be reached, I will need to take it to someone else to receive help. SUPPORT Two-weeks of support once the project is finalized. There are emails that will be missed so revisions will be needed.
项目 ID: 2336622

关于此项目

8提案
远程项目
活跃12 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
8威客以平均价$500 USD来参与此工作竞价
用户头像
We can help in your project, please check PMB and our ratings/reviews to get idea of our experience.
$250 USD 在7天之内
4.8 (239条评论)
7.8
7.8
用户头像
....................
$250 USD 在0天之内
5.0 (87条评论)
6.5
6.5
用户头像
HI, Kindly see details in PMB Thank you
$550 USD 在4天之内
5.0 (39条评论)
6.1
6.1
用户头像
If you are looking for an expert - I am the person for the job. Please check your PMB.
$650 USD 在7天之内
5.0 (13条评论)
5.0
5.0
用户头像
Please see PMB.
$700 USD 在10天之内
5.0 (11条评论)
4.8
4.8
用户头像
Please check your private messages.
$600 USD 在7天之内
5.0 (1条评论)
2.0
2.0
用户头像
Custom software development - <b><i>Removed by Admin</i></b>
$750 USD 在1天之内
0.0 (0条评论)
0.0
0.0
用户头像
Please check the PMB
$250 USD 在1天之内
0.0 (0条评论)
0.0
0.0

关于客户

UNITED STATES的国旗
Lexington, United States
5.0
45
付款方式已验证
会员自4月 6, 2011起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。