Update an already existing web scraping tool and gather data for Startups database. It needs to gather data from known sites

€150-200 EUR

进行中

已发布

将近 4 年前

€150-200 EUR

货到付款

1st of all - apologies for the change of budget, the project description is totally different - it is just a modification/upgrade of an existing scraper, not a dev of a new one :) NOT A BIG PROJECT, BUT AN INTERESTING ONE FOR SURE :) THE WEBSCRAPER IS ALREADY DEVELOPED ACCORDING TO THE INSTRUCTIONS BELOW, BUT NEEDS TO BE UPGRADED (GUI-UX & SOME FUNCTIONS) (you will find the dev files and documentation in the attached zip file). You can also check the project out on GitHub ([login to view URL]). We need it to upgrade it so that it can adapt to the changes of all the websites it needs to scrape data from and also we want to add this website : www (dot) startupblink (dot) com web scraping tool and gather data for Startups database. It needs to gather data from known sites (more info in attached documents): Web scraper should be capable to gather basic info as a lead, such as Startup name and some contact information. Possibly startup description and logo Web scraper should accept an URL parameter (where to scrap for the data) and depth level (how deep scraper should dig, e.g. how many sub-links, sub-sections per URL and whether should scraper go outside specified URL, e.g. follow external links) In later stage, same web scraper should be capable to be configured to search for additional leads other than startups - such as: investment entities, service providers, etc... Background and strategic fit. All scraped info should be saved into two databases (startups max 3 years old), other companies. There should be a simple way to convert the DBs into CSV files. This web scraping tool should be configured in such a way that admins can insert starting URL and define what are they looking for, among, for example: startups, investment entities, service providers, etc... as well as list of data they are looking for, such as: company(startup) name, contact data, descriptions, and/or other properties. From tech perspective, the tools should use some already made Web Scraper, regardless of it's tech. stack... There are some pretty cool Java, Python and Node based web Scrapers. From tech perspective, it must be easily deployable tool not requiring some additional server resources or specific infrastructure stack which would create an overhead. Basically, what ever can be run from a container or similar environment could work for us, for as long as it is not resource-hungry and cost a lot when operating. When scraping tool is started, it should find required data from specified URL, then check do we already have found data in our databases, and if not, it should save it into our Startup database Assumptions 1 Starting URL As an operator I want to be able to input starting point (URL) for web scraping MUST HAVE Operator inputs starting URL for scraping 2 Search params As an operator I want to be able to input parameters I am looking for MUST HAVE Operator inputs what type of data, properties is looking for, such as: startup name, startup contact data, startup descriptions, startup logo The params should be added dynamically because they will vary from URL to URL Each searching param should accept multiple selectors... On some websites Startup name is titled as "startup name" while on others as "company name" or just "name"... We need to be able to define multiple params names and group them into single title. 3 Depth level As an operator I want to be able to input the depth level for my starting URL MUST HAVE Operator can select depth level for scraping, choosing from dropdown with values "1, 2, 3, 4, 5, any" defining how deep scraper should dig the starting URL 4 Follow External links As an operator I want to be able to choose whether my scraping tool should follow any external links from my starting URL MUST HAVE Operator choose Yes or No User interaction and design The tool need to have very simple interface for the operators and it requires authorization before the tool can be used.

项目 ID: 25625798

关于此项目

11提案

远程项目

活跃4 年前

想赚点钱吗？

电子邮箱地址

在Freelancer上竞价的好处

设定您的预算和时间范围

为您的工作获得报酬

简要概述您的提案

免费注册和竞标工作

11威客以平均价€344 EUR来参与此工作竞价

@PKonstiantyn

@Hello!@ I have read your description and understand your idea. I have also checked your attachment files. Web scrapping and auto script are my favorite skill. I did so many scrapping and auto script projects using python selenium, beautiful soap and panda. I can satisfy your all requirements perfectly. I am sure I can offer good result and fast delivery because I have good experience in this field. I can show u my previous works if u want. I don't bid on any projects which I can not do. your job is suitable for my skill. let's contact to discuss in detail. best regard!

€500 EUR 在7天之内

4.9

(18条评论)

6.5

@vladzolotukhin

Hello. As a web scraping and data mining expert by python and node.js, selenium web driver, I am glad to place the bid on your project. I have experienced LinkedIn profile scraping, amazon products scraping and ticket pricing tracking and so on. I want to discuss more via chat. Regards. Vladimir

€400 EUR 在7天之内

4.8

(24条评论)

5.2

@havrentiy

Hi, there. I have read your description carefully. I am very interested in your web scraper updating project. I have rich experience with web scraping using Python and PHP. Looking forward to hearing from you. Best wishes.

€175 EUR 在3天之内

5.0

(5条评论)

3.6

@Sunilbg1

I have been working with Software Developer for more than 08 years. I’ve come to know that you are looking for a software developer expert who knows the work very well. I want to let you know that I can fulfill your requirements properly as I’ve the experience working in this sector. I’ll be able to complete your work in time without making any mistakes. I have also checked your time schedule and I can ensure you that time won’t hamper your work. I’ve gathered experiences over the years. So I don’t think you will regret it if you consider me for this job. If you like to know about my skills and experiences then visit my profile and read the reviews given by the old clients. Hoping that I’ll also be able to satisfy you. Also please check my portfolio that is 100% similar to your job posting. So, give it a thought and I’m eagerly looking forward to working with you. Please call me for the interview if you would like me to give a chance. I am available in any kind of communication software to make this project successful. Thanks Sunil BG

€600 EUR 在7天之内

1.0

(1条评论)

1.3

@muhammadsufyan85

Hi sir, I'm a Professional Person in this field, i can do this job efficiently. Why me ? Strict Confidentiality I won't provide client's data to any one You will get me The work will not be outsourced to anyone - Unlimited revision until acceptance Quick response I will complete my work within deadlines

€250 EUR 在4天之内

0.0

(0条评论)

0.0

@Ashokomkar

I am interested for this job

€556 EUR 在25天之内

0.0

(0条评论)

0.0

@AnujMB

I will send you a set of lecture slides and notes and you will need to summarize and make them into concise notes.

€194 EUR 在10天之内

0.0

(0条评论)

0.0

@ramsinghrawat184

Hi, there. I have read your description carefully. I am very interested in your web scraper updating project. I have rich experience with web scraping using Python Looking forward to hearing from you. Best wishes.

€175 EUR 在7天之内