Compile & Sort Data Extracted From Asp Website
$30-250 USD
In progress
Posted over 13 years ago
Paid on delivery
Hi there,
I am looking for someone to write a program for me. The program needs to scrape a horse racing webpage with a URL in the following format:
[login to view URL];id=2289726&dte=2010-09-20&npl=yes
I need the data scraped, extracted, sorted and output. In this example, the information I am looking for is:
Name of horse: Denman (IRE) - This won't change
Sire: Presenting (UK) - Sire will always be the first name & won't change
Dam: Polly Puttens (UK) - Dam will always be the second name & won't change
Current Trainer: P F Nicholls - This might change
I attach an example of how I require the data to be output, and the following should help you understand what I need. I will use the last race he ran, on 21 Apr 10, as my example:
Information under race details:
* "min" is the shortest distance at which the horse has run and won. It is derived from the number "25", which is in furlongs; I am looking for miles and furlongs. There are 8 furlongs in a mile, so this race is 3m 1f (see "r", "w" and "p" below for result details).
* "max" is the longest distance at which the horse has run and won (see "min" above and "r", "w" and "p" below for result details).
* "note 1", if the horse has won a race at course "che" then input cheltenham, otherwise leave it blank. This race was "Pun", so blank in this case
* "note 2", if the horse has won a race on heavy ground ("Hy") then input heavy, otherwise leave it blank. This race was "Gd", so blank in this case
* "Top W", this is the heaviest weight the horse has carried when winning a race. The weight for this race is 11-10 (this is stone & lbs)
* "Mark", this is the highest number under the "OR" heading when the horse has won a race
* "R", this is the number of runs a horse has had on this type of ground/ this course. In this case the ground is "Gd" and the course is "Pun", so this horse has run 8 times on good ground (the total for "Gd") and once at Pun (the total for "Pun").
* "W", this is the number of wins the horse has out of his runs on this ground/ this course. In this example the result is 4/11, so the horse did not win. A win is indicated by a 1 before the / under the heading "result".
* "P", this is the number of places the horse has out of his runs on this ground/ this course. In this example the horse did not place. A place counts if the horse finishes second in a race with 5 or more runners (e.g. 2/5, 2/6); second or third in a race with 8 or more runners (e.g. 3/8, 3/12); second, third or fourth in a race with 20 or more runners (e.g. 4/20, 4/28); or second, third, fourth or fifth in a race with 30 or more runners.
* "Succ" is the total of "w" and "p" divided by "r" and expressed as a percentage.
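As a rough illustration of the rules listed above, here is a minimal Python sketch of the distance conversion, the weight comparison, the place rule and the "Succ" percentage. The function names are made up for this example, and it assumes the finishing position and the number of runners have already been split out of the "result" column. It is not tied to any particular scraper.

```python
def furlongs_to_text(furlongs: int) -> str:
    """Convert a distance in furlongs to miles and furlongs (8 furlongs per mile)."""
    miles, rest = divmod(furlongs, 8)
    parts = []
    if miles:
        parts.append(f"{miles}m")
    if rest:
        parts.append(f"{rest}f")
    return " ".join(parts) or "0f"


def weight_to_lbs(weight: str) -> int:
    """Turn a 'stone-lbs' weight such as '11-10' into pounds (14 lbs per stone),
    so that the heaviest winning weight can be found with max()."""
    stone, lbs = weight.split("-")
    return int(stone) * 14 + int(lbs)


def is_place(position: int, runners: int) -> bool:
    """Apply the place rule from the posting: the last paying position
    depends on the number of runners; a win (position 1) is not a place."""
    if runners >= 30:
        last_place = 5
    elif runners >= 20:
        last_place = 4
    elif runners >= 8:
        last_place = 3
    elif runners >= 5:
        last_place = 2
    else:
        last_place = 1  # assumption: fewer than 5 runners means no places
    return 2 <= position <= last_place


def success_pct(wins: int, places: int, runs: int) -> float:
    """'Succ': wins plus places, divided by runs, as a percentage."""
    if runs == 0:
        return 0.0
    return round(100 * (wins + places) / runs, 1)


print(furlongs_to_text(25))   # 3m 1f
print(weight_to_lbs("11-10"))  # 164
print(is_place(4, 11))         # False
```

Converting weights to pounds before comparing avoids string comparisons such as "11-10" against "11-9" giving the wrong answer.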
I would also like you to include columns labelled "PU", "U" and "F" between columns "r" and "w". These should contain the totals for each type of ground and each racecourse. The values will be found under "result" and will always be to the left of the number of runners; see the third race, for example, where the result is expressed as U/6.
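One possible way to build these totals is sketched below in Python. It assumes each race has already been scraped into a dictionary with "course", "going" and "result" keys; those key names, the function names and the sample rows are invented for illustration and are not the horse's real record. The "P" column is left out to keep the sketch short.

```python
from collections import defaultdict

COLUMNS = ("R", "PU", "U", "F", "W")


def parse_result(result: str):
    """Split a result such as '4/11' or 'U/6' into (outcome, runners)."""
    outcome, runners = result.strip().split("/")
    return outcome.upper(), int(runners)


def tally(races, key):
    """Count runs, PU, U, F and wins, grouped by race[key] ('going' or 'course')."""
    table = defaultdict(lambda: dict.fromkeys(COLUMNS, 0))
    for race in races:
        outcome, _runners = parse_result(race["result"])
        row = table[race[key]]
        row["R"] += 1
        if outcome in ("PU", "U", "F"):
            row[outcome] += 1
        elif outcome == "1":
            row["W"] += 1
    return dict(table)


# Invented sample rows, only to show the shape of the data.
races = [
    {"course": "Pun", "going": "Gd", "result": "4/11"},
    {"course": "Che", "going": "Gd", "result": "U/6"},
    {"course": "Che", "going": "Hy", "result": "1/8"},
]

by_going = tally(races, "going")
by_course = tally(races, "course")
print(by_going["Gd"])   # {'R': 2, 'PU': 0, 'U': 1, 'F': 0, 'W': 0}
print(by_course["Che"])  # {'R': 2, 'PU': 0, 'U': 1, 'F': 0, 'W': 1}
```

The same function serves both the ground table and the course table, since only the grouping key changes.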
Finally, I would like this program to update itself and to give me the ability to input a URL for each horse I wish to add to this database.
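A minimal Python sketch of the "add a horse by URL" part might look like the following. It assumes the horse is identified by the `id` query parameter seen in the example URL format; the `example.com` address, the function names and the in-memory dictionary standing in for the database are all hypothetical.

```python
from urllib.parse import urlparse, parse_qs


def horse_id_from_url(url: str) -> str:
    """Pull the 'id' query parameter out of a horse page URL."""
    query = parse_qs(urlparse(url).query)
    ids = query.get("id")
    if not ids:
        raise ValueError(f"no id parameter in {url!r}")
    return ids[0]


def add_horse(watchlist: dict, url: str) -> bool:
    """Register a URL under its horse id; return False if it is already known."""
    horse_id = horse_id_from_url(url)
    if horse_id in watchlist:
        return False
    watchlist[horse_id] = url
    return True


watchlist = {}  # stand-in for the real database
url = "https://example.com/horse?id=2289726&dte=2010-09-20&npl=yes"  # hypothetical URL
first = add_horse(watchlist, url)
second = add_horse(watchlist, url)
print(first, second, list(watchlist))  # True False ['2289726']
```

An update run could then loop over the stored URLs and re-scrape each page; keying on the id keeps the same horse from being added twice.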
Thanks for taking the time to read this and I look forward to hearing from you. :)
Hi. We are highly experienced with crawlers and specialize in data-mining software that extracts a variety of information from different sources: yellow pages, Google, Craigslist, classified sites and others. Our price: $145 to code it. Please see PMB for details.