Parse old HTML files of quotes and store in MSSQL database
$10-30 CAD
Completed
Posted over 10 years ago
Paid on delivery
I have over 200 HTML files that need to be parsed. They are very old and do not use closing tags, but I did find a pattern: the text I want is between <hr> tags.
Here is an example of one of the 2,000+ entries I need to parse.
<HR><BR><BR>
November 13, 2005
<BR><BR>
TEXT TEXT TEXT
<BR><BR>
Author Name<BR>
<A HREF="[login to view URL]">
Biography</A>
<BR><BR><HR>
Here is what I am trying to achieve (I have attached a sample of the HTML):
- Parse the HTML documents in a local folder (about 200 files)
- Look for the content between the <hr> tags. The very first quote is between a <blockquote> tag and the first <hr> tag; the rest are between <hr> tags.
- Grab the date (it's in a consistent format of MMMM dd, yyyy)
- If there is an HTML link, get it (sometimes there isn't one)
- Somehow get the author name.
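The steps listed above could be sketched roughly as follows in Python. This is only a minimal sketch under several assumptions, not a finished solution: the names `parse_quotes` and `parse_folder` are hypothetical, the files are assumed to be readable as Latin-1, and the author is guessed to be the last line of text in each entry once the link has been removed. Real files may need extra rules.

```python
import html
import re
from datetime import datetime
from pathlib import Path

HR_SPLIT = re.compile(r"<hr\b[^>]*>", re.IGNORECASE)
BLOCKQUOTE_OPEN = re.compile(r"<blockquote\b[^>]*>", re.IGNORECASE)
BR_RUN = re.compile(r"(?:<br\b[^>]*>\s*)+", re.IGNORECASE)
TAG = re.compile(r"<[^>]+>")
# href value may be double-quoted, single-quoted, or unquoted in old markup
LINK = re.compile(
    r"""<a\b[^>]*?href\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s>]+))""",
    re.IGNORECASE,
)
# the whole anchor element; tolerates a missing </a>
ANCHOR = re.compile(r"<a\b[^>]*>.*?(?:</a\s*>|$)", re.IGNORECASE | re.DOTALL)


def parse_date(text):
    """Return a date for 'November 13, 2005' style text, else None."""
    try:
        return datetime.strptime(text, "%B %d, %Y").date()
    except ValueError:
        return None


def parse_quotes(markup):
    """Split on <hr> and pull date, text, author and link from each chunk."""
    entries = []
    for chunk in HR_SPLIT.split(markup):
        # The first quote follows a <blockquote> tag: keep only what comes after it.
        chunk = BLOCKQUOTE_OPEN.split(chunk)[-1]
        match = LINK.search(chunk)
        link = None
        if match:
            link = html.unescape(next(g for g in match.groups() if g is not None))
        body = ANCHOR.sub("", chunk)  # drop the link and its "Biography" label
        parts = [
            " ".join(html.unescape(TAG.sub(" ", p)).split())
            for p in BR_RUN.split(body)
        ]
        parts = [p for p in parts if p]
        hit = next(
            ((i, parse_date(p)) for i, p in enumerate(parts) if parse_date(p)),
            None,
        )
        if hit is None:
            continue  # no date, so this chunk is not a quote entry
        index, when = hit
        rest = parts[index + 1:]
        if not rest:
            continue
        # Assumption: the last text line is the author, everything before it is the quote.
        author = rest[-1] if len(rest) > 1 else None
        text = " ".join(rest[:-1]) if len(rest) > 1 else rest[0]
        entries.append({"date": when, "text": text, "author": author, "link": link})
    return entries


def parse_folder(folder):
    """Parse every .htm/.html file in a folder (encoding is an assumption)."""
    results = []
    for path in sorted(Path(folder).glob("*.htm*")):
        results.extend(parse_quotes(path.read_text(encoding="latin-1")))
    return results
```

Splitting on the tags with regular expressions, rather than using a strict HTML parser, is a deliberate choice here because the files have no closing tags. Chunks without a recognizable date are skipped, so page headers and footers should fall away on their own.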
Once the data is collected, it needs to be stored in a database.
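The storage step might look something like the sketch below. To keep it self-contained it uses Python's built-in `sqlite3` as a stand-in, not MSSQL; a DB-API driver for SQL Server would follow a similar pattern, though the column types would need adjusting. The table name `quotes`, its columns, and the function `store_quotes` are all hypothetical, and each entry is assumed to be a dict with `date`, `text`, `author` and `link` keys.

```python
import sqlite3
from datetime import date


def store_quotes(conn, quotes):
    """Insert parsed entries into a (hypothetical) quotes table; return the row count."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS quotes (
            id         INTEGER PRIMARY KEY,
            quote_date TEXT NOT NULL,  -- ISO format, e.g. 2005-11-13
            quote_text TEXT NOT NULL,
            author     TEXT,           -- may be missing
            link       TEXT            -- may be missing
        )
        """
    )
    rows = [
        (q["date"].isoformat(), q["text"], q["author"], q["link"])
        for q in quotes
    ]
    # Parameter placeholders keep quote text with apostrophes from breaking the SQL.
    conn.executemany(
        "INSERT INTO quotes (quote_date, quote_text, author, link) VALUES (?, ?, ?, ?)",
        rows,
    )
    conn.commit()
    return len(rows)


# Example with an in-memory database and one made-up entry.
example_conn = sqlite3.connect(":memory:")
store_quotes(
    example_conn,
    [{"date": date(2005, 11, 13), "text": "TEXT TEXT TEXT",
      "author": "Author Name", "link": None}],
)
```

Storing the date in ISO format keeps it sortable as plain text; in SQL Server a proper date column would be the more natural choice.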
I can do this job for you and deliver it within 6 hours.
I will deliver a Windows desktop application so that you can run it on your computer and do this yourself.
PS: The job is a bit complex, hence my bid reflects a reasonable rate for the work required.
Thanks for considering me.
Best,
Larry
Hi sir, I am an expert in regex and I can do this task very easily. Please award this project to me for a 100% professional result.
No satisfaction == no money!
Hi
I have read the project description and wanted to confirm that we can do this.
We'll extract the data and put it in a MySQL database.
If you want to test us, you can provide a few sample files and we'll do them for free.
As we are new on freelancer.com, we are flexible in all respects. You do not need to pay us until the whole project is complete and you are satisfied with the results.
I look forward to discussing this further via Skype.
Looking forward to a quick and positive response.
Regards
Sam D.
OweBest