Find Jobs
Hire Freelancers

small PHP project: parse HTML files, write XML - repost

$30-250 USD

已取消
已发布超过 10 年前

$30-250 USD

货到付款
Hey PHP experts!\r\n\r\nWe need a PHP 5 program that parses HTML files, extracts specific content and writes it to a XML file. \r\nYou find as an attachment a zip, please have a look at the sample files.\r\n\r\nHere is a task description in pseudocode:\r\n\r\n// PHP5\r\nclass Utilities\r\n{\r\n \r\n public $pathToSourceDirectory = 'someSourceDirectory';\r\n public $pathToTargetDirectory = 'someTargetDirectory';\r\n public $nameXMLfile = 'newXMLFile';\r\n public $targetNode = "misc_texts";\r\n public $targetTag = 'body'\r\n public $ignoreDate:Boolean;\r\n \r\n $xmlFile = $pathToTargetDirectory.'/'.$nameXMLfile;\r\n if exists, open $xmlFile \r\n else create $xmlFile first \r\n \r\n write one or several methods that perform the following routine:\r\n \r\n loop through all files inside $pathToSourceDirectory and all its subdirectories\r\n if the file is a HTML file (any extension like .html || .htm || .HTML etc.)\r\n if date of the file is newer than date of $xmlFile || $ignoreDate == true \r\n open file\r\n \r\n parse it: loop through all the top-level tags (do not loop through children tags) \r\n if div does not have the class 'private'\r\n extract content of div\r\n write it to $xmlFile \r\n as a child of the $targetNode \r\n if node with this page name already exists (compare page name) replace content \r\n else add new node\r\n \r\n structure of the resulting $xmlFile:\r\n \r\n // $targetNode\r\n \r\n // path to file (replace slashes (/) with double underscore ('__') + filename (without extension)\r\n // id of first div\r\n content to be extracted\r\n ]]>\r\n \r\n // id of second div\r\n content to be extracted\r\n ]]>\r\n \r\n \r\n \r\n \r\n \r\n content of file B to be extracted\r\n ]]>\r\n \r\n \r\n \r\n \r\n \r\n \r\n \r\n \r\n \r\n save $xmlFile\r\n close all files\r\n return success or error\r\n \r\n}\r\n\r\nLooking forward to hear from you!\r\nAndreas
项目 ID: 4857670

关于此项目

9提案
远程项目
活跃11 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作

关于客户

SWITZERLAND的国旗
Zürich, Switzerland
5.0
31
付款方式已验证
会员自8月 24, 2013起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。