进行中

Write a program to auto catch data from a site.

Hi there.

I need someone to write a program that can auto catch data from this site : [url removed, login to view]

The program is what I [url removed, login to view] the data.

Look at the category list in the right site first.

I need the program can auto catch some categories' data.

Some of them are under [恋愛・少女漫画], from [ちはやふる] to [すきっていいなよ。].

Rest fo them are under [ファンタジー漫画], from [終りのセラフ] to [クレイモア].

*Check the attachment named “category”

You have to output data in the following way:

Data Structure:

id----order

url----URL of the article

cat----category

title----title

magazine----magazine

author----author

genre----genre

character----character

site----goal website

article----the text

entry_data_at----publish time

created_at----catch time

picture----the cover the article

*Check check the attachment named “tip1,tip2,tip3”

Explanation:

[cat],means the name of [url removed, login to view] the category list in the right [url removed, login to view] can see words like [ちはやふる] and [黒崎くんの言いなりになんてならない],they are categorise.

[title],check the category [ちはやふる],turns to a new page,you can see words such as [ちはやふる33巻173首のネタバレ感想] or [ちはやふる33巻172首のネタバレ感想], they are titles.

[article],check one title like [ちはやふる33巻173首のネタバレ感想],turns to a new page,you can see an article with lot of [url removed, login to view] have to catch the body which from the title(ちはやふる33巻173首のネタバレ感想) to the end of the article (end at the place above [目次][コメント] and advertisements).

[entry_data_at],means the publish time of the articel,for example,the publish time of ちはやふる33巻173首のネタバレ感想 is the one written under the title - 2016/10/[url removed, login to view] have to record it by using timestamp,which would turn 2016/10/01 into 1451577600.

[url],means the url of the article,like [url removed, login to view]

[site],all write as [url removed, login to view]

[character],for example,

[url removed, login to view],

under advertisements,there is a [目次] [url removed, login to view] can see [33巻173首] write in black and has no [url removed, login to view]'s the [character]

About [author],[magazine],[genre],[picutre],[id] and [created_at],you should do the following step first.

Search [cat] in [url removed, login to view],use the first result.

For example,search [ちはやふる] in [url removed, login to view],you can get:

作家:末次由紀

雑誌・レーベル: BE・LOVE

ジャンル: スポーツ / 少女マンガ / アニメ化 / 映画化

So,

[author],means the words after [作家:]. In the example the [author] is [末次由紀].

[magazine],means the words after [雑誌・レーベル:], In the example the [magazine] is [BE・LOVE].

[genre],means the words after [genre:],need to use "," to separate them. In the example the [genre] is [スポーツ,少女マンガ,アニメ化,映画化].

[pitucre],the cover of the first [url removed, login to view] have to catch covers and store [url removed, login to view] the datebase there should add a data bar of [pictuer] and have url of each cover.

[id],means the order, the first one is 1, the second one is 2, etc.

[created_at],means the time you catch the article,also have to record by using timestamp. For example,if I catch the date on UTC/GMT+08:00 2016/10/11 14:40:30, so the [created_at] should be 1476168030.

Use [ちはやふる] as the example, do what I said,you can get:

id:1

url:[url removed, login to view]

cat:ちはやふる

title:ちはやふる33巻173首のネタバレ感想

magazine:BE・LOVE

author:末次由紀

genre:スポーツ,少女マンガ,アニメ化,映画化

character:33巻173首

site:[url removed, login to view]

article:<h1 class="entry-title">..........

picture: <..>.

entry_data_at:1451577600

created_at:1476168030

*Check the explanation named “database sample”.

This is what I [url removed, login to view] have to make the program to catch data in this way to make my server can recognize the data.

Need to catch data 2 hours one [url removed, login to view] to send me the program you write to catch data.

Because all I need is a [url removed, login to view] the budget is 300 USD.

Tap 113114 in your bid.

技能: 数据输入, 日语, MySQL, PHP, 软件构架

查看更多: write program auto fill, write simple data entry program, site data search, write data entry program, program auto data, step step writing java search engine program, program auto search, software write mq4, software write chip epson, useful software write book, program auto vote site, software write web specs, free software write user guide, software write edid, free software write company profile, program auto data forum, software write websites idea, software write book images, software write books, auto data input program, software write protection, free software write book, software write book, site data gathering, software write protect software

About the Employer:
( 25 reviews ) kojima, China

项目ID: #11783965

已悬赏给:

gangabass

First of all I know about "113114". I'm expert in web scraping with HUNDREDS of completed projects and excellent reviews that's why I'm sure you'll be impressed with my work. I can create such scraping program fo 更多

$294 USD 在3天内
(309条评论)
6.7

11名威客为此工作的平均竞标价是$361

dboyzhang

Hello, I am able to create the scraper. The first time, the scraper will need to get all the articles on the site and the associated data from cmoa.jp. Then in the future, will it just need to get new data?

$300 USD 在7天内
(325条评论)
7.3
Marie1234

A proposal has not yet been provided

$250 USD 在3天内
(154条评论)
6.8
hmshafeeq

113114 =========================================================================== Hi, I am interested in your project and would like to offer you my service for this project. I have already worked on similar proj 更多

$333 USD 在7天内
(14条评论)
5.9
xielessupport

I can make your project a great success. I'm 31 year old talented PHP and open-source developer. I have 9 years of experience in Server Administration and Web Application Development. I'm expert in web application cust 更多

$331 USD 在10天内
(23条评论)
5.2
miracitech37

Hi I have read your job description extremely carefully , so now don’t need to worry we will give PROFESSIONAL work in MINIMUM PRICE and I am absolutely sure that our team can do the job very well but I have couple of 更多

$555 USD 在10天内
(10条评论)
5.2
staragent

Hello. I read your post. I am a manager of ITCS. We have a strong sense of responsibility , we are perfectionist and very punctual. You will benefit when you work with us. I will be looking forward to keep in touc 更多

$333 USD 在3天内
(4条评论)
4.7
ZhangDa

113114 Nice to meet you. I see you know Timestamp, I guess you are also a developer yourself? because other people don't use such format time usually. Then do you care about the programming language? The framewo 更多

$300 USD 在7天内
(7条评论)
3.7
vnprofl

Hello. I am a professional scraping bot maker. I have created several scraping bots for my employers. Please see my profile i just completed some similar projects. You will get your project done with the best result if 更多

$750 USD 在3天内
(7条评论)
3.2
WTechvision

Hello , I am AutoCAD Drafter and Programmer with 7 years of experience in designing and developing applications for AutoCAD using Autodesk API's. Great experience on AutoLISP & Visual LISP Programming and custom 更多

$273 USD 在10天内
(6条评论)
1.8
Appiqo

We could be the best fit for this job since we possess: 1. Resources having 8+ year experience working within domain. 2. Free post delivery support of 60 days 3. Flexible timing throughout development tenure 4. Ini 更多

$555 USD 在10天内
(1条评论)
0.0
FreeLancerBonny

I have over 10 years of experience in e-commerce websites in software and e-commerce development I have done website grabbing for different companies from world leading food portal and world number 1 ecommerce webs 更多

$333 USD 在5天内
(0条评论)
0.0
srinuloveswork

I can do the work and deliver the program to you in the given time regards srinivad

$444 USD 在8天内
(0条评论)
0.0