Find Jobs
Hire Freelancers

Codez un logiciel

$1500-3000 USD

已完成
已发布超过 9 年前

$1500-3000 USD

货到付款
We search developers with solid expertise in OCR, Tesseract or other open source software. We are the creators of [login to view URL], a platform for independant comic artists that want to show their work to the world. We have created a unique feature > a collaborative, crowdsourced interface for authors to have their comics translated by fans! Thanks to this feature, some comics have been translated in over 15 languages by fans! We now want to improve this interface by automating a step that is currently done manually in 3 phases : 1/ selecting text areas by creating rectangles on top of them 2/ emptying the text areas 3/ then writing the translation in the text area. Milestone 1, texte zone recognition : The software starts by receiving one parameter- the path to the jpg/png file > a full comic page The process analyses the image, and searches for zones of text inside balloons (or speech bubbles). Minimum 4 letters words, All other text is ignored. The software returns the coordinates of the zones it has found. The zones’s coordinates are given in a text file, one line per zone. The line is X,Y,width,height values are numbers in pixels, X,Y is the top-left corner of the rectangle, 0,0 being top left of the image. Exemple: 56,48,350,220 (a 350x200 pixel rectangle with position at X=56, Y=48) The target performance is 1 second for an image The target success rate is at least 95% of text zones being detected properly and can’t fall under 80%. During the test phase, the results will be checked by humans. There won’t be any user interface, this program will communicate with [login to view URL] API and we will do the rest of the process. Milestone 2, OCR : Using an image with only text, selected and extracted by the first part given as parameter for the check of the milescore, but won’t be needed like this when the project is complete The process Analyses the image part, and recognizes the text. Outputs the text in a text file. Encoding in UTF-8 Text must be on one line. if found line returns, writes \n (an actual backslash and a “n”) The target performance is 0.3 second for an image. Milestone 3, completion : Using the full image and the rectangle informations, the soft will make the following process for each part of the image independently: Analyses the image part, and recognizes the text (like Milestone 2) Write the text in a text file, on one line. Begin the line by the size of the letters and a coma. Example : “45,Hello!\nHow are you?” Thus, each rectangle will have it’s line in the text file. We can use the same text file and encode it with the rectangle area, then text. Exemple: 56,48,350,220,45,Hello!\nHow are you? (X, Y, Width, Height, FontSize, Text) The target success rate is at least 95% of text zones being detected properly and can’t fall under 80%. During the test phase, the results will be checked by humans. There won’t be any user interface, this program will communicate with [login to view URL] API and we will do the rest of the process. Performances and APIs If launching the soft can be a little slow, it can have a mode where it’ll work on an entire directory of images. Amilova code will run the program telling what images/dir to do, then will read the created text files. Technical requirements : The software(s) will need to work on linux command-line. You can use the language of your choice, with a pre-validation from us. We’ll want the source code. **** Please bid only if you have solid experience in OCR software. We believe that the 3 parts can be done independently, so if your skills match only one part please state it explicitly. If we could see successful OCR projects in your freelancer history, that would be a huge plus. Prepare a list of the similar projects you’ve been working on so you can present your work with practical cases.
项目 ID: 7041181

关于此项目

11提案
远程项目
活跃9 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
Hello, thank you for invitation to the project I am ready to do it. I am a C++/Python developer working in image processing team in neurosoft.pl. Could you tell me if you have any expectations according to programming language. I can do all 3 parts and I prefer python to do it, the main reason is that Python is multiplatform and has a great support for image processing techniques. Regards, Marek I attach the text region detection samples: [login to view URL] the algorithm can be improved this is the very first version. In case of payment we can negotiate, please send me also some more images so I can test my algorithm
$2,000 USD 在22天之内
4.6 (7条评论)
4.9
4.9
11威客以平均价$2,641 USD来参与此工作竞价
用户头像
A proposal has not yet been provided
$3,157 USD 在30天之内
5.0 (1条评论)
5.4
5.4
用户头像
Hi, I have checked attached 3 samples. If the all text is similar to the samples, it's not problem to OCR the text. But I am wondering which complex font would be coming. I have experience in Tesseract OCR engine through several projects. One of them was extracting document types in single pdf files. Since I am sure I can handle all three milestones, I would like to talk You for further details. Thanks.
$3,157 USD 在30天之内
4.8 (7条评论)
5.7
5.7
用户头像
A proposal has not yet been provided
$2,368 USD 在30天之内
4.3 (2条评论)
5.6
5.6
用户头像
A proposal has not yet been provided
$4,210 USD 在30天之内
4.7 (3条评论)
5.4
5.4
用户头像
Hello I am an expert of image processing. Before I made program about Invoice PDF OCR and de-CAPTCHA. Also I made ALPR program. If you need to require my demo, I will support it. I can this job perfectly. IF you want me, please leave message. Sincerely
$2,500 USD 在20天之内
5.0 (8条评论)
3.9
3.9

关于客户

BULGARIA的国旗
SOFIA, Bulgaria
5.0
5
会员自1月 17, 2008起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。