Scan pdf and read tables with OpenCV & Tesseract OCR

$250-750 USD

Completed
Posted over 5 years ago

Paid on delivery
Project Mission:
- Search and find all tables in a PDF.
- Convert images of tables from PDF (or other) to CSV-formatted tables.
- Input is mainly PDF, but OCR must be used because not all PDFs are formatted for parsing.
- Must be able to handle tables that span two pages (see standard bank report pg 12/13).

Requirements:
- OpenCV (Python)
- Tesseract v4

A set of images of PDFs will be provided. It's important not to optimize the solution for these specific tables. The solution must be generic and will be tested against other images of tables. It is a priority to handle regular tables with high precision. Pie charts and similar diagrams are a bonus.

Proposed steps:
1. Analyze images using OpenCV to determine table cells (rows and columns).
2. Slice the input image into multiple images based on cells.
3. Use Tesseract 4 to OCR text from each cell.
4. Output data to CSV.

Expected outcome:
- Conversion is at least 95% accurate on our test set. The test set contains standard tables but is not provided, to avoid overfitting.
- Docker image with all dependencies provided.
- Function / Script / API that takes an image and outputs a CSV table.

Readings / Links:
- Improving quality: [login to view URL]
- Finding text blocks in an image using OpenCV: [login to view URL]
- Table analysis using a histogram: [login to view URL]
- Docker OpenCV Image: [login to view URL]

Attached files: PDFs to convert
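The four proposed steps could be sketched roughly as follows. This is only an illustration under stated assumptions, not the solution that was delivered for this project: it assumes a deskewed page image whose table is fully ruled (every cell bordered by lines), it finds the rules with a simple projection histogram rather than a more robust contour or morphology pass, and it assumes the `pytesseract` wrapper is installed alongside Tesseract 4. All function names (`line_positions`, `find_cells`, `load_ink`, `ocr_cell`, `table_to_csv`, `image_to_csv`) and the `min_fill` and `margin` parameters are hypothetical.

```python
import csv
import io

import numpy as np


def line_positions(profile, length, min_fill=0.8):
    """Centres of the runs where the ink profile covers >= min_fill of the span."""
    hits = np.flatnonzero(profile >= min_fill * length)
    if hits.size == 0:
        return []
    # A ruled line is usually several pixels thick: group consecutive indices.
    runs = np.split(hits, np.flatnonzero(np.diff(hits) > 1) + 1)
    return [int(run.mean()) for run in runs]


def find_cells(ink, min_fill=0.8):
    """Step 1. ink is a 2-D bool array (True = dark pixel).

    Returns rows of (x0, y0, x1, y1) boxes, top to bottom, left to right.
    """
    height, width = ink.shape
    ys = line_positions(ink.sum(axis=1), width, min_fill)   # horizontal rules
    xs = line_positions(ink.sum(axis=0), height, min_fill)  # vertical rules
    return [
        [(x0, y0, x1, y1) for x0, x1 in zip(xs, xs[1:])]
        for y0, y1 in zip(ys, ys[1:])
    ]


def load_ink(path):
    """Load an image with OpenCV and binarize it with Otsu's threshold."""
    import cv2  # imported here so the grid logic above needs only NumPy

    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    if gray is None:
        raise FileNotFoundError(path)
    _, binary = cv2.threshold(
        gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU
    )
    return gray, binary > 0


def ocr_cell(gray, box, margin=3):
    """Steps 2 and 3. Crop one cell (minus its border) and OCR it."""
    import pytesseract  # assumed wrapper around the Tesseract binary

    x0, y0, x1, y1 = box
    crop = gray[y0 + margin:y1 - margin, x0 + margin:x1 - margin]
    # --psm 6 treats the crop as a single uniform block of text.
    return pytesseract.image_to_string(crop, config="--psm 6").strip()


def table_to_csv(rows_of_text):
    """Step 4. Serialize a list of rows to CSV text."""
    buf = io.StringIO()
    csv.writer(buf).writerows(rows_of_text)
    return buf.getvalue()


def image_to_csv(path):
    """Run all four steps on one page image."""
    gray, ink = load_ink(path)
    cells = find_cells(ink)
    return table_to_csv([[ocr_cell(gray, box) for box in row] for row in cells])
```

Under the same assumptions, a table that continues on a second page might be handled by running `find_cells` on each page image and concatenating the resulting rows when both pages yield the same number of columns. Borderless tables would need a different cell-detection step.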
Project ID: 18574010

About the project

6 proposals
Remote project
Active 5 years ago

Awarded to:
Hello sir. I have been working on web technologies for 6 years and studying machine learning for 2 years. I have worked with OCR and OpenCV on image reading and emotion detection projects. Ready to start your job. Please ping me if interested.
$500 USD in 10 days
4.3 (11 reviews)
3.6
6 freelancers are bidding on this job at an average of $1,101 USD
Hi there, I've read your project description and I am confident enough that I can handle this project according to your expectations. I have done similar projects before and I want to take over this project as well. If you're interested then please contact me to see my portfolio :) I'll be waiting for your response. Regards
$550 USD in 18 days
5.0 (9 reviews)
5.5
Hi there. Roaya is a startup based in Egypt and we are an official Odoo partner. We are ready to start working on your project. Please let us discuss the details. Regards, Mohammad Alaa
$2,000 USD in 20 days
4.9 (19 reviews)
5.0
I have already worked on OCR with PDFs and extracted text from them, so I can do your job within the time limit and to your satisfaction.
$1,888 USD in 30 days
4.1 (2 reviews)
2.9
Hi, dear. I am very interested in your project, 'Scan pdf and read tables with OpenCV & Tesseract OCR'. I've done this kind of project before. I'm a professional programmer with 12 years of experience. If you award me the project, I'll implement all of your requirements in a short time. Skills: Java, Machine Learning, Python, Software Development
$555 USD in 3 days
0.0 (0 reviews)
0.0
I'm a senior software developer with a very high personal standard for code quality and I pay attention to detail. I have been programming full-time for more than 10 years. Some of my experience is summarized below:
➢ Java 7 & 8 (6+ years experience)
  ▪ Android, Java EE (J2EE), J2ME, JSF, JSP, PhoneGap
  ▪ Gradle, Maven, Ant
  ▪ Spring, Hibernate, MyBatis, EJB
  ▪ Jboss/Wildfly, Tomcat, Weblogic
  ▪ TestNG, JUnit, Mockito
  ▪ Swagger, Dropwizard, JAXB, Axis2
➢ C# (.NET Core + Standard + Framework)
  ▪ Dapper & Entity Framework
  ▪ NUnit
➢ SQL (10+ years experience)
  ▪ MySQL, MSSQL, Stored Procedures
➢ Oracle (+- 1 year experience)
  ▪ PL SQL, Stored Procedures
➢ HTML (+HTML 5, 10+ years experience)
  ▪ JSON, JavaScript, CSS, AJAX, XML, YAML
➢ PHP (10+ years experience)
➢ C++ (3+ years experience)
➢ Pure C (2+ years experience)
➢ Cisco IOS (2+ years experience)
➢ Perl (2+ years experience)
➢ SH (10+ years experience)
➢ BASH (10+ years experience)
➢ Clarion (version 8 & version 10)
➢ Python
➢ VB (.NET)
➢ Delphi
➢ Assembly
I am very proficient with Linux/Unix, which I have used for more than 10 years with KDE, Gnome, Fluxbox and pure terminal. Flavours I have used include:
➢ Gentoo
➢ CentOS
➢ Debian
➢ Mint
➢ Kali + Backtrack 2 & 3
➢ RedHat
➢ (K)Ubuntu
➢ FreeBSD (UNIX)
➢ Knoppix
➢ Arch
➢ PHLAK
➢ OpenSUSE
➢ Fedora
➢ PCLinuxOS
among many others
$1,111 USD in 20 days
0.0 (0 reviews)
0.0

About the client

Sandton, South Africa
0.0
0
Payment method verified
Member since Jan 22, 2019

Client verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)