Computer vision: Write a document scanner in Python 3

The job is to write a Python 3 module that implements the following algorithm:

1. Read a .png or .jpeg image of a document or letter.

2. Find contours of the letter, similar to what is shown in this article:

[url removed, login to view]

However, instead of assuming that the letter is rectangular, this step should be more sophisticated: It should find better contours, like this:

[url removed, login to view]

3. Unskew the letter into a rectangular hull. It is NOT ok to just add white space; instead, the whole document should be "undistorted" by this step. For example, the original image may look like this, and the distortion-correction should still work:

[url removed, login to view]

4. If needed, correct the text angle like this:

[url removed, login to view]

5. Apply OCR / create a searchable PDF.

Other requirements:

- The original colors should be retained in the result.

- The Python 3 API should be similar to the following:

import paperscan as ps

with open('[url removed, login to view]') as fp:

img = [url removed, login to view](fp)

hull = [url removed, login to view](img)

page = [url removed, login to view](img, hull)

angle = ps.text_angle(img, page)

page = ps.fix_angle(img, angle)

text = [url removed, login to view](page)

with open('[url removed, login to view]', 'w') as fp:

ps.write_pdf(fp, page)

Kemahiran: Grafik Komputer, Python, Pembangunan Perisian

Lihat lagi: can write computer engineering technical report, write computer technology, write computer repair page, code scanner document java twain, write computer science survey paper, write computer articles, transcriptionist transcribe recordings send word document write book, technical document write web service, read data word document file write data powerpoint vb net asp net projects, python script file write telit, embed website javascript document write, document write unescape decrypt online, document write unescape decrypter, document write unescape decrypt, document write unescape decode, document write email font size, write computer repair bid, applet scanner document, python telit file write, document write extjs

Tentang Majikan:
( 14 ulasan ) Stuttgart, Germany

ID Projek: #11555619

Dianugerahkan kepada:


Hello, I have worked on several computer vision problems like salient part detection in images, face recognition, text recognizer etc Please ping if you are interested in working with me. thanks,

€200 EUR dalam 5 hari
(18 Ulasan)

3 pekerja bebas membida secara purata €146 untuk pekerjaan ini

€155 EUR dalam 2 hari
(4 Ulasan)
€83 EUR dalam 3 hari
(0 Ulasan)