Does not handle glyphless fonts, as used by tesseract

Bug #1830473 reported by Julian Andres Klode
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Poppler
Fix Released
Unknown
poppler (Ubuntu)
Fix Released
Undecided
Julian Andres Klode

Bug Description

[Impact]
PDF documents OCRed with tesseract show black boxes when selecting text

[Test case]
Run ocrmypdf on a pdf, open it in evince, select some text. You should not see black boxes.

[Regression potential]
The patch is minimal invasive and only changes the behavior for the tesseract glyphless font.

[Other info]
Upstream is unhelpfully refusing to merge the patch in https://gitlab.freedesktop.org/poppler/poppler/merge_requests/208 because it's not abstract enough for them (as it only handles tesseract).

Changed in poppler (Ubuntu):
status: New → In Progress
assignee: nobody → Julian Andres Klode (juliank)
description: updated
Changed in poppler:
status: Unknown → New
Revision history for this message
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package poppler - 0.76.1-0ubuntu3

---------------
poppler (0.76.1-0ubuntu3) eoan; urgency=medium

  * d/p/glyphless-font.patch: Support Tesseract's glyphless font (LP: #1830473)

 -- Julian Andres Klode <email address hidden> Sat, 25 May 2019 12:31:14 +0200

Changed in poppler (Ubuntu):
status: In Progress → Fix Released
Changed in poppler:
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.