Use accessibility tags embedded in PDF #1671

yurydelendik · 2012-05-08T19:53:03Z

Per conversation with Marco Zehe:

If a PDF file has accessibility tags embedded, such as for headings, tables, form fields, graphics with alternative text etc., try to use them in the text layer

RyanEdwardHall · 2013-06-14T17:31:55Z

I would really like to see something happen here, we have a blind user at work that took a look at pdf.js output and said it's unreadable with NVDA or JAWS. I understand this a huge challenge given how the div's have to be positioned for text selection, but as the issue stands, visually impaired users are unable to use the library because the dom lacks semantic features

trjohnst · 2018-03-15T22:44:38Z

Our team is also looking for a very similar feature. It would be great if such a thing was baked into PDF.js to handle semantic markup such as headings, lists, tables, and image. I know PDF's have such information encoded in their file format, but am unsure of the specifics of if they can be translated to semantic markup.

@RyanEdwardHall if the elements contained within the text layer have appropriate resetting applied to them via CSS, would that alleviate concerns around the positioning of the DOM elements?

somascope · 2019-11-29T23:40:45Z

I too would like such a feature for document structure tagging for accessible PDFs. If it's of helpful reference, these is Adobe's Acrobat page for doing this in software, which possible provides some insight.

cuhaller · 2020-01-27T23:41:01Z

I've also been looking for a solution to this for a while now and came across this article, https://engineering.linkedin.com/blog/2019/04/under-the-hood--learning-with-documents, outlining how LinkedIn solved this issue using PDF.js for the PDFs uploaded to LinkedIn Learning. I contacted the authors to learn more and they initially seemed open to contribute their edits back, but I haven't heard back after following up and offering to pay for their efforts. Guess they got busy with other projects.

trjohnst · 2020-01-28T00:36:09Z

@cuhaller that was the very project I was working on that needed accessible tagging. The PDF format when created correctly is surprisingly close to basic HTML elements that aid page readers. I've since left so, unfortunately, do not have access to any of the code. I will check in with the dev that worked on that component.

cuhaller · 2020-02-03T22:51:19Z

@trjohnst thanks, much appreciated, let us know what you find out.

jcsteh · 2020-04-16T23:43:20Z

This issue and #6269 are duplicates. I'd suggest this should be closed as a duplicate of #6269, as the latter has more technical details.

timvandermeij closed this as completed Apr 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use accessibility tags embedded in PDF #1671

Use accessibility tags embedded in PDF #1671

yurydelendik commented May 8, 2012

RyanEdwardHall commented Jun 14, 2013

trjohnst commented Mar 15, 2018 •

edited

somascope commented Nov 29, 2019

cuhaller commented Jan 27, 2020

trjohnst commented Jan 28, 2020

cuhaller commented Feb 3, 2020

jcsteh commented Apr 16, 2020

Use accessibility tags embedded in PDF #1671

Use accessibility tags embedded in PDF #1671

Comments

yurydelendik commented May 8, 2012

RyanEdwardHall commented Jun 14, 2013

trjohnst commented Mar 15, 2018 • edited

somascope commented Nov 29, 2019

cuhaller commented Jan 27, 2020

trjohnst commented Jan 28, 2020

cuhaller commented Feb 3, 2020

jcsteh commented Apr 16, 2020

trjohnst commented Mar 15, 2018 •

edited