Extract bookmarks from PDF files using Matlab?
How can I extract all the bookmarks from a PDF file? These bookmarks are usually the headings of the PDF. Thank you.
4 Comments
Christopher Creutzig
on 18 Oct 2023
Those “bookmarks” form a tree structure, with chapters and sections, and include (hyperlink) targets in the PDF. What kind of output would be useful for what you are trying to do, a flat vector of strings, or do you need the nesting information? Do you need the link targets?
(Not saying I have a solution for any of these, but it would help in trying to answer your question.)
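To make the two output shapes concrete, here is a minimal sketch of the tree structure described above, written in Python only for illustration. It does not read a PDF: the `Bookmark` class, the `page` field standing in for a link target, and the sample outline are all hypothetical, and a real outline would have to come from a third-party PDF library.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class Bookmark:
    """One outline entry (hypothetical model, not a library type)."""
    title: str
    page: Optional[int] = None  # assumed link target: a 1-based page number
    children: List["Bookmark"] = field(default_factory=list)


def walk(nodes: List[Bookmark], depth: int = 0) -> Iterator[Tuple[int, Bookmark]]:
    """Yield (depth, bookmark) pairs depth-first, in document order."""
    for node in nodes:
        yield depth, node
        yield from walk(node.children, depth + 1)


def flat_titles(nodes: List[Bookmark]) -> List[str]:
    """Flat vector of strings: the nesting information is discarded."""
    return [bm.title for _, bm in walk(nodes)]


def indented_titles(nodes: List[Bookmark]) -> List[str]:
    """Same order, but the nesting level is kept as two spaces per level."""
    return ["  " * depth + bm.title for depth, bm in walk(nodes)]


# Made-up outline, for illustration only.
outline = [
    Bookmark("1 Introduction", 1),
    Bookmark("2 Methods", 3, [
        Bookmark("2.1 Data", 3),
        Bookmark("2.2 Model", 5),
    ]),
]

print(flat_titles(outline))
print("\n".join(indented_titles(outline)))
```

The flat list is enough if only the heading text is needed; keeping the depth (or the `children` lists themselves) preserves the chapter/section relationship, and the `page` field is where a link target would go if that is needed too.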
dpb
on 18 Oct 2023
@Christopher Creutzig, if you're particularly interested in or knowledgeable about PDF file interaction, you might find <another recent question> of some interest.
Answers (1)
dpb
on 15 Oct 2023
Moved: dpb
on 15 Oct 2023
High-level MATLAB functions such as extractFileText, pdfinfo, and readPDFFormData in the <Text Analytics Toolbox> don't return the bookmarks, so you'll need a third-party PDF toolset to do that. There are some, like the <itext bookmark example>, that rely on code in a DLL; to use one of those from MATLAB you would have to write a MEX wrapper in your language of choice.