Hi, (posting this yet again to all threads related to pdf annotations)
I was very frustrated by Logseq’s lack of PDF annotation import, so I forked pdfannots to create a proof of concept that imports PDF annotations. Here’s the link.
You can use it like so:
python ./pdfannots.py -f md_and_edn path_to_pdf -o path_to_pages_markdown --edn_output path_to_asset_edn
This will create the markdown annotation file (in the pages folder) as well as the edn file containing the metadata of the highlights (in the assets folder).
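If you don't want to type the three paths by hand each time, here is a rough Python sketch that builds the same command from a graph folder and a PDF. The flags are the ones shown above. The function names are made up for illustration. The output file names (hls__<name>.md in pages, <name>.edn in assets) are my assumption about what Logseq expects, so adjust them if your graph uses something else. The sketch also assumes pdfannots.py sits in the current directory.

```python
from __future__ import annotations

import subprocess
import sys
from pathlib import Path


def build_command(pdf: Path, graph: Path,
                  script: Path = Path("./pdfannots.py")) -> list[str]:
    """Build the argument list for the fork's command line (hypothetical helper)."""
    stem = pdf.stem
    # Assumption: Logseq reads hls__<name>.md from pages/ and <name>.edn from assets/.
    md_out = graph / "pages" / f"hls__{stem}.md"
    edn_out = graph / "assets" / f"{stem}.edn"
    return [
        sys.executable, str(script),
        "-f", "md_and_edn",
        str(pdf),
        "-o", str(md_out),
        "--edn_output", str(edn_out),
    ]


def import_annotations(pdf: Path, graph: Path) -> None:
    """Run the fork on one PDF; raises CalledProcessError if the script fails."""
    subprocess.run(build_command(pdf, graph), check=True)
```

For example, import_annotations(Path("my_graph/assets/paper.pdf"), Path("my_graph")) would run the script on paper.pdf and write both output files inside my_graph.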
This is still early: it does not yet parse highlight colors, and it only supports simple text highlights (no shapes, no rectangles, etc.). Don’t hesitate to help in the PR as I’m not a pro at this!