What are hls__ nodes and why don't the original entries show up?

tl;dr: why doesn't a simple search for text I wrote turn up that text? My entire Logseq directory is only 297M, so I can't imagine it's too large to index. Am I hitting some misconfiguration?

Hi, I have a string of text, “visualizing”, that shows up in two places: in the filename of a PDF (under ../assets/) and in the content that originally embedded it. That content is a journal entry from about 18 months ago that links to the PDF and uses the title of the paper, which includes “visualizing”.

Just now I was confused that I couldn't find that content in Logseq, even though grepping my local filesystem does find the journal entry. When I search Logseq for “visualizing”, I see an auto-generated hls__ node (it seems to be derived from the PDF asset itself), but I do not see the journal entry.
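For anyone who wants to reproduce the filesystem side of this comparison, here is a minimal sketch of the check I did with grep. It assumes the graph is stored as Markdown files and that the auto-generated pages have filenames starting with hls__ (that is what I see on my disk; I don't know whether it holds for every setup). The function name `find_matches` and the command-line usage are just illustrative, not anything Logseq provides.

```python
import sys
from pathlib import Path


def find_matches(graph_dir, term):
    """Return (highlight_pages, other_files): Markdown files under graph_dir
    whose text contains term, compared case-insensitively."""
    needle = term.lower()
    highlight_pages, other_files = [], []
    for path in sorted(Path(graph_dir).rglob("*.md")):
        try:
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue  # unreadable file: skip it rather than abort the scan
        if needle in text.lower():
            # Assumption: auto-generated highlight pages are named hls__*.md
            if path.name.startswith("hls__"):
                highlight_pages.append(path)
            else:
                other_files.append(path)
    return highlight_pages, other_files


if __name__ == "__main__":
    # Usage (hypothetical script name): python find_matches.py <graph-dir> <term>
    hls, others = find_matches(sys.argv[1], sys.argv[2])
    print("hls__ pages:", *hls, sep="\n  ")
    print("other files:", *others, sep="\n  ")
```

Anything listed under "other files" that the in-app search does not return is a false negative of the kind described here.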

So my question is: why is the journal entry not showing up? This makes me worry that there's potentially a LOT of my content I've been missing due to false negatives like this one. (Less important question: what are these hls__ files, and can I just delete all of them?)
