Calibre can create a full text index to search through everything (well, for files that actually contain the text, and it needs a lot of space).
Selfhosted
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
-
Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
-
No spam posting.
-
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
-
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
-
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
-
No trolling.
Resources:
- selfh.st Newsletter and index of selfhosted software and apps
- awesome-selfhosted software
- awesome-sysadmin resources
- Self-Hosted Podcast from Jupiter Broadcasting
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
I had the same idea a while back and was wondering why no one has implemented something like this yet. This seems like an actual useful application for LLMs.
I am using Zotero (Citation Management Software) to collect scientific Articles I have read. Sometimes I forget in which Article I read about something specific. A search, where you could describe what you are looking for in a sentence, which then returns the Article with the relevant part, would be a gamechanger.
What you are looking for is a RAG and is one of the few legitimately useful implementations of LLMs outside the wall of hype.
Thanks for the link! Learned something new today.
nice, that is exactly what I'm looking for - thanks : )
I've had the idea for a while to use an LLM to gather metadata about books for me as well as generate tag lists for themes, plot, writing style, etc for everything in my ebook library. You could also generate non spoiler plot summaries and produce recommendations for similar books.
A search, where you could describe what you are looking for in a sentence, which then returns the Article with the relevant part, would be a gamechanger.
Yeah, exactly that
Paperlessngx will store pdfs and index their contents for searching. It's not necessarily meant for books but I think it would work.
Keep the actual epubs and search in those? (You basically want a transcript of a read out book...which is the book itself.)
He wants all of his books in one index.
And I think they want a solution that'll index audio books, too. An LLM that'll listen to, transcribed, and index audio books.
Audiobooks have a simple workaround if you can find a version of the book in text format to download, just index that.
Sure. I'm just saying, I think OP is looking for something that doesn't require either buying the book again or pirating it.
yeah I'm only talking about ebooks - I just mentioned audiobookshelf because it can also do ebooks and I've read here that people use it as a ebook management thing