this post was submitted on 24 Jul 2024

53 points (90.8% liked)

Selfhosted

49410 readers

1041 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

selfh.st Newsletter and index of selfhosted software and apps
awesome-selfhosted software
awesome-sysadmin resources
Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago

MODERATORS

HybridSarcasm@lemmy.world

HybridSarcasm@lemmy.hybridsarcasm.xyz

Uses for local AI? (lemmy.mtate.me.uk)

submitted 11 months ago by smeeps@lemmy.mtate.me.uk to c/selfhosted@lemmy.world

55 comments fedilink hide all child comments

Im using Ollama on my server with the WebUI. It has no GPU so its not quick to reply but not too slow either.

Im thinking about removing the VM as i just dont use it, are there any good uses or integrations into other apps that might convince me to keep it?

top 50 comments

sorted by: hot top controversial new old

[–] RandomLegend@lemmy.dbzer0.com 33 points 11 months ago (1 children)

It's a tool like any other. If you don't have any usecase for it, just don't use it.

I use it to summarize release notes and generate some minor descriptions for generic stuff in my TTRPG campaigns.

[–] DrinkMonkey@lemmy.ca 14 points 11 months ago (1 children)

generate some minor descriptions for generic stuff in my TTRPG campaigns.

Need a quick 200 word description of the interior of an apothecary? Or a band of marauding orcs? It’s been a huge time saver for me.

[–] RandomLegend@lemmy.dbzer0.com 7 points 11 months ago

Yup, never had to usw "Random NPC Merchant No. 14" again.

[–] yesman@lemmy.world 25 points 11 months ago (3 children)

Think of LLMs like a stupid office worker. You wouldn't rely on them to make critical decisions, but they're valuable for tedious stuff.

For example, my calendar changed the way to enter new events breaking my workflow. Now I just type out a skeletal schedule and have LLM convert that into a .csv that I import.

I'm thinking of Ripping my CD collection again. I'm researching a way to use a LLM to tidy up the metadata.

I had a folder full of random stuff I've saved for years. Had a LLM organize and categorize it for me. I had to tweak the prompt enough that this was a medium difficulty task, but still way easier than doing it manually.

[–] domi@lemmy.secnd.me 3 points 11 months ago (1 children)

I'm thinking of Ripping my CD collection again. I'm researching a way to use a LLM to tidy up the metadata.

If you ever figure out how to use AI to determine the genre(s) of a song, let me know. Have been looking for something like that for quite a while.

[–] BuccaneerScientist@discuss.tchncs.de 3 points 11 months ago (1 children)

Nextcloud Recognize is supposed to do that, but I haven't tried it. You might try looking down that road.

[–] domi@lemmy.secnd.me 3 points 11 months ago

Thanks for the tip! I took a look and it seems like Recognize uses this: https://github.com/jordipons/musicnn

Last update was 4 years ago but will give it a try this weekend.

[–] andreas@lemmy.kfed.org 3 points 11 months ago

for the metadata, LLMs may not prove so great. Use MusicBrainz Picard or Beets

[–] miau@lemmy.sdf.org 2 points 11 months ago

Can you share some info on how you did that folder organization? Did you provide the AI with a list of files?

[–] WeLoveCastingSpellz@lemmy.dbzer0.com 14 points 11 months ago (2 children)

playing dnd alone is pretty cool

[–] RandomLegend@lemmy.dbzer0.com 7 points 11 months ago (4 children)

Any model recommendation for that?

The ones i tried get stuck in a loop at some point due to the small context windows.

[–] 1rre@discuss.tchncs.de 2 points 11 months ago (2 children)

Yeah even gpt4o couldn't keep track of encounters, run battles etc. in my case...

I think if you wanted to do it mechanically consistently you'd probably need to integrate it into a vtt where you give it context and potentially fine-tune it to give quest related summaries & gming rather than just "stuff"

[–] Bluesheep@lemmy.world 2 points 11 months ago

I don’t know how tech savvy you are, but I’m assuming since your on lemmy it’s pretty good :)

The way we’ve solved this sort of problem in the office is by using the LLM’s JSON response, and a prompt that essentially keeps a set of JSON objects alongside the actual chat response.

In the DND example, this would be a set character sheets that get returned every response but only changed when the narrative changes them. More expensive, and needing a larger context window, but reasonably effective.

[–] RandomLegend@lemmy.dbzer0.com 1 points 11 months ago

VTT integration would be one hell of a job to do.

load more comments (3 replies)

load more comments (1 replies)

[–] pe1uca@lemmy.pe1uca.dev 9 points 11 months ago (1 children)

I've used it to summarize long articles, news posts, or videos when the title/thumbnail looks interesting but I'm not sure if it's worth the 10+ minutes to read/watch.
There are other solutions, like a dedicated summarizer, but I've investigated into them and they only extract exact quotes from the original text, an LLM can also paraphrase making the summary a bit more informative IMO.
(For example, one article mentioned a quote from an expert talking about a company, the summarizer only extracted the quote and the flow of the summary made me believe the company said it, but the LLM properly stated the quote came from the expert)

This project https://github.com/goniszewski/grimoire has in it's road map a way to connect to an AI to summarize the bookmarks you make and generate at 3 tags.
I've seen the code, I don't remember what the exact status of the integration.

Also I have a few models dedicated for coding, so I've also asked a few pieces of code and configurations to just get started on a project, nothing too complicated.

[–] VeryNiiiice@sh.itjust.works 4 points 11 months ago (2 children)

Which one do you use to summerize videos?

[–] AnUnusualRelic@lemmy.world 4 points 11 months ago (1 children)

Does it work with porn videos?

[–] maniel@sopuli.xyz 1 points 11 months ago* (last edited 11 months ago)

asking the important question, but yeah, the plot is essential in porn

[–] pe1uca@lemmy.pe1uca.dev 1 points 11 months ago

Well, it's a bit of a pipeline, I use a custom project to have an API to be able to send files or urls to summarize videos.
With yt-dlp I can get the video and transcribe it with fast whisper (https://github.com/SYSTRAN/faster-whisper), then the transcription is sent to the LLM to actually make the summary.

I've been meaning to publish the code, but it's embedded in a personal project, so I need to take the time to isolate it '^_^

[–] thirdBreakfast@lemmy.world 5 points 11 months ago (1 children)

I use the Continue VS Code plugin with Ollama to use a couple of different models (deepseek-coder-v2 & starcoder2) to recreate a local only Github Copilot type experience for coding. This is on an M1 Apple Silicon though. For autocomplete the generation needs to be pretty brisk - I'm not sure how that would go in a VM without a GPU.

[–] Amongussussyballs100@sh.itjust.works 2 points 11 months ago (1 children)

How well does the M1 chip keep up? What size models are you running with it? Interested in getting an M1 laptop and I am curious.

load more comments (1 replies)

[–] Banthex@feddit.org 4 points 11 months ago

https://github.com/hendkai/paperless_sort_low_quality_ollama let ai tag your paperless ngx files base on content.

[–] bizarroland@fedia.io 4 points 11 months ago (2 children)

I have a 4070 sitting around collecting dust that I got from a trade, I've been thinking about setting it up with whispr and TTS and having a way to talk to my house.

I have a couple of smart home integrations, mostly air conditioning, light switches, security, and doors.

What I would like would be to have a few speakers on the walls that can talk to my server where I can say something like, hey computer, turn on the lights in the dining room and the lights in the dining room would turn on without transmitting that information to Google or Amazon.

[–] possiblylinux127@lemmy.zip 2 points 11 months ago

I am really curious if you can get the traditional smart functionality along with a LLM. Maybe have some sort of keyword the prompts the AI. You also could write a custom generated system prompt that includes the weather, time and any other information

[–] umami_wasbi@lemmy.ml 2 points 11 months ago (1 children)

You can try integrating with SEPIA. Not that I used it befote but it surely looks promising.

[–] bizarroland@fedia.io 1 points 11 months ago

Wonderful. I'll check it out. Thank you!

[–] hendrik@palaver.p3x.de 4 points 11 months ago* (last edited 11 months ago)

Roleplay (text adventures), a (stupid but occasionally funny) dungeon master, translation and help with creativity. These are the use cases I found. If you don't need that, you might get rid of it.

[–] andreas@lemmy.kfed.org 3 points 11 months ago

I use local AI for coding (more recently) and ML Photo storage facial recognition and security camera object detection (been using the later 2 for years now actually, don't want that kind of info out on someone else's cloud training on my images)

[–] just_another_person@lemmy.world 3 points 11 months ago

None

[–] slazer2au@lemmy.world 3 points 11 months ago (1 children)

Wanting answers to things you don't want google to know that you don't know.

[–] dwindling7373@feddit.it 3 points 11 months ago (2 children)

There are a huge number of vastly better solutions to get that...

[–] umami_wasbi@lemmy.ml 3 points 11 months ago (6 children)

IMO LLMs are ok to get a head start of searching. Like got a vague idea of something but don't know the exact keywords. LLMs can help and use the output on whatever search engine you like. This saves a lots of time tinkering the right keywords.

load more comments (6 replies)

[–] Aquila@sh.itjust.works 1 points 11 months ago (2 children)

Such as…?

[–] dwindling7373@feddit.it 7 points 11 months ago

A privacy respecting search engine.

[–] AustralianSimon@lemmy.world 1 points 11 months ago

Duckduckgo or SearX

[–] possiblylinux127@lemmy.zip 1 points 11 months ago

I use it for everything. I did move to ollama to podman and I replaced webUI with Alpaca

load more comments