26
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 01 Aug 2024
26 points (93.3% liked)
Free Open-Source Artificial Intelligence
2873 readers
27 users here now
Welcome to Free Open-Source Artificial Intelligence!
We are a community dedicated to forwarding the availability and access to:
Free Open Source Artificial Intelligence (F.O.S.A.I.)
More AI Communities
LLM Leaderboards
Developer Resources
GitHub Projects
FOSAI Time Capsule
- The Internet is Healing
- General Resources
- FOSAI Welcome Message
- FOSAI Crash Course
- FOSAI Nexus Resource Hub
- FOSAI LLM Guide
founded 1 year ago
MODERATORS
Haven't heard of all-in-one solutions, but once you have a recording, whisper.cpp can do the transcription:
The underlying Whisper models are MIT.
Then you can use any LLM inference engine, e.g. llama.cpp, and ask the model of your choice to summarise the transcript:
You can also write a small bash/python script to make the process a bit more automatic.
Okay, the idea is excellent, but I've just spent the last hour trying to get any app out there to record my calls.
I've tried the open source one on f droid, and it almost works. I can get it to record my side, but that's it.
I tried commercial ones. I tried commercial ones with horrendous privacy policies. Nothing seems to work.
I've used the accessibility options. I've gone deep down into the rabbit hole, so it looks like Android is fully cutting off the ability to record calls. In Australia at least.
What a shame.
These apps all seem to have the same ability of dropping the recording into a folder, so I could synchronize that across my network, have my server check for new files that appear into that folder, and then the LLM could convert that into a text file and send it straight back to me.
Living the dream! But... Not
I expected that recording would be the hard part.
I think some of the open-source ones should work if your phone is rooted?
I've heard that Google's phone app can record calls (though it says it aloud when starting the recording). Of course, it wouldn't work if Google thinks it shouldn't in your region.
By the way, Bluetooth headphones can have both speakers and a microphone. And Android can't tell a peripheral device what it should or shouldn't do with audio streams. Sounds like a fun DIY project if you're into it, or maybe somebody sells these already.
I was just thinking that, a raspberry pi working as a Bluetooth peripheral and some python code would work?