Opensource

2408 readers

94 users here now

A community for discussion about open source software! Ask questions, share knowledge, share news, or post interesting stuff related to it!

Credits

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

⠀

founded 2 years ago

MODERATORS

pylapp@programming.dev

Google open-sourced its watermarking tool for AI-generated text (www.theverge.com)

submitted 5 months ago by BrikoX@lemmy.zip to c/opensource@programming.dev

3 comments fedilink hide all child comments

It’s available via Google’s Responsible Generative AI Toolkit.

Repo: https://github.com/google-deepmind/synthid-text

all 4 comments

sorted by: hot top controversial new old

[+] sam@feddit.org 5 points 5 months ago* (last edited 5 months ago) (2 children)

[deleted]

[–] sukhmel@programming.dev 5 points 5 months ago

It says 'as a large language model' in the beginning, and 'sincerely' in the end

[–] RonSijm@programming.dev 5 points 5 months ago (1 children)

It gives an example:

For example, with the phrase “My favorite tropical fruits are __.” The LLM might start completing the sentence with the tokens “mango,” “lychee,” “papaya,” or “durian,” and each token is given a probability score. When there’s a range of different tokens to choose from, SynthID can adjust the probability score of each predicted token, in cases where it won’t compromise the quality, accuracy and creativity of the output.

So I suppose with a larger text, if all lists of things are "LLM Sorted", it's an indicator.

That's probably not the only thing, if it can detect a bunch of these indicators, there's a higher likelihood it's LLM text

[–] wolfyvegan@slrpnk.net 1 points 3 days ago

Lychee can grow at tropical latitudes, but it needs hot (rainier) summers and (drier) winters w/ 50-150 hours at 0-12°C in order to fruit well, so it's more of a subtropical fruit.