this post was submitted on 20 Mar 2025
143 points (100.0% liked)


I made a Lemmy instance with a custom algorithm that keeps only the 20% most unique (= interesting?) posts. It works by calculating a similarity score between every post on my instance and all posts that came before it. The 80% of posts that are most similar to earlier posts get removed instantly.
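For readers wondering what such a filter could look like, here is a minimal sketch. It is not the instance's actual algorithm, which has not been published. It assumes word-set Jaccard similarity as the score and ranks a whole batch of posts at once instead of removing them as they arrive. The function names and the `keep_fraction` parameter are made up for illustration.

```python
import re


def tokens(text):
    """Lowercase word set used as a crude fingerprint of a post (assumption)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets, in the range [0, 1]."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def max_similarity_to_earlier(posts):
    """For each post (oldest first), the highest similarity to any earlier post."""
    seen = []
    scores = []
    for text in posts:
        toks = tokens(text)
        # The first post has nothing before it, so it scores 0.0.
        scores.append(max((jaccard(toks, prev) for prev in seen), default=0.0))
        seen.append(toks)
    return scores


def keep_most_unique(posts, keep_fraction=0.2):
    """Indices of the keep_fraction of posts least similar to earlier posts."""
    if not posts:
        return []
    scores = max_similarity_to_earlier(posts)
    n_keep = max(1, round(len(posts) * keep_fraction))
    # Lowest similarity first; ties go to the older post.
    ranked = sorted(range(len(posts)), key=lambda i: (scores[i], i))
    return sorted(ranked[:n_keep])
```

Calling `keep_most_unique(titles)` on a list of post titles ordered oldest first returns the indices to keep; everything else would be a removal candidate. A real deployment would probably need a better similarity measure (for example embeddings) and a running threshold so that posts can be judged as they come in.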

The idea is that this lets me cut through the noise running through the communities, similar to what xkcd-signal attempted to do 20 years ago.

The instance is mostly meant for reading, not posting. So it has a very open federation policy (for now).

If anything, this is experimental. So please let me know what you think! You can see the type of stuff that gets removed in the modlog (https://lemmy.coffee/modlog).

[–] ptz@dubvee.org 17 points 2 days ago* (last edited 2 days ago) (1 children)

Interesting. One of my instance's guiding philosophies is "Quality over Quantity". I've taken different steps toward achieving that (defederate from the Reddit repost instances, disallow pretty much all content bots, manually/locally mod duplicate posts, etc).

Do you plan to publish your algorithm/filter? Would be interested in seeing if it could be tuned and possibly reduce some of the workload for me.

[–] coffeeadmin@lemmy.coffee 10 points 2 days ago (1 children)

> Do you plan to publish your algorithm/filter?

In an ideal world sure. But I'd have to think about that some more, because in principle I don't want people to game it :)

[–] gkaklas@lemmy.zip 11 points 2 days ago (1 children)

Lemmy's license is AGPL, so you would need to at least publish changes to Lemmy itself 😉

(I don't know if e.g. the code for the algorithm is separate, in order to have a closed source algorithm with an open source Lemmy fork)

[–] HK65@sopuli.xyz 2 points 2 days ago (1 children)

Does GPL/AGPL require you to publish the code even if you are not selling the software? As in I could run a library computer with my custom Linux distro without giving anyone the source, but I wouldn't be able to publish it or sell it only as binary blobs, right?

[–] QuazarOmega@lemy.lol 7 points 2 days ago (1 children)

Selling is outside the scope of the licence; you can do whatever you want with monetisation, be it free or paid-for.
But anyone who uses your software, locally in the case of the GPL, or locally or through a remote service in the case of the AGPL, has the right to request a copy of the source code from you, and you are obligated to provide it.

[–] HK65@sopuli.xyz 2 points 2 days ago