this post was submitted on 07 Jul 2025
348 points (100.0% liked)

Technology

72499 readers
3454 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 
top 18 comments
sorted by: hot top controversial new old
[–] [email protected] 2 points 9 hours ago* (last edited 9 hours ago) (1 children)

Seems like it screens out bots regardless of whether they use AI or are just traditional asshole-created bots.

[–] [email protected] 3 points 9 hours ago

It does, yes. This will prevent your content from being indexed, in most cases. The benefit is that your servers will be up. If your servers are down, they can't be indexed either

[–] [email protected] 12 points 17 hours ago (2 children)

Why does default config check Mozilla specifically?

{
"name": "generic-browser",
"user_agent_regex": "Mozilla",
"action": "CHALLENGE"
}

Guess that's why I've seen Anubis check screen quite a few times.

[–] [email protected] 3 points 10 hours ago* (last edited 10 hours ago) (2 children)

The creator of Anubis did an interview on the Selfhosted Show podcast a little while back and explains this in detail, and it’s worth a listen.

Here’s a time stamped link for the interview

[–] [email protected] 3 points 9 hours ago* (last edited 7 hours ago)

Great interview! The whole proof-of-work approach is fascinating, and reminds me of a very old email concept he mentions in passing, where an email server would only accept a msg if the sender agreed to pay like a dollar. Then the user would accept the msg, which would refund the dollar. So this would end up costing legitimate senders nothing but would require spammers to front way too much money to make email spamming affordable. In his version the sender must do a processor-intensive computation, which is fine at the volume legitimate senders use but prohibitive for spammers.

[–] [email protected] 2 points 9 hours ago
[–] [email protected] 19 points 16 hours ago (1 children)

Afaik, almost every browser uses "Mozilla/5.0" as part of the user agent, Mozilla mentions it as well in developer docs about User agents, it's a historical compatibility thing apparently.

[–] [email protected] 5 points 16 hours ago (1 children)

Interesting, thanks!

Guess it's the same kinda thing as amd64 on Intel lol

[–] [email protected] 10 points 15 hours ago* (last edited 15 hours ago) (2 children)

it's even stupider, it's more like why there is no windows 9 because of programs doing stuff like if os.name.startswith("windows 9") then print("this program is not compatible with windows 98") end

[–] [email protected] 4 points 14 hours ago (1 children)

Yup. There was a time when Mozilla was somewhat dominant, so browsers unlocked features based on the browser being Mozilla (as opposed to Internet Explorer).

[–] [email protected] 4 points 14 hours ago* (last edited 14 hours ago)

well if you want to get into it, i think the last browser that didn't have mozilla in the useragent was internet explorer, which had "trident/9.0" or something. every other browser on the market is based on the old KDE browser Konqueror, which had "khtml, like gecko" in it. when that didn't work they just added "mozilla" to it. then apple took that codebase and added "safari", chrome took that codebase and added "chrome", etc etc etc. compatibility problems just kept compounding on every browser based on khtml until we got to the point where microsoft edge's current user agent is Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/134.0.0.0 Safari/537.36 Edg/134.0.0.

even firefox has had to give in to this: my useragent is Mozilla/5.0 (X11; Linux x86_64; rv:140.0) Gecko/20100101 Firefox/140.0 even though the version of gecko in firefox 140 is v125, from 2022.

[–] [email protected] 1 points 15 hours ago
[–] [email protected] 4 points 16 hours ago* (last edited 16 hours ago)

My hero in shining armour (not a sarcasm, just a form of appreciation of someone who did what I would never have done)

[–] [email protected] 10 points 23 hours ago (1 children)

I use the lynx browser sometimes, for hacker news, some blogs that I follow, or just for a quick browse to find an answer.

The fact that more and more websites need to use this kind of protection is saddening me, since lynx doesn't support JavaScript.

That's just another reason why I fucking hate AI.

I don't hate it, I /fucking/ hate AI.

[–] [email protected] 4 points 23 hours ago

You can bypass it by changing the user agent to not include Mozilla in the beginning.

[–] [email protected] 110 points 1 day ago

Cool project. The closing slide was pretty funny

If you are working at an AI company, here's how you can sabotage Anubis development as easily and quickly as possible. So first is quit your job, second is work for Square Enix, and third is make absolute banger stuff for Final Fantasy XIV. That’s how you can sabotage this the best.

[–] [email protected] 2 points 1 day ago

Interesting. I clicked on a link here a couple weeks ago and was presented with this and wasn't really sure what it was. Thanks for sharing this! It seems like a good alternative.

[–] [email protected] 9 points 1 day ago

Is this the first seed of the Blackwall?