Anybody know anything about the following User Agent strings?
ReplicantReaderBot
: “Replicant” isn’t an entirely unique brand name. I hope this is unrelated to the Replicant LLM chatbots. If it is, is it used to train or is it just a client of the chatbots?ArenaBot/1.0 (+<https://arena.im/bot/;> contact@arena.im)
(page is a 404; is this used to train LLMs or does an LLM use this as a client to fetch data?)SocialBeeAgent
: again, used to train LLMs or a client of an LLM?Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_3_3 like Mac OS X; en-us) AppleWebKit/533.17.9 (KHTML, like Gecko) Version/5.0.2 Mobile/8J2 Safari/6533.18.5 Tencent/BrandProtection
. Does this obey robots.txt or am I gonna have to add another Nginx rule? I normally block brand-protection bots.
hellhound gayming
in reply to Seirdy • • •...could you give me a list of them, i haven't really thought about them that much but i don't want them on my site either
Frost, Wolffucker 🐺
in reply to hellhound gayming • • •Seirdy
in reply to Frost, Wolffucker 🐺 • • •process and set of actions that a right holder undertakes to prevent third parties from using its intellectual property without permission, as this may cause loss of revenue and, usually more importantly, destroys brand equity, reputation and trust
Contributors to Wikimedia projects (Wikimedia Foundation, Inc.)Rocky 🏳️⚧️
in reply to Seirdy • • •Seirdy
in reply to Rocky 🏳️⚧️ • • •