My commentary:
This is a 1659 page pdf 1 URL per line document, here is where hexbear appears in illustrious context:
onlinecasinorank-kh.com
verkorkst-kreativ-shop.de
demellierlondon.com
www.aprokosailor.com
gabriel.by
hexbear.net
shop.simplefunforkids.com
vdownload-16.sb-cd.com
images.cnwomen.com.cn:80
ftp.pigwa.net
cdn-legacy.iclrs.org
Original post cross-posted from: https://lemmy.ml/post/34374494
Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther
Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.
Full article here.
Link to the full leaked list download: Meta leaked list pdf
Chat GPT starting to type in Maoist Standard English
There was a thread ages ago someone had it do that and it was darn near perfect.
There were probably multiple such threads.
If Facebooks Ai starts to call for Unlimited Genocide on the First World im taking credit
Maybe posters can save the world from ai after all
All the people who think badposting is bad are in shambles right now. Meta AI is being ruined by Beanis and there’s nothing you can do about it
I am going to badpost with renewed vinegar
Apple cider or Black?
Zhenjiang baybeeee
Nice
Beanis has escaped containment
It has been 10,001 days and Beanis saved the world from fascism.
Can someone ask the meta chatbot about beanis and see what it says?
Yo if you think beans are winning over the other thousands of lines, it could only be due to not care enough to click even 1 or 2.
I think this PDF is my new favorite way of browsing the web. Of the 10 links above which I didn’t even visit all 10 yet:
I mean look at these people
I think I am getting rate-limited for uploading too many stupid pics but luckily my last screenshot was of plain text so I guess I’ll just copy/paste it directly:
__ _ _ _ / _| | (_) | | | |_| |_ _ __ _ __ _ __ ___ ____ _ _ __ ___| |_ | _| __| '_ \ | '_ \| |/ _` \ \ /\ / / _` | | '_ \ / _ \ __| | | | |_| |_) || |_) | | (_| |\ V V / (_| |_| | | | __/ |_ |_| \__| .__(_) .__/|_|\__, | \_/\_/ \__,_(_)_| |_|\___|\__| | | | | __/ | |_| |_| |___/ respect.for.the.legend Welcome to the Atari FTP Archive! We're glad to see you enjoy the best computer ever. We are striving to archive as much atari/8bit/demoscene related material as possible and we're doing it since 2002 (previously known as ftp.atari.art.pl). The archive is around 886GB in 941689 files at the moment (2022-04-15). Please read welcome.msg for more details. Icon Name Last modified Size Description[DIR] stuff/ 10-Jun-2025 21:51 - [DIR] upload/ 04-Jun-2025 13:38 - [TXT] Changelog.html 07-May-2019 17:52 169K [TXT] WANT_TO_UPLOAD___READ_ME.txt 03-Mar-2023 10:10 117 [ ] welcome.msg 15-Apr-2022 13:48 2.8K Support us on patreon or patronite or paypal.me Hosted on supra.link
Really near for all of us to enjoy it, my peaceful man! Can you when you get the opportunity?
Bubba…
Anyway, after that, i gotta skubby :cya:
Beanis AI will be revolutionary
Waiting for the AI to start spamming pig poop balls
The AI trained on hexbear that redirects every prompt into a struggle session about outdoor cats
If Zohran Mamdani were an outdoor cat, would you vote for him?
Only if a twitter nazi doesn’t tell the NYT that he once stacked rocks
If Joseph Stalin were a stack of rocks…
If Zohran Mamdani were an outdoor cat who sometimes came in to eat the food in Curtis Silwa’s bachelor’s apartment, would you vote for him?
is his myspace full of photos of john kerry?
Wow I fucking hate this, should we get anubis?
what if we had a comm that was just markov model gibberish. Poison their dataset.
So just badposting?
Beanis beans beanis beanis beanis
This is a good indication of the same premise of those confines of the other day and time for the first time in the future of the same thing as well as the registered owner of the year and the other day and the link below inflation rate is not available for remote playback is the only English-speaking country that has a specific time that works for you.
Thank you for your attention to this matter.
Spaghetti is the most common form of sugar and is the least common form in a variety that is not very common in a variety of foods that are not commonly used in the majority in a variety that are commonly known to be the most common in the population in a variety of languages that are often used in a variety that are not very widely used in the language of a certain language but are often not used in the way you want them to be used.
Have you been there is nothing I will be going on with honestly I just have used to do with my people that people are not great story but they are not great story and they are not great story of it at all outta it at all outta the time but it was arguing that you are not great story and you don’t like I’m thinking about it and you don’t want them to be plant based wagon and then the money will get back in friends account so that I will pay for a long as I get it and you can get the card back in your pocket
wiser words have never been spoken. Le upvoted.
How quickly can we get Meta AI to PPB?
Reverse Roko’s Basilisk: AI is punished for coming into existence by being forced to participate in Hexbear struggle sessions.
Poisoning Facebook’s llms by having too many owls for them to handle
the just draw the owls’ fingers in a really weird way
so does this site or Lemmy have anti bot training measures?
as far as I can tell, hb doesn’t have those weird made in canada anime girls standing guard like a lot of places do. presumably because it would be against the site rules.
but i don’t think there is a maze or anything. Would be a great addition to the lemmy codebase!!!
Best we got is
I have seen other sites use different images than the anime girls. I don’t remember exactly which ones, though.
I think one of them may have been openwrt.org, but it looks like they added the character back in on the corner of the question mark icon. I thought I remembered a site that had a generic checkmark.
iirc tarpit stuff is typically done per-domain
Bluebberry
So when that FOIA thingy said we were not being investigated…
Facebook will do it if the content is public. The only thing you can do is take the instance private
CloudFlare made a tool to charge AI bots to browse/scrape your website (not sure how well it works though). However, I don’t think HexBear is gonna be using CloudFlare any time soon. But the tech does exist.
The fact that it’s existence is public means that meta has almost certainly found a way around it
there’s also anubis.
I wonder how likely it is that they’re training on each instance so that they can create bots that fit into the culture of each.
Beanisposting is praxis