#dev 2025-07-22

2025-07-22 UTC
#
gRegor
Seconded, Stripe is great
#
gRegor
I just emailed adactio about huffduffer tags feature seeming to be mostly gone, e.g. https://huffduffer.com/tags/indieweb is 404
#
gRegor
Interestingly tags on previous posts are still available in the JSON representation, so the data is still there. Maybe an intentional choice to remove tagging then. Will report back what I hear.
#
[social]
I know the answer to Huffduffer tags, they are off for the time being due to AI bots pounding Huffduffer. The tags provides and added vector with multiple pages. So this is why they are off.
#
gRegor
Oof, that stinks
#
[social]
Completely. This is why we can’t have nice things..
#
gRegor
Though I did a quick test and adding an input field `name="bookmark[tags]"` *does* add those tags to the post
#
[tantek]
that's worth documenting as negative on the /LLM page
#
gRegor
indeed
#
[social]
I stumbled onto the lack of tags in Huffduffer in the last month or so and asked.
#
gRegor
I meant to ask back in February when it came up, which feels like yesterday, hah
#
Loqi
[preview] [Jeremy Keith] @vanderwal @jgarber I removed them for performance reasons (the site was getting hammered by AI crawlers). It broke my heart.
grufwub joined the channel
#
gRegor
Added on /huffduffer and linked from /LLM criticism section
#
[morganm]
BOOOOO!
#
[mattl]
This stuff sucks but it also helps in that the more people can see their favorite things rejecting AI-slop and blocking things the better
#
[mattl]
It’s why I’ve stopped using Bluesky
#
gRegor
It's still easy to scrape by other means like https://huffduffer.com/new, so it really seems to just be about the performance
#
gRegor
uh oh, what's Bluesky doing?
#
gRegor
(I mean, I'm sure adactio is anti AI slop)
#
osteophage
"The mechanism and expectations are similar to robots.txt files on the web."
#
osteophage
Unfortunately robots.txt files are already getting ignored, so I don't expect this to do much either way.
#
IWDiscord
<o​steophage>
#
[mattl]
Bluesky won’t stop showing me AI-slop terms and they’re giving out blue checks seemingly at random…
#
[mattl]
Also Bluesky tends to be more news focused and I’m doing my best to limit myself to reading/interacting with news
#
[snarfed]
osteophage it's not a guarantee, but the big model co's are already all obeying the anti-AI headers now, just like most crawlers generally obey robots.txt
#
[snarfed]
tools for expressing and detecting intents/preferences are valuable, and can work at scale, even if they're not perfectly enforceable
#
osteophage
Just added some links to the wiki about this a moment ago.
#
[mattl]
I’m still plugging away at my thing… which uses a mixture of IP addresses and user agents to outright block things
#
[mattl]
It does require you use HAproxy, but you’ll also just be able to pay me for hosting if you’d rather.
vidak joined the channel
#
[tantek]
I’m going to need to modify my “if it’s not curlable then it’s not on the web” 😔
#
[tantek]
Or maybe we need a new definition for a “human accessed web”
gRegor and vidak joined the channel
#
[tantek]
Is Bluesky selling a firehouse to LLM indexers?
#
[tantek]
firehose* of posts
#
[tantek]
I think a silo selling a firehose of posts to LLM indexers (without your consent or ability to opt out) may be enough reason to ditch the silo (and not even POSSE to it, or maybe only POSSE an article title and link)
#
[snarfed]
the firehose and historical data is all public, no selling involved
#
[tantek]
[snarfed] "public" can mean many things, e.g. cc-nc, and thus its reasonable to ask if a company's "public" resource is being sold
#
[tantek]
or institutions which offer their data "for free" to researchers and individual members of the public, but corporations have to pay
MrNice joined the channel
#
doesnm
what is huffduffer
#
Loqi
Huffduffer, built by Jeremy Keith is a tool for creating podcasts out of audio files from around the web https://indieweb.org/Huffduffer
vidak, Maxpm, cow_2001, barnaby, gRegor, claudinec, rob32, zoglesby, hedy, capjamesg, eb, okCiel, suki, vikanezrimaya, roxwize, srushe, nnrx, ancarda, athenaeryma, doesnm, ttybitnik and [schmarty] joined the channel
#
[snarfed]
sure. didn't mean to imply that they were mutually exclusive. Bluesky doesn't sell data, they do work hard to make it all easily accessible for free, and they're adding the ability to opt out of AI indexing
#
[snarfed]
(linked here earlier)
gRegor, ttybitnik and srazkvt joined the channel
#
[tantek]
So much distributed burden for everyone to have to opt out of LLM indexing everywhere
#
gRegor
Definitely feels like one of the darker moments on the web :/
#
[snarfed]
[tantek]++ agreed 😢
#
Loqi
[tantek] has 33 karma in this channel over the last year (154 in all channels)
#
[snarfed]
Cloudflare's trying to flip it to opt in
#
perryflynn
as long as the politicians are all on the "AI first" hype we can just give up. you can't do anything about it. that is the advantage and also the biggest disadvantage of the internet.
#
perryflynn
and if the first laws will come (if ever) then the scaper will just move to a country where no laws are established.
[Sophia_wood] joined the channel
#
[Sophia_wood]
The Silo ate my website 😞 I need to host a map and my haiku site after Glitch has shut down and my brain hurts just thinking about moving everything right now. I am frustrated I didn't see the email and wish there had been more of a runway.
#
[Sophia_wood]
^^ It is totally on me - I used Glitch to learn how to do servers (make an API, Map server, etc. )
#
[artlung]
Ouch [Sophia_wood]... what remnants do you have?
#
[Sophia_wood]
okk - I am able to download my projects!
#
[Sophia_wood]
Now I need to figure out how to host.
#
[Sophia_wood]
just figured out how to get them - there was a glitch when I went to go get them.
#
[Sophia_wood]
I don't feel like I have the bandwidth to self-serve with security and not understanding risk.
#
[Sophia_wood]
thoughts?
#
[artlung]
I'm not sure I understand the scope of the question. What's the site in question do and how is it supposed to work?
#
[Sophia_wood]
I have 3 projects I'd like to save:
#
[Sophia_wood]
• Porthales - openmap server with all the stormhole drains for my city with added portaldestinations
#
[Sophia_wood]
• soHaiku - an rss feed that self populates every hour with a new oulipo haiku
#
[Sophia_wood]
I need to dig in and remember what I did, but I think they are mostly node.
#
[Sophia_wood]
• webkitty - an api to check on the webkitty status that changes every 30-120min.
#
[social]
Does your host support Node apps?
#
[Sophia_wood]
I was hosting on Glitch - it just died
#
[Sophia_wood]
correction - go killed
#
[Sophia_wood]
so I'd like to avoid Fastly
#
[Sophia_wood]
So I think I need to find a host - (or become a host) for node apps
#
[social]
I didn’t use Glitch, but I have followed many Glitch apps and I’ve been trying to see how they have been approaching moving their apps. But, I haven’t seen them resurface. Many were planning on moving things to their own light Node servers (like Digital Ocean), but those take a bit of work beyond building good apps.
#
[artlung]
I would be looking at whether netlify could satisfy the requirements.
#
[social]
My host offers many server options, including a virtual host for Node.
#
[Sophia_wood]
I want to make sure I can't get hit with any surprise bills - a place that caps rather than charges.
#
[Sophia_wood]
I use netlify for a lot of other sites - just not node yet - I'll check on it.
#
[social]
When you have demoed your site I think the stormhole drains may be the only one that needs some backend (depending on how you have it built), but the others seemed like it could be done serverless.
#
[schmarty]
[Sophia_wood] - Jenn Schiffer (former head of community at glitch) had a lot of projects to rescue and re-home, as well. she tried out a bunch of things and made a list of recommendations on her site: https://livelaugh.blog/posts/glitch-alternatives/
#
[schmarty]
she put a focus on services with free tiers that let you start without putting in a credit card.
#
[Sophia_wood]
omg - "if you manually download via the dashboard, _it will not include assets_, but his script does."
#
[Sophia_wood]
I am so grateful for this post
#
[schmarty]
glitch++ fastly--
#
Loqi
fastly has -1 karma over the last year
#
Loqi
glitch has 1 karma over the last year
#
[Sophia_wood]
fastly-- for sure I will miss glitch
#
Loqi
fastly has -2 karma over the last year
#
[Sophia_wood]
I will also miss their team - always so helpful.
#
[Sophia_wood]
Thanks [schmarty]++
#
Loqi
[schmarty] has 14 karma in this channel over the last year (45 in all channels)
#
[Sophia_wood]
and [artlung]++ and [social]++ I think I will look at digital ocean and myhost for the node apps.
#
[social]
I sent a note with my hosting company, I’ve used Digital Ocean for a long time, but I’ve stopped running the apps I host there. They are good, but it does often require some hosting experience.
#
[social]
[schmarty]++ that link is one I had seen mentioned but not tracked down. It is full of helpful information for moving beyond Glitch.
#
Loqi
[schmarty] has 15 karma in this channel over the last year (46 in all channels)
#
[mattl]
I would be wary of Digital Ocean purely because they have made it so trivial for anyone to sign up and launch a bot that attacks other sites.
#
[Sophia_wood]
Oh wow- good to know. I am looking for sites that feel good on who the serve and support.
#
[Sophia_wood]
I missed the may post - I was on vacation, and then heads down trying to figure out work in June- so I’m going for a run, then digging in to options. I love this community. Thank you
#
[mattl]
I would look at Jenn Schiffer’s recommendations for hosts after Glitch
#
gRegor
"The Silo Ate My Website" would be a good podcast name
#
[mattl]
They probably all have some AI-bullshit you have to contend with
#
doesnm
gRegor: technically all sites are powered by silos
#
[social]
One of the things I love about my host is the lack of AI.
#
[mattl]
doesnm: how so?
#
doesnm
ISP-s
#
gRegor
backs away from discussion XD
#
gRegor
meant mostly jokingly
#
[mattl]
you can rent or buy a server of your own, install whatever you like… how is that a silo?
#
gRegor
Just not interested in the topic of *how* independent something is
#
doesnm
[mattl]: are it's really your server?
#
[social]
gRegor is going independent of the independence discussion
#
gRegor
[galaxy brain meme]
#
[mattl]
doesnm: if I buy a server, yes it’s mine. if I rent a server, it’s not my property but it is effectively mine. I’d say the same is true of a virtual server.
#
gRegor
I moved up this section, feel free to expand with quotes and additional resources: https://indieweb.org/Glitch#Alternatives_to_Glitch
#
[mattl]
If you can choose the operating system, I’d say that’s a valid level of “it’s yours”
#
[mattl]
gRegor++
#
Loqi
gRegor has 39 karma in this channel over the last year (110 in all channels)
#
doesnm
[mattl]: choice from limited list or bring out own iso?
jak2k joined the channel
#
[mattl]
either way.
#
[artlung]
Added [mattl]’s contribution to the https://lab.artlung.com/comics-a11y/ to the actual page. I'm on the lookout for more examples too. Bestiary of comics markup eventually.
#
[mattl]
a choice of a bunch of Linux distros and maybe some BSD or possibly Windows… that’s about the limit of choices you really have on amd64 hardware.
#
gRegor
[artlung], possibly for that page (it's adjacent at least): just realized that /Monocle is pulling out the alt text and displaying it under the image. I thought the xkcd feed was actually including that in the description text, but nope, it's just the img+alt
#
doesnm
[mattl] what if i want alpine or pfsense
#
[schmarty]
what operating systems a hosting provider supports and whether that host is a silo are not really related.
#
gRegor
It's handy so I don't have to hover on the image to read it. Also good for mobile.
#
[artlung]
https://xkcd.fyi was mentioned during FrESH. It specifically creates transcripts visible to the user in a way alt + title are used in xkcd
#
gRegor
ooh, nice! I like that font as well. Very appropriate
#
carrvo
Any Apache gurus can help me understand when it chooses its status codes? I used to have it where under a webspace path it would return 401 at first (not logged in), then a 403 if their login still did not have permission. Other paths were 403 by default. I'm not sure what changed in my configuration but now it only returns a 403...I suspect it may have to do with a change whereby a page configured for anonymous use will return 20
#
[mattl]
doesnm: you can install those on your own server and if they’re not listed you can ask your host if they’ll install it for you.
#
[mattl]
carrvo: do you want to share your config in a gist or pastebin?
marc_in_space joined the channel
#
[mattl]
doesnm: otherwise you can argue that the whole internet is a silo which I don’t believe it is.
#
[artlung]
[gRegorLove] thanks. added some points about xkcd https://lab.artlung.com/comics-a11y/
#
Loqi
[preview] Comics Accessibility [DRAFT IN PROGRESS]
#
carrvo
Give me a bit and I will see how to share.
#
[snarfed]
timeless and relevant, re "how independent": https://snarfed.org/2022-12-03_48201
#
Loqi
[preview] [Ryan Barrett] I thought using loops was cheating, so I programmed my own using samples. I then thought using samples was cheating, so I recorded real drums. I then thought that programming it was cheating, so I learned to play drums for real. I then thought using ...
#
[mattl]
there’s an element of purism with some of this stuff that people like to get into too
#
[snarfed]
absolutely
#
[mattl]
ie. my website is better because we only use 100% free software, hosted on risc-v hardware using solar power.
#
[mattl]
and then you realize that to connect to your website you need to go through 13 Cisco routers and the wifi hotspot at Starbucks.
#
[snarfed]
"yeah I wrote my web server and OS from scratch, and I built my own server and racked it in a colo facility I built myself, on land I own, powered by my own generator, connected by a boutique ISP I run myself...but I'm not yet making my own chips out of FPGAs. I'm worried about that dependency. not to mention the gas I put in my generator, I have to buy that, that's a problem too"
#
[mattl]
i notice you’re using a hard disk you purchased from Toshiba for that server too. Can’t believe you’re not standing there manipulating all requests with a handful of magnets and some paper tape.
#
[snarfed]
and a steady hand
#
capjamesg
[snarfed]++
#
Loqi
[snarfed] has 41 karma in this channel over the last year (77 in all channels)
#
capjamesg
Great quote in your blog post.
#
[mattl]
I’ve been that guy too. Unlike most people here, I was in a cult that focused very heavily on such purity. As I started to get disillusioned with it, I began to quickly see how ridiculous it is. Well, even the person in charge of the cult uses a VPS host for their website and email, not to mention their DNS and domain name registration.
#
[snarfed]
self righteousness is way easier than doing the real work, building accessibility etc
#
carrvo
Something like that anyway.
#
carrvo
I have noticed in the logs it has a difference between `denied` and `denied (not authenticated)` and I vaguely think it may be related to `GroupExternalAuthNCheck Off` bypassing the authn when it's authz checks for anonymous access.
vidak and bugliker02 joined the channel
#
[mattl]
new blog post! [artlung] helped me think about this again recently. https://mat.tl/blog/2025/07/22/the-freedom-to-fuck-off/
#
Loqi
[preview] [Dr. Matt Lee] I first heard this described in the most unusual of places: a movie. In 24 Hour Party People, Steve Coogan portrays Tony Wilson who co-founded Factory Records and would later sign bands like The Durutti Column, Joy Division (later to become New Orde...
#
Loqi
[mattl] has 34 karma in this channel over the last year (78 in all channels)
#
[artlung]
[mattl]++
#
[KevinMarks]
I do like the bit in 24 hour party people where the big label wants to buy them and Tony explains they don't own anything
#
[mattl]
yeah, London Records.
#
[mattl]
shame Keith Allen is that movie tbh
#
[KevinMarks]
Also, I know two 2010s era web people who are now goat farmers, which is surprising as there are two
#
[mattl]
Jeff Minter is a Llama farmer, I think
#
[KevinMarks]
That just feels like a natural progression.
#
[mattl]
HTML/Goats -> 6502 assembler/Llamas
#
[KevinMarks]
I was thinking of Eran Hammer and Derek Powazek
#
[social]
[mattl] Great piece. When I work with companies about product / tool selection one of the key things I push them to keep track of is “how do I get out of this and move to something else”. Products with lock-in most often are the ones you don’t want to enter into as they haven’t thought of the basics and they have lost too many customers.
#
[snarfed]
[mattl]++ love it
#
Loqi
[mattl] has 35 karma in this channel over the last year (79 in all channels)
#
[snarfed]
mirrors the two key IndieWeb ideas: 1) use your own domain, 2) own your data, be able to migrate/export it
#
[mattl]
I need to do a deeper look at some of the free domain name things. See what if any are actually good in 2025
barnaby joined the channel
#
[snarfed]
subdomains count as a starting point
#
gRegor
[mattl]++ great post! You might want p-name on just the title, so indienews sees it as an article
#
Loqi
[mattl] has 36 karma in this channel over the last year (80 in all channels)
bugliker027 and bugliker02 joined the channel
#
[mattl]
I’ll take a look later… gotta go leave the house and walk around for the rest of the day
#
[mattl]
[snarfed] I mean stuff like the .tk names that were gratis at one point
bugliker02, rozodru and bugliker027 joined the channel
#
epoch
an example of the free subdomains thing https://freedns.afraid.org/
#
epoch
lot more DNS control with those than you got with .tk domains afair
#
epoch
ofc the person that actually owns the domain can pull the rug without notice
#
epoch
so, not really that good for trying to do something that is gonna stick around forever
#
epoch
but could be used to practice before commiting to spending $/yr for your own domain
#
[mattl]
Yeah omg.lol has the same thing but you pay $20 a year for all the services but also they have a pool where people who need help can get it for $0
#
[mattl]
I’m going to keep a $0 tier for http://Libre.fm but I want to work on some way to take money that’s affordable for people regardless of their location. I’ve yet to see a solid way to do that.
#
[mattl]
Ie. Charge US $10, UK 8GBP or whatever, but find a way to make it equivalent of $5 USD for other countries
gRegor, sarna and bret joined the channel
#
carrvo
Any ideas on my Apache config?
#
[Sophia_wood]
after going for a run I may have figured out how to do an all client side approach for two of the pages and run a python script on my computer that pushes the changes to the rss feed and API.
#
[Sophia_wood]
I'm still working out my math approach.
#
[Sophia_wood]
When a Silo eats your website - go for a run.
#
[Sophia_wood]
*map approach
bugliker02 joined the channel