#meta 2025-07-22

2025-07-22 UTC
#
[tantek]
totally spaced on world emoji day 4 days ago 🤦
#
Loqi
I added a countdown scheduled for 2026-07-17 12:14pm UTC (#7207)
#
gregorlove.com
edited /Huffduffer (+366) "+Issues section with citation why no front-facing tags system"
(view diff)
#
gregorlove.com
edited /large_language_model (+125) "/* Other Criticism */ huffduffer/AI scraping info"
(view diff)
#
osteophage.neocities.org
edited /large_language_model (+1002) "added criticism about bandwidth and ignoring robots.txt"
(view diff)
#
[tantek]
Wondering if we should have a more general page for “why we can’t have nice things” and list examples like that
#
[tantek]
I’m sure we can think of plenty more
#
[tantek]
LLM crawler scraping has really screwed up a lot of default good faith assumptions about publishing public info on the web and I kinda hate it
#
[tantek]
Photos of other people come to mind. It was one thing for the surveillance apps to index all publicly viewable photos (that was bad enough) but…
#
[tantek]
It’s potentially much worse IMO when GenAI photo scrapers are used to create fake images of people without their consent, often in gross ways 😔
#
osteophage
Yeah it's a nightmare.
#
[tantek]
We may need a new definition of “open web” that’s not the “free for all random VC brainfarts to crawl and exploit web”
#
osteophage
Seems pretty inextricable from being open.
#
[mattl]
open doesn’t mean we have to let everyone access it.
#
[mattl]
if I have a party, you can’t come to my house and just take any of my things you like the look of.
#
[mattl]
open web just means the web anyone can have a page or site on without permission… vs. the facebooks/etc of the word
gRegor joined the channel
#
[tantek]
there are similar metaphors like retail stores are open to the public by default, but that doesn't grant anyone permission to come in and steal things
#
[tantek]
of course this is kind of being "tested" right now with all the influencers going into grocery stores and filming themselves making snarky points for their various channels
#
[tantek]
[mattl] I'd say the party at your house is more like a "friends-only" set of posts / permissions rather than open. Or perhaps I'm misunderstanding your "house" 😆
#
[tantek]
osteophage, "open" doesn't mean "free for all"
#
[tantek]
it could also mean "public space" like sidewalks, public parks etc. in most public parks you're also not allowed to take things like plants or animals. that doesn't make them "not open"
#
[tantek]
that said, there are public parks with streams and perhaps there's a good analogy here where you're allowed to drink from a stream (as a human, take your own risks on the water quality), however you're not allowed to siphon off all the water into your tanker and go sell it as "stream water".
#
[tantek]
does that make the stream not "open"? I don't think so, it's still open to individual humans to experience and use, it's just not open to exploitation
MrNice joined the channel
#
trwnh
"anyone can have a page/site without permission of facebook et al" perhaps some kind of independent web... an indie web if you will... hmm... 🤔
#
doesnm
trwnh: where is this quote?
#
trwnh
but otherwise you could maybe think about "commons" of a sort
#
trwnh
doesnm: it was a quote of [mattl] from a few hours ago
#
doesnm
i have chat logs locally
#
trwnh
anyway, the thing about scrapers is that i generally don't care to discriminate against them, as long as they aren't costing me anything. standard caveats apply (public is public, etc) so if you want to meaningfully change anything then you need some basis for applying policies to requests, and you need to make licenses discoverable in a standard way (insofar as anyone respects licenses, or otherwise if they disrespect your license you can
#
trwnh
apply a policy to reject their requests)
#
trwnh
i do think "put your info out there and expect it to never be misused" is something that fails any serious test
#
trwnh
the foundation of maintaining expectations for your resources is to never share your resources more widely than you intend to. i would wager that the vast majority of personal information is actually *not* something that should be made fully public. in most cases you want to have some idea of how far that info will spread and what people will do with it. "who wants to know?" is a common contextual modulator
#
trwnh
open access is useful for things that should be public, like scientific findings or datasets. but if you want access control then the open web is not very good at providing that... whereas something like facebook can easily provide that.
barnaby joined the channel
#
trwnh
the problem is that we don't have a really good foundation for identity that can be used to authenticate an http request, or a bearer token or capability management system that can be used to authorize an http request
#
trwnh
or otherwise to establish a session on a website facebook-style
gRegor, barnaby, claudinec, hedy, capjamesg and vikanezrimaya joined the channel
#
[tantek]
I think there are creative solutions worth exploring short of requiring identity. Wikipedia does some, pseudonymity is worth exploring etc
doesnm joined the channel
[schmarty] joined the channel
#
[tantek]
I wish there was a way for a site to ask (and get confirmation either cross site via oauth or via the browser) is this person logged in via GitHub, mastodon, indulging etc? Like not their identity, just a Boolean
#
[tantek]
That would be enough to enable say “read full post” permissions on something you don’t want scraped (and then you to rely on those IDPs banning bot accounts pretending to be human)
#
[tantek]
^ this is just one off the top of my head brainstorm for a solution that does not require a “really good foundation for identity”. I’m sure there are plenty of others
#
aaronpk
could probably make a proposal for that in the FedCM working group
#
aaronpk
you’d definitely need the user to confirm before sharing the boolean value with the site, so figuring out the right prompt to show the user would be tricky
#
doesnm
FedCM is still chrome only?
#
doesnm
on ~~outdated~~ Iceraven
#
[tantek]
aaronpk, indeed! lots to be figured out but I believe it has promise
#
[social]
In my Complexity / Social Lenses I have a lot of work on the public, private, and in between space (the transitional space, like a front porch) and the advantages and how to work through building it.
barnaby joined the channel
#
capjamesg
[social] Any relevant links?
gRegor joined the channel
#
[social]
I have pieces I’ve included around Social Comfort as well. I use this rather than trust to help understand boundaries and what is public, private, and the in between space.
#
[social]
I need to dig. I have some public talks / presentations, but much of it was workshops. A lot of it was helping large companies sort it out. I may have written up chunks of it. I have a long write-up I did around Google+ and its problems with boundaries (there really weren’t any that were clear and not porous.
#
[social]
I used a paper by Douglas Appleyard, “Livable Streets” as part of the foundation for the in between space / porch.
#
[social]
The downside of doing work for large companies (and doing it well) is it focusses on their distinct needs, technologies in place, but also how they deal with rick and compliance. The other piece is this isn’t public facing work, but often under NDA for long stretches.
#
[social]
To work around the NDA I often use things I have out there publicly as a base.
#
[social]
The loss of SlideShare functioning properly after it left LinkedIn means a lot of that stack of things I had public in presentations is now hard to find and use.
#
artlung.com
edited /Front_End_Study_Hall (+99) "/* To-Study List! */ svg"
(view diff)
gRegor joined the channel
#
[tantek]
SlideShare-- silos--
#
Loqi
silos has -2 karma in this channel over the last year (-3 in all channels)
#
Loqi
SlideShare has -1 karma over the last year
#
artlung.com
edited /The-Open-Graph-protocol (+264) "/* Criticism */ legal exposure"
(view diff)
#
[tantek]
[social] is that SlideShare regression in functionality documented in an article somewhere?
#
[tantek]
Please add it as a criticism to the page if you can find it
#
[tantek]
What is SlideShare
#
Loqi
SlideShare is a presentation hosting/sharing silo, owned by LinkedIn, which is now owned by Microsoft, that requires Javascript just to navigate to next/previous slides https://indieweb.org/SlideShare
#
gregorlove.com
edited /Front_End_Study_Hall (+359) "/* To-Study List! */ +fieldset"
(view diff)
#
[social]
I need to sort out how have my own version and feed into theirs easily. The new endeavor is https://jaunt.com
#
[social]
I know I haven’t written it up, but it is discussed a lot. Rashmi and Jon who started Slideshare are building a new version which looks good.
#
[social]
An overview of my Complexity Lens work is in this Jaunt https://jaunt.com/j/seeing-clearly-through-the-fog-of-complexity-1049
[Sophia_wood] joined the channel
#
[mattl]
Just seems like yet more AI-slop?
#
capjamesg
[social]++
#
Loqi
[social] has 1 karma in this channel over the last year (11 in all channels)
#
capjamesg
Thank you for sharing the slides.
#
gRegor
what is glitch
#
Loqi
Glitch is a service that shutting down project hosting and user profiles 2025-07-08. It allowed you to quickly prototype web applications in a complete IDE with built in version control, sharing, custom domain support, and more https://indieweb.org/Glitch
#
mat.tl
edited /Glitch (+113)
(view diff)
#
[mattl]
what is glitch
#
Loqi
Glitch was a service that allowed people to quickly create static and NodeJS applications in a browser or by importing a git repo. The shut down project hosting and user profiles on 2025-07-08. It allowed you to quickly prototype web applications in a complete IDE with built in version control, sharing, custom domain support, and more https://indieweb.org/Glitch
#
gRegor
doh, edit conflict
#
[mattl]
i blame fastly.
#
gregorlove.com
edited /Glitch (+575) "+How to section, alternatives, mv up Jenn's article"
(view diff)
#
[mattl]
gRegor++
#
Loqi
gRegor has 38 karma in this channel over the last year (109 in all channels)
marc_in_space, bugliker02, barnaby and bugliker027 joined the channel
#
gRegor
what is free domains
#
Loqi
It looks like we don't have a page for "free domains" yet. Would you like to create it? (Or just say "free domains is ____", a sentence describing the term)
#
gRegor
what is free domain names
#
Loqi
Free domain names are provided by certain domain name registrars without cost https://indieweb.org/free-domain-names
#
loqi.me
created /free_domains (+30) "prompted by gRegor and redirect added by gRegor"
(view diff)
bugliker02, bugliker027 and gRegor joined the channel