#indieweb 2025-01-07

2025-01-07 UTC
qbasiq, [schmarty], jgee118692253458, [Murray], [Joe_Crawford], jeremy, klymilark, grufwub, Tiny17, Tiny, TinyCaptain, cedric, Ed1, cdravcte2 and cdravcte joined the channel
#
edmael
Goood morning everyone (or everning or afternoon or night depending where you are :D).
#
edmael
Trying the chat from Discord to see if it gets a bit better than using it via the webclient.
#
edmael
Is there a way to get my nickname "verified" in the webchat as well?
#
edmael
I struggled to find some proper documenation about it, but maybe it's just me 😄
#
capjamesg[d]
What is chatnames?
#
Loqi
The chat names page has a list of chat regulars sorted by nickname with their website and usual timezone(s), and is used for icons next to nicknames in IndieWeb chat logs https://indieweb.org/chatnames
#
capjamesg[d]
I think you need to add your name to that list ^^
#
capjamesg[d]
I'm not 100% sure though.
#
edmael
I've done that yesterday, but maybe it takes some time (or I need to also create a user page on the wiki)
#
edmael
(Or maybe I used my name and not my nickname, could be that as well)
cdravcte joined the channel
#
edmael
It works now, yay!
#
Loqi
😃
cedric, nemonical, longlongdouble, [qubyte], ren, ttybitnik, oakridge and rvalue joined the channel
#
vinceaggrippino
👋 Hey everyone, I'm Vince.
#
vinceaggrippino
I'm American, from New York, but I live in Malaysia.
#
vinceaggrippino
I used to do some web development work, but it's been quite a while. There aren't any employment opportunities for me, but I still like talking about programming. On Mastodon, I saw someone mention an upcoming Homebrew Website Club meetup and I thought I'd check it out.
#
vinceaggrippino
</introduction>
#
capjamesg[d]
Do you have a personal website?
#
vinceaggrippino
capjamesg[d]: Thank you! I have a GitHub Pages site, but it's nothing impressive. Honestly, I've been neglecting it. But I've been thinking about putting some effort into making it a decent site. The current iteration is built with 11ty: https://vaggrippino.github.io
#
edmael
Hey Vince, welcome! I'm pretty new as well, but if I can help you with something feel free to ask and we'll get there together 😄
#
vinceaggrippino
edmael[d]: Thanks, Ed. Right now I'm trying to understand / figure out how to code the RSVP thing for the upcoming Homebrew Website Club. It's optional, but it looks interesting. This is probably a question for the -dev channel, but I'm gonna do a little more reading before I ask 😁
#
edmael
I guess you've already checked here, right: https://events.indieweb.org/
klymilark joined the channel
#
vinceaggrippino
edmael[d]: Ya, that's how I got here. The links for the RSVP / webmentions thing are kinda circular. The instructions are mostly targeted at people who are using packaged CMSs like WordPress or Ghost, but my site is (mostly) hand-coded. I saw something about "h-entry" and "p-rsvp", but the examples don't show much more than CSS class names. I think I saw Microformats mentioned. I just need to stop skimming and read it properl
trwnh joined the channel
#
edmael
Ahahaha yeah, I get it! Start from here: https://indiewebify.me/ and when you get to webmentions try these: https://webmention.io https://brid.gy (at least this is what I did the past few days and now I have the webmentions somehow working :D).
#
Loqi
edmael: lol
#
edmael
Bad informations? That was like the crash course 😄 The other starting point I had was this: https://www.joelotter.com/posts/2023/03/indieweb/
#
vinceaggrippino
edmael[d]: Thank you! I think that's what I'm looking for.
#
edmael
[edit] Bad informations? That was like the crash course 😄 The other starting point I had was this: https://www.joelotter.com/posts/2023/03/indieweb/
ren, aaronpk, ren-, [KevinMarks], wobbol, aelaraji3 and barnaby joined the channel
#
[tantek]
really wish I had an archive of all the events I attended / participated in, regardless of what "platform" they themselves were organized / hosted on.
#
[tantek]
trying to reconstruct this historically is very hard
#
[tantek]
curious if folks have good techniques for backfilling from various sources of past events?
#
[Joe_Crawford]
Cloud based calendars and email archives. But yes, it's hard. I have a barcamp type event that took place in Arlington VA in like... 2010 or 2011 -- and I can't find any evidence of it in my email anywhere. I thnk maybe it took place in the offices of Palantir. It's vexing I can't find evidence of it.
#
[Joe_Crawford]
Also, edmael, vinceaggrippino, welcome aboard! You're in the right place!
#
[tantek]
ironic not being able to find something that took place at Palantir
gRegor and longlongdouble joined the channel
#
[Joe_Crawford]
It was a last minute decision to check it out. I was visiting my sister. Thus, no RSVP i think. And at the time I was unaware of what Palantir, the company, was. And only vaguely aware of what the word meant.
#
gRegor
My backfilling has mostly been for concerts I've attended, so usually I can find them by looking at band sites / Internet Archive snapshots. Often ending up in a snapshot of a Myspace page. :)
#
gRegor
Or emails / gchat archives
#
[tantek]
Email is a good source yeah at least for discovery of event permalinks
#
gRegor
o/ Welcome vinceaggrippino! You can definitely show up to Homebrew Website Club without the RSVP, just a nice to have. People at the HWC can also help answer questions about the process.
#
aaronpk
I've gotten bad about posting events on my site but I was doing it for a while for all the conferences and things I attended
ttybitnik joined the channel
#
carrvo[d]
I've thought it nice before to see a history of events/meetings before, but the implementations I have worked with (Google and Microsoft) have steered me away. Too much post-editing and too many people reusing the same meeting by moving it to the future.
#
carrvo[d]
And the most painful are meeting series. Somehow editing the future events requires editing past events...
#
aaronpk
recurringevents--
#
Loqi
recurringevents has -1 karma in this channel over the last year (-2 in all channels)
ren joined the channel
#
[tantek]
recurringevents--
#
Loqi
recurringevents has -2 karma in this channel over the last year (-3 in all channels)
#
[tantek]
I'm trying to at least get an archive of all or the *in-person* particular public or semi-public (e.g. standards related) meetings and events. I'm less concerned about virtual events and internal company meetings etc.
#
[tantek]
in-person events don't tend to be recurring / meeting series any more
sebbu2 and teasea joined the channel
#
petermolnar
https://www.businessinsider.com/openai-anthropic-ai-ignore-rule-scraping-web-contect-robotstxt - yeah. I was wondering why my server load suddenly started alerting. Self hosted git (forgejo) and a*hole AI scrapers are not a great combination.
#
petermolnar
well, now one more thing gets "I'm a teapot" response from my system.
#
petermolnar
the benefits of running a very low power home server is that I notice scrapers
#
capjamesg[d]
petermolnar I have seen quite a few mentions of this issue recently.
#
capjamesg[d]
I checked my logs but didn't have many requests. I nevertheless now send 404s to those and many other bots for every page on my site.
#
capjamesg[d]
I'm hoping that a 404 means they won't try with another user-agent.
ttybitnik joined the channel
#
petermolnar
hence my "418 I'm a teapot"
#
petermolnar
daily 13k hit from ClaudeBot every 2-3 days
#
aaronpk
Are they self identifying with a user agent? I haven't looked at my logs in a long time
#
capjamesg[d]
I got 57 from Claudebot today.
#
capjamesg[d]
You can look for "ClaudeBot/1.0; +claudebot@anthropic.com" for Claude.
#
capjamesg[d]
OpenAI identifies too.
#
[tantek]
I'm mixed on allowing AI crawling. Like if you truly post a lot of educational accurate content, you're helping teach the AIs which helps others who use them. It gets to the heart of why post anything publicly.
#
gRegor
what is robots txt
#
Loqi
robots.txt is a file used to inform web crawlers what parts of a site should or should not be crawled https://indieweb.org/robots_txt
#
gRegor
Some other disallows documented on there ^
#
capjamesg[d]
Having fun with typography ✨
#
rdg
[tantek]: why not just use a search engine?
#
[tantek]
rdg, what does that have to do with blocking or allowing AI crawlers on your personal site?
#
gRegor
search quality is going down too as they push all this "AI summary" stuff, unfortunately :/
#
[tantek]
gRegor, that's also part of my point, which is if you do allow AI crawlers to index your posts, then there's a chance your posts (or what you say) will make it into the "AI summary" stuff at the tops of search engines.
#
[tantek]
even if it's not classic "SEO", having an impact on the content of the top search result seems like an interesting use-case worthy of consideration
#
gRegor
Yeah, I get that. I think it's pretty slim chance personally, but sure.
#
rdg
[tantek]: if you feed AI (well I'm guessing LLM) with your posts you allow people to get "some" content via LLMs rather than just finding your site
#
[tantek]
gRegor, it's probably more likely on more specific or particular content
#
[tantek]
rdg, there's no "rather than". Search engines are going to show "AI summary" results at the top of search results whether or not you allow crawling your posts.
#
gRegor
I'm pretty pessimistic towards it currently, entering my old man yells at cloud era I think :)
#
[tantek]
gRegor lol 😆
#
petermolnar
my posts are not public domain. when the ai summary doesn't show attribution - and usually it doesn't - it's violating CC.
#
rdg
[tantek]: you can also choose other search engines
#
capjamesg[d]
petermolnar SearchGPT does link to sites.
#
petermolnar
if i could tell them to scrape text only, i might even be ok with it, but my photos are off limits for ai
#
[tantek]
rdg, sure you can. however, you're not the one choosing the search engines that other people use.
#
capjamesg[d]
(This is in "Search" mode, which is generally available but doesn't always trigger unless you explicitly use search mode.)
#
capjamesg[d]
All of those light grey names link to the source.
#
capjamesg[d]
There is also a "Sources" link at the bottom which links to sources:
#
[tantek]
Amazing. So that incentivizes posting about topics so ChatGPT will cite you in its results
#
capjamesg[d]
ChatGPT does sometimes automatically consult its search system. This is indicated by a message that says it is searching the web. I'm not sure how they determine when to automatically query the search system.
#
Loqi
friendly reminder [tantek], capjamesg[d], we try to keep jargon (ChatGPT) out of this channel to make it more inviting to newcomers, can you move this to #indieweb-dev?
#
[tantek]
lol ChatGPT is end user software not jargon. I'll fix that
#
[tantek]
gRegor, I see you added ChatGPT to /jargon in 2023, do you mind if I remove it? ChatGPT(.)com is an end user website where you enter a query into a text field and see results, not unlike Google(.)com in that regard.
#
[tantek]
(oops I meant to ask that in #indieweb-meta, apologies)
#
rdg
[tantek]: not choosing, but hopefully incentivizing them to try others
#
rdg
building a search index over posts or other static content has a computational cost much lower than training a full LLM over the whole web
#
[tantek]
Still not seeing the connection between trying to influence what search engine other people choose to use, and whether or not you allow AI crawlers to index your personal site.
khrome joined the channel
#
[tantek]
Sure, if you want to build your own search index, that's an important consideration!
#
rdg
you don't think that relying on LLM/AI summaries goes against the ideas in POSSE? https://indieweb.org/POSSE
#
[tantek]
quite the opposite! LLMs are yet another syndication target!
#
rdg
AFAIK LLMs may not link back to your site, they can just present information from your site keeping the user from visiting it, so I don't think I can agree with that view
#
carrvo[d]
I'm guessing that AI would get your content out, but comes with the risk of missing attribution (linking, license violations).
#
carrvo[d]
I recently discovered that Google's search engine AI results have links added these days (yay!). But I remain pretty skeptical.
#
Loqi
yay!
#
khrome
Anyone around here doing unified client+server buildless ESM?
#
[tantek]
"LLMs may not link back to your site, they can just present information from your site" <-- same problem with offering RSS feeds and RSS reposter bots presenting your content on spam blogs. Doesn't stop people from offering RSS feeds
#
carrvo[d]
Interesting comparison.
ttybitnik joined the channel
#
trinsic_paridiom
Hi I am new here I found out about this community from a hacker news [post](https://news.ycombinator.com/item?id=42581119). I am an information technology consultant who provides various technical solutions to the public. I consider myself an autonomous individual who believes strongly in independent free will and open concepts that grow humanity towards sustainability from a deontological perspective, and then from a utilit
#
trinsic_paridiom
[edit] Hi I am new here I found out about this community from a hacker news [post](https://news.ycombinator.com/item?id=42581119). I am an information technology consultant who provides various technical solutions to the public. I consider myself an autonomous individual who believes strongly in independent free will and open concepts that grow humanity towards sustainability from a deontological perspective, and then from a
#
Loqi
[preview] [dend] Be a property owner and not a renter on the internet
#
Loqi
[preview] [dend] Be a property owner and not a renter on the internet
[qubyte]1, [tantek]1, [KevinMarks]1 and gRegorLove_ joined the channel
#
trinsic_paridiom
I'm looking to get some advice on preserving my website content.