#dev 2024-09-06

2024-09-06 UTC
thegreekgeek, pavel, geoffo, claudinec, thegreekgeek_, ChrisFu and mretka joined the channel
# 06:52 
[0x3b0b] Further anecdotal information for the thing [tantek] and [snarfed] were dealing with: After what I presume were some edits and re-federating, Tantek's profile on my instance now shows posts that were missing before, and MBP does _not_ bubble the updates to the top
[qubyte], GWG, thegreekgeek, sp1ff, thegreekgeek_, [KevinMarks] and ttybitnik joined the channel
# 13:26 
pcarrier any technical reason why one would want to avoid creating feed entries with dates in the future? I'm trying to post "daily" and running ahead, wondering if I might break stuff doing this
[schmarty] joined the channel
# 13:35 
[schmarty] If a feed reader shows posts chronologically by post date they'll likely float atop others until their publish time passes.
# 13:38 
aaronpk and because of that, defensive coding of feed readers suggests they might rewrite the date to the date the post was discovered in the feed instead, defeating your entire goal
# 13:41 
pcarrier FWIW that wasn't my entire goal, but rather to keep 1 post per day and not have to do anything for the next 2 days
# 13:42 
aaronpk well if you don't mind people reading all 3 posts on the same day then i guess it's fine
# 13:43 
pcarrier Yeah I don't mind 🙂
# 13:45 
sebbu some blogs supports posting a post at a specific date
# 13:48 
sebbu (and i can see a perfectly valid example of future-dated post : talking about / organizing an event (even if you could or should use a calendar for that), including when it opens, close, the map, the different events inside the events, where they are (on the map), with who (guests), if you need to register beforehand, etc...
# 13:49 
pcarrier sebbu: yeah it's different, the post only appears on the site and feeds at that date then
# 13:51 
pcarrier Or rather, I would guess? Anyway not so with my static site generator, it's there on next publication. I could filter and automatically deploy every hour or something, but I won't.
# 13:55 
sebbu yeah, i remember using webcron for something similar around a decade ago
# 14:57 
[tantek] What is scheduled
# 14:57 
Loqi scheduling in the context of the indieweb, refers to the feature of setting a specific time for a post to be published in the future https://indieweb.org/scheduled
# 15:04 
[tantek] What is MBP?
# 15:04 
Loqi It looks like we don't have a page for "MBP" yet. Would you like to create it? (Or just say "MBP is ____", a sentence describing the term)
# 15:07 
[tantek] Hmm looks like I have Unicode problems on my http://micro.blog/t (as well as a duplicate post every time I changed a slug)
# 15:08 
[tantek] Is anyone here following my Atom feed and seeing weird characters or duplicate posts? Or not? And in which reader(s)?
# 15:08 
[tantek] XML--
# 15:08 
Loqi XML has -1 karma over the last year
# 15:12 
pcarrier [tantek]: https://tantek.com/updates.atom looks good in miniflux except plenty of dups.https://srv.us/ukd52f37ksovcp5yiv3nkftazojpiakd
# 15:13 
pcarrier argh https://srv.us/ukd52f37ksovcp5yiv3nkftazojpiakd
# 15:16 
[tantek] Got it. Yeah the dupes are likely from me changing the slug
# 15:17 
[tantek] Sigh, maybe I have actually support atom:id so I can change slugs without causing dupes in feed readers
# 15:18 
pcarrier you mean URL slugs? do you provide redirects when you change them?
# 16:15 
[tantek] Yes, always
# 16:16 
[tantek] So feed readers are typically not smart enough to notice that new URLs are redirects of existing ones and dedupe that way
ChrisFu joined the channel
# 16:20 
[tantek] ooh Social CG telcon today in ~40min!
# 16:21 
[tantek] https://www.w3.org/events/meetings/6b91f267-6cb4-41ca-9a67-5035cee827dd/20240301T130000/
# 16:21 
[tantek] cc: [snarfed] [manton] [KevinMarks] and whoever else is interested
ChrisFu, barnaby and srijan joined the channel
# 17:34 
jimw [tantek] I'm not sure I see how noticing new URLs are redirects of existing ones would even work for a feed reader. If it sees `https://example.com/original` and then `https://example.com/new` it has no way to know if `/original` is redirecting to new unless it's hitting every old feed URL to notice. And since it's a feed reader, it's probably not
# 17:34 
jimw even loading `/new` until the user clicks through to it somehow. That's why you need a stable identifier like `atom:id` if the URLs aren't stable.
# 17:36 
aaronpk my webmention receiving code does something similar, but it relies on the webmention sender to send an update from the *old* URL
# 17:36 
aaronpk otherwise yeah it requires polling all previously received URLs forever which isn't scalable
# 17:37 
aaronpk if the new post has a reference to the old URL, then I can use that as a trigger to check the old URL for confirmation
ChrisFu joined the channel
# 17:38 
jimw Yeah, even just pulling new URLs found in the feed could be pretty rough if most feeders did that because it would multiply a poll of your feed into multiple requests for full pages whenever there was a new post.
# 17:39 
jimw Not quite the thundering herd of Mastodon link preview requests because the polls are more staggered, but still not how most feed readers work, I think.
AramZS joined the channel
# 18:19 
[tantek] pretty sure rel=alternate can solve this?
# 18:20 
aaronpk that doesn't sound like quite the right semantics
# 18:20 
aaronpk it's more like rel=old lol
# 18:20 
[tantek] as in when a feed consumer sees a rel=alternate link on a new item, if the feed consumer already "knows about" the alternate link (as in, has seen it before), then it can do a HEAD retrieval of the alternate link to see if it returns a redirect to the new item
# 18:20 
[tantek] well it's a form of rel=alternate
# 18:21 
[tantek] and then a HEAD check on it can confirm that it's been redirected to the new thing
# 18:21 
[tantek] that's how you do the bidirectional confirmation
# 18:21 
[tantek] without re-retrieving the entirety of the existing item
# 18:21 
aaronpk it would work, it just doesn't match the meaning of rel=alternate IMO
# 18:21 
aaronpk and HEAD vs GET doesn't really make a difference at the end of the day
# 18:21 
[tantek] rel alternate handles A LOT of meanings
# 18:22 
[tantek] HEAD request at least enables the server to skip generating the body
# 18:22 
aaronpk i wonder how many people actually implement it that way
# 18:22 
[tantek] past URLs are valid alternates IMO
# 18:22 
aaronpk vs generating the whole thing and just having the http server return only the HTTP header
# 18:29 
jimw You're also assuming that the feed consumer is loading the page for each entry, unless you mean the rel=alternate should be in the feed entry somehow.
# 18:30 
[tantek] yes feed entries should have rel=alternate links anyway, e.g. to the HTML permalink
# 18:31 
[tantek] jimw, no, the Atom entry can have a link rel=alternate
# 18:31 
[tantek] or multiple!
# 18:31 
[tantek] aside: FYI aaronpk https://blog.erlend.sh/indie-social-sign-in-could-go-mainstream
# 18:35 
jimw So on tantek.com, you should have added rel="alternate" links to the h-entry with the old slug, and a feed reader should have seen that, checked that it redirected to the u-url for the h-entry, and thus known the h-entry with the new u-url was an update to the old h-entry it had seen earlier?
# 18:37 
[tantek] jimw, no restating: "the Atom entry can have a link rel=alternate"
# 18:38 
[tantek] HTML consumers don't have this problem
# 18:38 
[tantek] only Atom consumers have this problem, therefore only need to put fixes in the Atom
# 18:39 
jimw why isn't it a problem for HTML consumers? if my feed reader is using h-feed, presumably i'm going to get duplicate entries if the u-url for an h-entry has changed between retrievals.
# 18:44 
[snarfed] ideally it shouldn't
# 18:45 
[snarfed] but if it does often on your site, that's probably what u-uid is for
# 18:46 
[tantek] jimw, because HTML consumers load an h-feed page and see only the current entries. problem solved
# 18:46 
[tantek] so far this has only been a reported problem in Atom / other feed file consumers
# 18:48 
jimw ok, so what i'd call traditional feed readers that accumulate their own database of entries from feeds don't consume h-feed, so it's not a problem.
# 18:49 
[tantek] social readers do consume h-feed
# 18:50 
[tantek] so far they don't seem to have the duplication problems of trad feed readers
# 18:50 
[schmarty] aaronpk's aperture consumes h-feed and accumulates its own database of entries from those feeds. it should suffer from this same duplication issue i think.
# 18:50 
[tantek] I suspect aaronpk would say something if it did
# 19:07 
[tantek] whew two hours of SocialCG meetings
# 19:09 
[tantek] what is jf2
# 19:09 
Loqi jf2 is a W3C Note and a JSON Post Serialization Format of microformats2 for that is optimized for h-entry consuming code, as compared to the standard microformats JSON representation https://indieweb.org/jf2
# 19:09 
[tantek] cc [jacky]
ChrisFu, [morganm], ttybitnik and barnaby joined the channel
# 20:39 
gRegor Just tried XRay HTML input with a different u-url and u-uid property, it only returned the u-url. So I think Aperture would have the duplication issue
# 20:41 
[tantek] Interesting hypothesis. I'll wait for the Monocle screenshot though
# 20:43 
aaronpk got an example of a URL that has this problem? i want to check the database
# 20:43 
aaronpk i don't know which of tantek's recent posts has had the URL change
# 20:45 
gRegor Depending on polling frequency of course
# 20:46 
gRegor I think https://tantek.com/2024/247/t2/twenty-years-microformats-shorter2 from the screenshot
# 20:46 
Loqi [preview] [Tantek Çelik] Twenty years ago this past February, Kevin Marks and I introduced #microformats in a conference presentation.
Full post: https://tantek.com/2024/044/t1/twenty-years-microformats
Aside: This is an even shorter summary of that post from ~200 days ago...
# 20:47 
gRegor Appears twice. The full post appears only once in the screenshot.
# 20:48 
gRegor In Monocle I'm only seeing 1 copy of each, -summary and -shorter2
# 20:51 
gRegor So it's looking correct at a glance in Monocle
# 20:53 
[tantek] what is deleted
# 20:53 
Loqi 🚮 A deleted is a post that has been removed https://indieweb.org/deleted
# 20:54 
aaronpk ah i do see some dupes
# 20:55 
aaronpk https://media.aaronpk.com/2024/09/06135510-4453.png
# 20:55 
[tantek] ^ what is that a screenshot from?
# 20:55 
aaronpk the aperture database
# 20:56 
aaronpk oh wait
# 20:56 
aaronpk the three dupes are marked as deleted
# 20:56 
aaronpk interesting
# 20:56 
[tantek] nice
# 20:56 
[tantek] 😄
# 20:56 
[tantek] so aperture is doing some de-duping! would love to hear how
# 20:56 
aaronpk wait no, they are marked as "not currently in feed"
# 20:56 
[tantek] so we can document it for others
# 20:57 
aaronpk which is the same as falling off the end of the feed
# 20:58 
[0x3b0b] [tantek] re: earlier question - sorry, I used MBP as an abbreviation for "microblog.pub"
# 20:58 
[tantek] does aperture parse u-uid to de-dupe things "not currently in feed"
# 20:58 
[tantek] 0x3b0b a-ha! in my abbr cache MBP = MacBook Pro
# 21:00 
[0x3b0b] That makes sense. I have had a similar problem before with D2 = Diablo II, Destiny 2, D2: The Mighty Ducks...
# 21:01 
aaronpk i'm confused, from everything i can see in the database, i should see two copies in monocle, but I don't
# 21:02 
[0x3b0b] There's a small group that's set up a github org recently to try to take over further development of a fork of microblogpub; one of them has a rudimentary version of post editing working. I'm looking forward to trying to merge in some of the accumulate changes and then making pull requests for them to pull in some of mine. But I digress (as is my wont.
# 21:02 
[mattl]2 SC2 = something called Starcraft 2, apparently.. it's always Short Circuit 2 to me
# 21:03 
[KevinMarks] If you are migrating urls and use 301 that's an explicit instruction to remember the new url in future. If you use 302 it means remember the old one and recheck.
# 21:05 
[tantek] aaronpk, I suspect Monocle is smart enough to collapse per redirects
# 21:05 
[tantek] when it tries to get the h-entry for a URL, and the URL redirects
# 21:05 
aaronpk I don't remember doing that but it has been a long time since i wrote that code
# 21:06 
[tantek] so it's fine that the database has a bunch of crufty data. I mean databases often do
# 21:06 
[tantek] this isn't the first time I've changed slugs so I suspect you fixed this previously 🙂
# 21:07 
aaronpk this is also weird because i see code for handing u-uid but xray doesn't return that property
# 21:13 
[tantek] spooky property at a distance!
# 21:13 
[tantek] tis the season I suppose 😉
[aciccarello] joined the channel
# 21:57 
[mattl]2 would love it someone with a wordpress site that isn't mine could try this: https://github.com/mattl/wordpress-to-eleventy/tree/main
# 22:05 
jimw finally implemented a Webmention receiver on my site. all it does is email the mention to me, but that's better than the old stub that just threw them away.
# 22:06 
superkuh Nice.
# 22:06 
superkuh How did you do it? Custom webserver program? Dynamic language with webserver? Parsing webserver logs?
# 22:09 
[tantek] jimw++ that sounds useful!
# 22:09 
Loqi jimw has 2 karma in this channel over the last year (3 in all channels)
# 22:09 
jimw it's custom software. i've rolled my own blog software for a long time.
# 22:09 
[tantek] rollyourown++
# 22:09 
Loqi rollyourown has 1 karma over the last year
# 22:10 
jimw open source, technically, but not quite to the point where anyone else could use it without having to hunt down references with my name. https://github.com/jimwins/talapoin
# 22:10 
Loqi [preview] [jimwins] talapoin: A small blogging system, used on trainedmonkey.com
[0x3b0b] joined the channel
# 23:15 
[tantek] I am thinking of adjusting my auto_link function to treat @-names as *local* unless a default @-name prefix URL is specified
# 23:15 
[tantek] that is linking something something like @example to href="/@example"
# 23:16 
[tantek] which allows the local site to treat "/@" as a nicknames cache lookup
# 23:16 
[tantek] this would allow me to repair and redirect any past plain @-names in my past posts that used to mean people's Twitter handles back in the day, but may be deleted or sold as of today, to that person's personal site or other preferred @-contact
# 23:17 
[tantek] hmm, I wonder if we could do this for the wiki as well, even if it's a bit manual
# 23:18 
[tantek] lol turns out I already did this for myself just over 10 years ago 😂  /@t /[tantek] let's see if those link properly in the logs
# 23:25 
[tantek] ah slack auto-link interference
# 23:43 
[tantek] ok also considering @0-999 auto-embedding a time element that converts the Swatch beats to 24hr time
# 23:43 
[tantek] 24hr Z time that is, since Swatch beats are 0-aligned to CET/BST