#dev 2022-04-08

2022-04-08 UTC
nertzy, lagash, YimingWu[d], Silicon[d], jacky, gRegor, gRegorLove_, johnnrs[d] and sayanarijit[d] joined the channel
# 02:43 
jacky as I'm reading https://www.inkandswitch.com/cambria/, I'm wondering if some sort of version stamping should be added to things like Microsub and Micropub
# 02:44 
jacky to help with versioning and API translations
# 02:45 
jacky like if we have a breaking change (like renaming `action` to `act` or introducing PATCH for particular operations), this has a notion of a translation layer (Stripe's approach is probably the most ideal) so requests can be 'upgraded' or 'downgraded' to the version understood by a server
# 02:45 
jacky this is meant for local-first software but I can see how this benefits non-local-first
# 02:48 
jacky tl;dr: it's schema migrations for API contracts
gRegorLove_, jacky, trig[d], johnnrs[d], Asaf_Agranat[d], laker[d], cygnoir[d], dovedozen[d], sayanarijit[d], hepphepp[d], samhenrigold[d], YimingWu[d], aspenmayer[d], Silicon[d], Jeremiah[d], tracydurnell[d], indieweb-irc-bri, corenominal[d], Nan[d], capjamesg[d], hoenir, shaunix[d], wackycity[d], balupton[d] and mro joined the channel
# 06:45 
capjamesg[d] jacky I was thinking about the societal impacts of large-scale, easy to access / crawl social graphs.
# 06:46 
capjamesg[d] With h-cards, governments could maintain a database of changes to your profile over time without having to circumvent any social media businesses' crawling limits.
# 06:46 
capjamesg[d] I am unsure whether this is a legitimate concern but it feels like one.
# 06:46 
capjamesg[d] What if everyone had a h-card on their site in the UK? Would people start mapping friends to create massive datasets for advertising purposes?
MarkJR84[d] joined the channel
# 07:24 
doosboox capjamesg[d]: it definitely is a valid concern. That said they wouldn't really have a way of mapping "friends" in a reliable manner, nor know which forums you are frequenting or what kind of information you usually search for.
# 07:25 
doosboox capjamesg[d]: btw, I think I've asked this before but what do you use for the indexing and search for your search engine?
# 07:25 
capjamesg[d] That is valid. I think I left XFN creep in there a bit without being explicit. I concur with what you said about mapping friends.
# 07:26 
capjamesg[d] I use Elasticsearch doosboox.
# 07:26 
capjamesg[d] 8GB server holds the 400k or so documents.
# 07:27 
capjamesg[d] As for the actual querying, I have a Python Flask server that wraps around Elasticsearch and turns human queries ("what is X...") into the right schema.
# 07:27 
capjamesg[d] The schema can vary depending on if a query is a "discover" query (find people whose h-cards mention something) or if the query needs to be ordered in some way.
# 07:40 
doosboox capjamesg[d]: how much of this have you built yourself and how much is off the shelf components?
# 07:41 
capjamesg[d] Elasticsearch is off the shelf. The crawler and search result representation (query cleaning, featured snippet extraction, etc.) is mine.
gRegor joined the channel
# 07:42 
capjamesg[d] Then things like post type discovery are part of indieweb-utils.
# 07:45 
doosboox how do you parse and translate human queries?
# 07:52 
capjamesg[d] The approach is naive right now.
# 07:52 
capjamesg[d] If a question contains a "what is" at the beginning, for example, the engine looks for a <dfn>, a direct answer in HTML documents that is likely to match based on a few semantic rules, and a couple of other things.
# 07:53 
capjamesg[d] Or if a question starts with "who is", the engine will look to retrieve a h-card for the person whose name is mentioned if one is available.
# 07:53 
capjamesg[d] To keep this efficient, this search only runs on the top few results.
# 07:54 
capjamesg[d] The assumption is that if the engine has information on, say, my h-card, it would show up highly for a search "who is jamesg.blog" (with "who is" filtered out because it is not useful information to query in the index).
# 07:54 
capjamesg[d] At scale, these sorts of naive rules might fall apart as an index grows a bit in favour of more complex logic. But I'm not building a really big search engine like Google 🙂
# 07:55 
capjamesg[d] I also remove some punctuation and transform the final, cleaned query, into Elasticsearch syntax (i.e. if a user has provided a keyword that requires a certain filter is used). I can't think of any of these syntax examples off the top of my head but I remember building at least one.
petermolnar, hoenir, capjamesg[d], shaunix[d], corenominal[d], wackycity[d], MarkJR84[d], edburns[d], indieweb-irc-bri, Crypto[d], Nezteb[d], edgeduchess[d], grantcodes[d], mro, niklasfyi[d], Murray[d], YimingWu[d], laker[d] and omz13 joined the channel
# 09:49 
jamietanna jacky +1 on versioning, but I'd say there are quite a few industry-used means for doing it we could follow? That does look interesting as an approach
[James_Van_Dyne] joined the channel
# 09:55 
petermolnar if anyone wants the avatars from /chat-names , I made a quick hack at petermolnar.net/indiewebavatars.php?name=[username] but it's far from perfect
# 10:04 
capjamesg[d] petermolnar++
# 10:04 
Loqi petermolnar has 8 karma in this channel over the last year (40 in all channels)
tetov-irc and Murray[d] joined the channel
# 10:53 
Caesar[m] <petermolnar> "if anyone wants the avatars from..." <- Maybe an idealistic dream, but from an indieweb viewpoint it would be great if avatars were picked up from our websites instead of having to manually update them at /chat-names, wiki sparklines, etc
balupton[d], kimberlyhirsh[d] and aspenmayer[d] joined the channel
# 11:20 
Murray[d] Wait, Loqi is a dinosaur on Discord!
# 11:21 
Murray[d] petermolnar++
# 11:21 
Loqi petermolnar has 9 karma in this channel over the last year (41 in all channels)
tracydurnell[d], petermolnar, hepphepp[d], dovedozen[d], Nan[d], nertzy, mro, jacky, gRegor, baracurda, cambridgeport90, samhenrigold[d], sayanarijit[d], Ramon[d], omz13 and Jeremiah[d] joined the channel
# 14:56 
jacky so my site uses the async webmention callback flow (https://indieweb.org/Webmention-brainstorming#Asynchronous_status_notification) to handle ingestion of Webmentions so it's more of a push flow versus pulling/polling (although I do have support for that to make quick importing simpler)
# 14:57 
jacky actually I think I answered my own potential question (how do I work with services that don't support callbacks?) - by mainly avoiding them or falling back to a poll of webmentions
# 14:58 
sknebel not sure what "mainly avoiding them" means in this case, since you cant choose if the site you send a WM to supports it or not
mro joined the channel
# 15:09 
jacky ah I should have mentioned that my site doesn't do a lot of the work for webmention processing
# 15:10 
jacky like Lighthouse does the actual work of sending and receiving and it could probably do some work to just invoke the callback after an hour or so as sent if nothing happened
# 15:17 
sknebel ok, yeah, for integration between bits of your site you can of course do that
# 15:17 
sknebel (I would've considered Lighthouse part of it)
# 15:33 
jacky okay I think I have a decent flow now that's all push-based
# 15:33 
jacky the tests let me think so lol
# 15:33 
jacky might blog about it
gRegor, mro, baracurda, adstew, JPax[m], KartikPrabhu, Silicon[d], cygnoir[d], Christian_Olivie, Darius_Dunlap[d], jacky and yequari[d] joined the channel
# 20:42 
jacky what is rel=subscribe
# 20:42 
Loqi rel-subscribe is an experimental rel value for linking from your home page to your subscription endpoint, and is currently prototyped by Aaron Parecki on aaronparecki.com; try the Follow button at https://aaronparecki.com/follow or any permalink https://indieweb.org/rel-subscribe
# 20:44 
jacky keep coming back to this
# 20:45 
jacky like if something like https://subtome.com was baked into browsers, I'd be glad
# 20:46 
sknebel in old firefox readers could register themselves for feeds
# 20:46 
sknebel but instead of extending that to feed discovery with a button in the UI they killed that stuff completely
[jeremycherfas] joined the channel
# 20:48 
jacky that could have been a really good feature in itself
# 20:50 
jacky https://addons.mozilla.org/en-US/firefox/addon/awesome-rss/ is what I use now to get that in a way
# 20:52 
jacky hmm I wonder if a bridge to use as a subscription endpoint could help people
# 20:53 
jacky like if they don't have Microsub on their site but they do use Feedly, it could point them to a page that'd subscribe them in Feedly
# 20:53 
jacky runs to user page
# 21:02 
jacky okay so https://indieweb.org/User:Jacky.wtf#rel-subscribe_proxy_endpoint is the notion
# 21:03 
jacky tbh the simplest form of this would be people being able to use their site to follow one another (which is good in itself) - the field could be autopopulated with the URL
# 21:03 
jacky I do think [schmarty] wrote something about autocomplete and URLs tho
# 21:05 
[schmarty] zoop: https://martymcgui.re/2020/05/25/a-hole-in-browser-autofill-support/
# 21:08 
[schmarty] and in answer to a clarifying suggestion: https://martymcgui.re/2020/05/26/121444/
# 21:08 
Loqi [Marty McGuire] Thanks, Ryan! I see the same behavior on Firefox, but probably wasn’t clear explaining it (“start typing from somewhere in the middle”). I do use this when I’m on my full computer, and it helps!
Where I get really frustrated is on my iOS dev...
# 21:09 
jacky heh nice
# 21:15 
@jackyalcine ↩️ I do! I actually wrote this reply to you from my site (heavy lifting done by http://brid.gy). I like having a place where I can point to people and say, “I did that. It might not be awesome, it might look weird, but I DID THAT”. And then put a… https://jacky.wtf/2022/4/ve/veExvi96fU7VC-75oi8f-neO (twitter.com/_/status/1512539024329744387)
# 21:18 
[tantek]1 jacky++
# 21:18 
Loqi jacky has 29 karma in this channel over the last year (70 in all channels)
KartikPrabhu, paulrobertlloyd and ShinyCyril joined the channel
# 22:15 
jacky capjamesg[d]: I think there's a markup issue on https://jamesg.blog/2021/07/21/building-a-blog-search-engine-part-ii/
# 22:16 
Loqi [James] Building a search engine for my blog: Part II
alex11 joined the channel
# 22:34 
Caesar[m] Heh, looks like some escaping is needed... unescaped `<title>` tag (and others) in the content
tetov-irc joined the channel
# 23:20 
jacky looks like the great firewall is tripping up my server https://jacky.wtf/2022/4/9f/9f4pvjwdAJvFn8Oj-nC34e0K
# 23:21 
jacky it got that by reading the plain HTML of the page (I leave `h-koype-stubbed-from-head` as a hint when I handle my reply contexts
nertzy joined the channel