#dev 2023-06-01

2023-06-01 UTC
#
vladimyr
capjamesg: just saw this https://mstdn.link/
#
capjamesg
Interesting, thanks for sharing!
geoffo, ahappydeath, greglopez, [aciccarello], Jack, [fluffy], Nuve, [Ana_R], matt1, holiday_medley, sp1ff, gxt__ and bret joined the channel; darylsun left the channel
#
[tantek]
capjamesg, some ideas along these lines (question about how to label links in a site-wide Index in #indieweb channel) in /PSC
#
[tantek]
what is PSC
#
Loqi
A permashortcitation (or permashortid), abbreviated as PSC, is a short non-hyperlinked citation to a post permalink, in contrast to a permashortlink which does hyperlink to a post https://indieweb.org/psc
#
capjamesg
I also don't know how to do word sense disambiguation.
#
capjamesg
Like, IndieWeb could be mentioned in 20 different contexts, but the index could only say that there are 20 documents pertaining to the subject.
#
[tantek]
do you mean automatically or how do you display different word senses in the index?
#
[tantek]
or differences in "how relevant is this to the topic/word"?
#
[tantek]
like is it "merely" a mention, or is it a whole paragraph about the topic/word?
#
capjamesg
> do you mean automatically or how do you display different word senses in the index?
#
[tantek]
high-level, off-the-top-of-my-head answer to the original question (how to label the links): if it's a page, use the shortname (segment) of the page, e.g. "about". If it's a post and you have incremental post IDs starting from 1, use that number instead of a page number. If you don't have incremental low-numbered post IDs, make something up that looks like one, e.g. datetimestamp+postnumber (similar to the ID given in the PSC examples)
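(Editor's sketch, not from the chat.) The labeling rule [tantek] describes could look roughly like the Python below. The function name `index_label`, its parameters, and the `YYYYMMDD-n` shape of the made-up ID are all assumptions for illustration; the actual PSC examples on the wiki may use a different format.

```python
from datetime import datetime
from typing import Optional


def index_label(
    kind: str,
    slug: str = "",
    post_id: Optional[int] = None,
    published: Optional[datetime] = None,
    nth_that_day: int = 1,
) -> str:
    """Return a short label for a link in a site-wide index (hypothetical helper)."""
    # Pages are labeled by their shortname (URL segment), e.g. "about".
    if kind == "page":
        return slug
    # Posts with a low incremental ID use that number directly.
    if post_id is not None:
        return str(post_id)
    # Otherwise make up something ID-like: date stamp plus the post's
    # ordinal on that day. This exact format is an assumption.
    if published is None:
        raise ValueError("a post needs either post_id or published")
    return f"{published:%Y%m%d}-{nth_that_day}"


print(index_label("page", slug="about"))
print(index_label("post", post_id=42))
print(index_label("post", published=datetime(2023, 6, 1), nth_that_day=2))
```

Keeping the fallback ID derived from the publish date means labels stay stable as long as posts are not re-dated.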
#
[tantek]
this is why an index may need to be some mix of automatic + curation
#
capjamesg
I may defer context gathering to GPT.
#
[tantek]
I'd say wait until you have an actual problem you need to solve (e.g. too many mentions of which terms) and then we can analyze that particular case and see whether the information therein points to specific solutions
#
capjamesg
I'll share my index in the next hour or so here.
#
capjamesg
(No GPT)
#
capjamesg
I can't call it index.html without changing my homepage. I bet TBL didn't think about this :D
#
[tantek]
You could name it descriptively rather than functionally, e.g. siteindex, because it's an index across your whole site, right?
#
capjamesg
Does it make sense to have a site-wide bibliography?
eitilt joined the channel
#
[tantek]
depends on what kind of bibliography? IMO at least a top level /bibliography should be a list of all your published works, as somewhat described in https://indieweb.org/appearances#What_is_an_appearance_or_not (and the Wikipedia examples cited therein)
#
capjamesg
Pro tip: Building an index points out the typos you have made :D
#
capjamesg
Wait...
#
[tantek]
this is amazing capjamesg++
#
Loqi
capjamesg has 34 karma in this channel over the last year (101 in all channels)
#
capjamesg
Showing blog titles is way better than the NLP approach I was going to take (and orders of magnitude faster).
#
capjamesg
I tried asking ChatGPT to generate an index-like summary of the context of word usage in a post. It worked well on the one example I gave. But to do that over all my posts and ask for a single word? Not ideal.
#
[tantek]
did you limit your index word list to words with at least one capital letter in them?
#
capjamesg
I think it's missing quite a few.
#
capjamesg
It's not complete.
gRegor and jan6 joined the channel
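(Editor's sketch, not from the chat.) The two ideas in this exchange, listing post titles under each term and limiting terms to words with at least one capital letter, might be combined roughly as below. The `build_index` name, the `(title, text)` input shape, and the word regex are assumptions; capjamesg's actual implementation is not shown in the log.

```python
import re
from collections import defaultdict

# Rough word matcher (an assumption): a letter followed by letters,
# digits, apostrophes, or hyphens.
WORD = re.compile(r"[A-Za-z][A-Za-z0-9'-]*")


def build_index(posts):
    """Map each capitalized term to the sorted titles of posts that use it.

    `posts` is an iterable of (title, text) pairs (hypothetical shape).
    """
    index = defaultdict(set)
    for title, text in posts:
        for word in WORD.findall(text):
            # Keep only words containing at least one capital letter,
            # which favors proper nouns such as "IndieWeb".
            if any(ch.isupper() for ch in word):
                index[word].add(title)
    return {term: sorted(titles) for term, titles in sorted(index.items())}


posts = [
    ("Hello IndieWeb", "today we joined the IndieWeb community."),
    ("Webmentions", "the IndieWeb uses Webmention a lot."),
]
for term, titles in build_index(posts).items():
    print(term, "->", ", ".join(titles))
```

A filter like this also picks up sentence-initial words and misses lowercase terms, which fits the earlier point that an index may need some mix of automatic generation and curation.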
#
capjamesg
What is an offline feed reader?
#
Loqi
It looks like we don't have a page for "offline feed reader" yet. Would you like to create it? (Or just say "offline feed reader is ____", a sentence describing the term)
#
capjamesg
What is an offline reader?
#
Loqi
It looks like we don't have a page for "offline reader" yet. Would you like to create it? (Or just say "offline reader is ____", a sentence describing the term)
#
capjamesg
Has anyone done any thinking on offline readers?
#
[snarfed]
capjamesg lots of existing feed readers have offline support! eg NewsBlur
#
capjamesg
Do they have web offline support?
#
[snarfed]
ah, probably not, mostly via mobile apps. offline PWAs are definitely less common
#
gRegor
what is offline first
#
Loqi
offline first is a method of developing a website so that once a user has visited it on their device, it will always provide at least some content (posts) and/or features (like creating posts) when they revisit, even when offline without internet access, using technologies like Service Workers and Local Storage https://indieweb.org/offline_first