nekr0z, reed, calebjasik, Lohn, vikanezrimaya, Abhas[m], hala-bala[m], SamWilson[m], astralbijection[, cambridgeport90[, P[m], LaBcasse[m], lasr[m], benatkin, [tw2113_Slack_], capjamesg[d], aaronpk[d], indieweb-irc-bri, hoenir, IWSlackGateway, sebbu, Saphire, capjamesg, petermolnar and hendursa1 joined the channel
#petermolnartiny bit offtopic, but not that much: I have mixed feelings on using git for my www folder, because git, unlike some other solutions, doesn't have the notion of history of a single file. I was wondering if anyone decided to use something else - svn, fossil, etc - for this specific use.
#sknebelpetermolnar: does SVN have a clearer notion of "history of a single file"?
#sknebel(I guess it might handle renames more natively? thats the main "meh" part of git for me in that case)
#petermolnartruth to be told, I never enjoyed svn, and that's worded lightly
#sknebelthe main issue with git IMHO would be renames, so unless that specifically is something you expect to come up a lot I'm not sure something else is worth it
#petermolnar(though still better, than ibm clearcase or that monster called MKS Integrity)
#sknebelfor me, the overhead of (mentally and setup wise) dealing with something else likely would be too annoying
#sknebelalthough I guess programmatic integration (if thats something your after) is painful enough in git that maybe that warrants it
#sknebel(I've considered integrating git to have access to edit history of posts)
jamietanna joined the channel
#jamietannapetermolnar what do you mean not having history for one file? `git log --follow path/to/file` should show the history for that file, including renames, for all time? :D
geman and neceve joined the channel; geman left the channel
#capjamesgAre there any good reads on improving relevance in search results?
#capjamesgMy blog search engine isn't as good at very broad queries (i.e. coffee) and I'd like to research what the best practices are.
hendursaga joined the channel
#capjamesgShould internal links to a page count toward ranking? Should word count be considered?
#aaronpkI think it depends on the goal of the search engine
#capjamesgIn this case, it's a blog search engine. The goal is for queries to as closely identify with the visitor's intent as possible.
#LoqiTechnorati was a real-time blog search engine that provided date-ordered results for text phrases or links, typically within seconds of when people published on their blogs https://indieweb.org/Technorati
#[KevinMarks]Technorati was mainly date ordered, but we also ordered by blog ranking (which was a count of links from distinct blogs in the last 6 months)
#petermolnarwhat were those "slow" and "fast" type of entries from a few days ago? If you were to apply that thinking capjamesg: there are kinda evergreen entries that last longer (eg. a review of a music album) vs things that expire easier (how to fix problem X on ubuntu 9.10)
#capjamesgI think the issue would be classifying fast vs. slow.
#capjamesgI suppose I could add some kind of numerical classifier that my crawler can interpret.
#capjamesgAlso, I would expect that word count is, to an extent, correlated with fast and slow in that "fast" posts are not likely to be as detailed, etc.
Lohn, Seirdy and [jeremycherfas] joined the channel