#dev 2019-08-06

2019-08-06 UTC
jjuran, KartikPrabhu, GWG, ben_thatmustbeme and IWSlackGateway joined the channel
#
Zegnat
My biggest issue with recaptcha is when it breaks a form when I do not have JS turned on, and the website I am on does not tell me it is because they want me to fill in a JS-only captcha
KartikPrabhu, [tantek], [svandragt] and IWSlackGateway joined the channel
#
jjuran
JavaScript Required; Didn't Mind-Read
cweiske and [fluffy] joined the channel
#
[fluffy]
gah, isso’s code is pretty badly-factored
#
[fluffy]
maybe I’ll switch back to disqus for the short term while I figure this out 😕
#
sknebel
[fluffy]: what's the problem with it?
IWSlackGateway, [tonz], [frank], [pfefferle] and jeremycherfas joined the channel
#
petermolnar
I came to learn a terrifying conclusion today: after dealing with so many archivers and bookmark managers, I went back to the idea to download everything (css, js, images), inline them, and save the html of a page. Except... I realized I'm inlining a silly amount of tracking JS as well. Which means I'd need to filter JS and CSS based on domains... for archiving. I'm starting to seriously consider we should actually go back to gopher
#
petermolnar
9.5MB HTML file O.O
#
Zegnat
That is basically how I have been archiving, petermolnar. Inline everything. Headless Chromium + https://github.com/WebMemex/freeze-dry and then dump the entirety to an HTML file
#
Loqi
[WebMemex] freeze-dry: Snapshots a web page to get it as a static, self-contained HTML document.
#
Zegnat
The results are pretty good, but yes, big
#
petermolnar
I wasn't aware of this lib, I'll try it
#
petermolnar
ah, this is npm
#
Zegnat
Yes, wasn’t a problem for me, since the headless chromium controller I used (Puppeteer, by Google) was through npm too.
#
Loqi
[Zegnat] node-beanstalkd-web-archiver: Worker that creates static DOM dumps of web pages for archival purposes.
#
Zegnat
The lib has to run in a DOM situation, so you may as well run it in a headless browser. And for a browser to run it, the lib may as well be javascript
[svandragt], [tantek] and [jgmac1106] joined the channel
#
jjuran
Inline CSS and JS, but record the URLs of the original resources so filters can be applied later.
[Lewis_Cowles] joined the channel
#
[Lewis_Cowles]
Anyone else archiving using a proxy? If you pull into your same domain, you can use a backend language to do things like replace specific js files, use or pass to jQuery to parse DOM using DOM, then paste details elsewhere.
#
[Lewis_Cowles]
Very old version of a proxying scraper (takes a list of URLs), extracts details
#
[Lewis_Cowles]
This one was a version used to scrape an old Drupal to migrate towards WordPress-esque JSON
eli_oat, loicm and [tonz] joined the channel
#
[tonz]
This is of interest to me too. I am looking to replace Evernote, and one of the functionalities to replace is storing local copies of webpages as bookmark.
#
[jgmac1106]
oooh so you can have them offline?
[KevinMarks] joined the channel
#
[KevinMarks]
1.9MB for a tweet is something Maciej should mock
#
Zegnat
The actual tweet would probably be smaller, base64 encoding does inflate image size.
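A quick sketch of the inflation Zegnat mentions: standard base64 emits 4 output characters for every 3 input bytes, so an inlined image grows by roughly a third. The 300-byte sample and the function name are made up for illustration.

```python
import base64


def base64_overhead(raw: bytes) -> float:
    """Return encoded size divided by raw size for standard base64."""
    encoded = base64.b64encode(raw)
    return len(encoded) / len(raw)


# Hypothetical payload: 300 zero bytes standing in for image data.
sample = bytes(300)
encoded_length = len(base64.b64encode(sample))
ratio = base64_overhead(sample)
```

With a length that is a multiple of 3 there is no padding, so the ratio is exactly 4/3; other lengths add up to two `=` padding characters on top.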
#
Zegnat
jjuran: I feel like that is something treora worked to add to freeze-dry, where it would record the original URLs in a data- attribute or something
#
treora
Yep, freeze-dry stores the original URLs of inlined resources in data-original-${attribute}
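A minimal sketch of the idea jjuran and treora describe: inline a resource as a data: URI, but keep the original URL in a data-original-src attribute so a filter can act on it later. This is not freeze-dry's code (freeze-dry is a JavaScript library); the function names, the example URLs, and the blocked-host set are all assumptions for illustration.

```python
import base64
from html import escape
from urllib.parse import urlsplit


def inline_image_tag(original_url: str, content: bytes, mime_type: str) -> str:
    """Build an <img> with a data: URI src, keeping the original URL alongside."""
    payload = base64.b64encode(content).decode("ascii")
    data_uri = f"data:{mime_type};base64,{payload}"
    # Attribute name follows the data-original-${attribute} pattern from the chat.
    return (
        f'<img src="{data_uri}" '
        f'data-original-src="{escape(original_url, quote=True)}">'
    )


def is_from_blocked_host(original_url: str, blocked_hosts: set) -> bool:
    """Decide later, from the recorded URL, whether a resource should be dropped."""
    return urlsplit(original_url).hostname in blocked_hosts


# Hypothetical inputs; the bytes are just the start of a PNG signature.
tag = inline_image_tag("https://example.com/cat.png", b"\x89PNG", "image/png")
blocked = is_from_blocked_host("https://tracker.example/t.js", {"tracker.example"})
```

Because the original URL survives in the archived HTML, domain-based filtering of tracking scripts can be done after the fact instead of at capture time.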
#
treora
It is something I came up with, however; it would be nicer to have some standard to follow.
#
[tonz]
[jgmac1106] yes so you can have them offline. As a way to ensure you have the copy you actually reference, but also because of stuff disappearing behind paywalls etc. Another road would be through archive.org but for my work usage having offline copies is extremely useful.
#
jgmac1106
I know there are long term plans for Known to have offline support with service workers…I might just wait, but I live in a land of PDFs those I usually have local copies of somewhere…just hope I am at that locale when I need them
#
Zegnat
[tonz]: yeah, sounds like something like treora’s project where you can take a snapshot of the current DOM you see in your browser for future referencing is exactly what you need!
#
Zegnat
My project just takes his work and runs it on a server so I can instruct an external box to generate an archive copy for me
#
Zegnat
(Standing on the shoulders of giants and all that!)
#
jeremycherfas
[jgmac1106] One of the best uses of Dropbox for me is that all my PDFs (at least, the ones I have bibliographised in Bookends) live in a Dropbox folder.
[jgmac1106] joined the channel
#
[jgmac1106]
it was mendeley for me, now I have been just grabbing the link to a pdf and making a bookmark in Known, or just uploading to my server if I can't share
#
[jgmac1106]
Mendeley synced to Google Drive for awhile, but trying to own it all myself, but I have privilege of academic access, most knowledge not locked away
#
[jgmac1106]
most links would be to a paywall for most
#
jeremycherfas
Even when I have access, I download for preference. Means I can annotate, share etc.
#
jeremycherfas
Mendeley sucked for me.
#
omz13
Why is it that reference management software sucks so much?
#
omz13
It's a rhetorical question, BTW.
[snarfed] joined the channel
#
jeremycherfas
I highly recommend Bookends on Mac OSX
#
jeremycherfas
I suspect one reason online versions -- I have tried only Mendeley and, briefly, Zotero -- suck is because they do not have sufficient focus or a decent business plan.
#
omz13
Did you use Papers... it was good (they finally got most of the bugs out in V3), but now it's gone all subscription based (yuk!)
#
jeremycherfas
I didn't. Too happy with Bookends. But honestly, that's one application where I would be happy to subscribe because it is so important to me.
#
[jgmac1106]
mainly they suck because humans suck at data entry or just argue about what kind of data the files should have....no pdf parses well, most citation databases are full of garbage,....and because PDFs suck in general...
#
[jgmac1106]
will look into Bookends
#
[jgmac1106]
..but I really want to keep trying to do this all from my own site....at least the notes, maybe a manual copy
#
omz13
What I found was some publishers were better than others at embedding metadata on their site, so (Papers) was usually able to automagically get the citation data when it downloaded the paper... and a lot of times there was minimal metadata in the PDF (sigh)
[eddie] joined the channel
#
jeremycherfas
Bookends does a very good job of parsing from PDFs and DOIs, and Publishers these days do a very good job of providing bibtex and others for download.
#
jeremycherfas
Bookends is not on your site, but is on your machine. And I think it would be a simple matter to present data on your own site. I know you could do static export of HTML formatted any which way you choose with links to the PDF if you have it.
[Rasul_Kireev] joined the channel
#
Loqi
[Greg McVerry] Learning to learn online: A study of perceptual changes between multiple online course experiences
#
[jgmac1106]
I add a pdf to my server, bookmark it, go back and read and add blockquotes and notes
#
[jgmac1106]
been doing that more than hypothes.is lately
seki[m4 and funwhilelost[m] joined the channel
#
[jgmac1106]
now that I have another 50 gigs of storage I can finally move all my pdfs over
[pfefferle] and [renem] joined the channel
#
[renem]
[snarfed] As a follow up to my ActivityPub "problems" with Mastodon.host (if you remember) with bridgy.fed, I created a new account on mastodon.social and it worked as it should. Sorry for the work, but I didn't reach the admins of mastodon.host and also got no information that bridgy.fed or any other services are "blocked".
#
[renem]
[snarfed] In the end, I moved my account from mastodon.host to [renem]@mastodon.social today and all works fine now. Thanks for your patience and help!
#
[jgmac1106]
what I would need to figure out first jeremycherfas is how to batch rename a ton of pdfs to just author and title.
#
[jgmac1106]
[Rasul_Kireev] you can use this tool to check your h-card: https://indiewebify.me/
#
[jgmac1106]
and we have been working on the MDN docs for microformats that has some examples: https://developer.mozilla.org/en-US/docs/Web/HTML/microformats
#
[Rasul_Kireev]
[jgmac1106] I have been using this tool, and it is the reason I'm asking you guys. Because I had trouble declaring two separate h-cards, I ended up adding a span h-card around the whole page, which is probably not the best solution.
#
[jgmac1106]
so if you had p-card as a CSS selector you can use microformats too: <section class="p-card h-card"></section> that would be your h-card
jackjamieson joined the channel
#
[jgmac1106]
what is your url?
#
[Rasul_Kireev]
Thanks, I will head over to MDN resource you mentioned to check it out.
#
[Rasul_Kireev]
rasulkireev.com
#
[Rasul_Kireev]
[jgmac1106] Thanks for your help! I'm sorry, I didn't want to bother anyone too much with an issue like these!
#
[jgmac1106]
move the h-card from the span and to the div with the navbar id
#
[jgmac1106]
I just don't see the closing </span> but I could be missing it
#
[jgmac1106]
I would move the h-card up and delete the span,
#
[jgmac1106]
that would mean your nav items would be included in your h-card; you can decide if you like that or do not like it and then act accordingly
#
[jgmac1106]
and never feel like it is a bother with any issue, that is why the community is here
#
[jgmac1106]
you could turn that entire page into one gigantic h-card, some people do that
#
[jgmac1106]
Who is zegnat?
#
Loqi
Martijn van der Ven is a long-time web tinkerer living in Sweden (CEST or Europe/Stockholm timezone). Pronouns: he or they https://indieweb.org/User:Vanderven.se/martijn/
#
Loqi
Martijn van der Ven
#
[jgmac1106]
but I think the missing closing </span> is root of most of your trouble you don't close until the very bottom and after the </div> where the span opens
#
[jgmac1106]
in the above example you can see an entire page h-card....but the question of what happens when I have two h-cards on a page comes up often...and I am not sure I can give an exact answer
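A rough way to see how many h-cards a page declares, assuming nothing more than the standard library: walk the HTML and note every element whose class list contains h-card. The class name HCardFinder and the sample markup are invented for illustration, and this is only a class-name scan, not a microformats2 parser; it says nothing about nesting or which card is the representative one.

```python
from html.parser import HTMLParser


class HCardFinder(HTMLParser):
    """Collect the tag names of elements whose class list contains h-card."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        classes = (dict(attrs).get("class") or "").split()
        if "h-card" in classes:
            self.found.append(tag)


# Hypothetical page with two separate h-cards.
page = (
    '<section class="h-card"><a class="p-name u-url" href="/">Example Person</a></section>'
    '<span class="h-card"><span class="p-name">Example Org</span></span>'
)
finder = HCardFinder()
finder.feed(page)
```

For real checking, a full microformats2 parser (which is what tools like indiewebify.me rely on) is the better choice.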
#
KartikPrabhu
what is representative h-card
#
Loqi
The representative h-card for a page is an h-card on that page that represents that page, if any, as not all pages are about a person or organization, a page might not have a representative h-card https://indieweb.org/representative_h-card
[timothy_chamber joined the channel
#
[jgmac1106]
ohh I always thought it was an either or...I put u-uid OR u-url..I never put both...goes to fix many a thing
#
[Rasul_Kireev]
Thanks all, I have a bunch of things to try updating now! Thanks for examples too!
#
[Rasul_Kireev]
[jgmac1106] why do you ask about zegnat? Do I have that somewhere on my page? I have no idea who or what that is 🙈
#
[jgmac1106]
ohh sorry, no I was asking our friendly bot to give me a url.
#
[jgmac1106]
zegnat has a single page...really gigantic h-card so I was showing an example
#
[jgmac1106]
Who is jgmac1106?
#
Loqi
J. Gregory McVerry (Greg) is an educator trying to use the web to help engineer better teachers https://indieweb.org/User:Jgregorymcverry.com
#
[jgmac1106]
So Loqi helps us out.
#
[jgmac1106]
Who is Loqi?
#
Loqi
Loqi is a friendly and useful bot/digital therapist present in the IndieWeb discussion channels https://indieweb.org/User:Loqi.me
#
[jgmac1106]
You can also use Kaja to check your microformats
#
[jgmac1106]
!mf2 rasulkireev.com
#
[jgmac1106]
well you saw an example of that working above
#
[jgmac1106]
tonz, omz13, jeremycherfas, thx for inspiration now transferring 1,778 pdfs to my url
#
[jgmac1106]
I should have done this forever ago...though never had the space
#
Loqi
yea!
#
[Rasul_Kireev]
Wow. That's incredible. Not something I see everyday.
[KevinMarks] joined the channel
#
[jgmac1106]
Loqi does lots of cool things.
#
[jgmac1106]
What is an h-card?
#
Loqi
h-card is the microformats2 vocabulary for marking up people, organizations, and venues on web sites https://indieweb.org/h-card
#
[jgmac1106]
we can ask questions, find about people, ask about times, make memes, and feed loqi
#
[Rasul_Kireev]
Ok, I have to ask. How do you feed it?)
#
[Rasul_Kireev]
Feed Loqi
#
[Rasul_Kireev]
No, doesn't work 🙈
#
aaronpk
gives Loqi a cookie
#
Loqi
enjoys the cookie
#
[jgmac1106]
Feed Loqi All the Things
#
jackjamieson
Not Acceptable! An appropriate representation of the requested resource could not be found on this server. This error was generated by Mod_Security.
#
jackjamieson
Documenting my mod_security woes in case it's useful for anyone's future reference. On my WordPress site I started getting the following error whenever I try to log into an app using the IndieAuth plugin:
#
jackjamieson
(whoops, reverse the order of my last two messages :)
#
jackjamieson
My host is bluehost. I contacted their technical support and they asked me to identify the URL of the plugin. I gave them the authorization endpoint URL (https://jackjamieson.net/wp-json/indieauth/1.0/) and the token endpoint URL (https://jackjamieson.net/wp-json/indieauth/1.0/token)
#
jackjamieson
The support representative disabled mod_security at those URLs, but apparently it kept re-enabling itself for some reason. So it's now been escalated to their technical team who will email me
[fluffy] joined the channel
#
[fluffy]
[sknebel] oh, it’s just that like… all of the database stuff is ad-hoc, there’s no documentation, the code flow is really all over the place, etc.
#
[fluffy]
I was trying to add in a way of having a thread_id that’s separate from the page URI so that reply notifications would have a sensible link to the post, and it’s… complicated.
#
[fluffy]
also very inconsistent internally with how things are named or handled. It definitely has a “this is a quick hack that grew out of control” feel to it.
#
[fluffy]
For now I’ve decided to just normalize my thread IDs to be the same as my post IDs which lets Publ take care of routing users to the right page 😕
#
[fluffy]
but this means that it’ll be a pain to migrate my really old threads from before I was using Publ. Which, to be fair, was kind of lazy and short-sighted of me. But I like encouraging laziness that works. 🙂
#
[snarfed]
renem++ glad it's working now!
#
Loqi
renem has 1 karma over the last year
[tantek] joined the channel
#
aaronpk
"this is a quick hack that grew out of control" is a pretty accurate description of Loqi too
#
[fluffy]
yeah, I tend to be a lot more tolerant of that when it’s a one-off thing that’s not intended to be reusable though 🙂
#
[jgmac1106]
I now own all my pdfs from my url...will have to remove some doubles that gathered over the years...but excited to never have to worry about "it is on this machine, google drive, etc" http://jgregorymcverry.com/readings/
#
[jgmac1106]
probably should password protect page too not to get take down notices from publishers
#
[tantek]
[jgmac1106] start with immediately adding a rule to your http://jgregorymcverry.com/robots.txt that blocks all bots from that directory
#
[jgmac1106]
What is robot.txt?
#
Loqi
It looks like we don't have a page for "robot.txt" yet. Would you like to create it? (Or just say "robot.txt is ____", a sentence describing the term)
#
aaronpk
robot.txt is /robots.txt
#
[jgmac1106]
this is it actually I think: https://indieweb.org/robots
#
[jgmac1106]
that is the link to the actual robot.txt on the wiki
#
[jgmac1106]
User-agent: *
#
[jgmac1106]
Disallow: /readings/ I think that would be correct yes?
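One way to sanity-check those two lines, sketched with Python's standard urllib.robotparser: feed it the rules and ask whether a crawler may fetch a path. The bot name "AnyBot" and the file name paper.pdf are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# The two rules proposed in the chat.
rules = [
    "User-agent: *",
    "Disallow: /readings/",
]

parser = RobotFileParser()
parser.parse(rules)

# "AnyBot" and "paper.pdf" are placeholders for illustration.
can_crawl_readings = parser.can_fetch(
    "AnyBot", "http://jgregorymcverry.com/readings/paper.pdf"
)
can_crawl_home = parser.can_fetch("AnyBot", "http://jgregorymcverry.com/")
```

Here can_crawl_readings comes out False and can_crawl_home True. Note that robots.txt only asks well-behaved bots to stay away; it does not restrict access the way password protection would.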
#
[jgmac1106]
thanks I may work on that page so it doesn't take as many clicks to learn, but first finish grades and do NYC page
#
[jgmac1106]
but robot.txt page is up..
#
[jgmac1106]
..then I go to meta and see tantek already had same thought
#
[tantek]
[jgmac1106] added examples to here for you that you can copy/paste and edit accordingly: https://indieweb.org/robots_txt#Examples
#
Loqi
tantek has 19 karma in this channel over the last year (113 in all channels)
#
[tantek]
agreed, I get particularly frustrated by the "keep clicking lots of times to try learn even the basics" problem that so much "documentation" has
#
[tantek]
I also get very frustrated with very long and wordy examples when really most people are looking for a 2 line (at most) example to copy paste
#
[jgmac1106]
it's a tough balance as long scrolls hurt comprehension as well...why I like accordion boxes....
#
[jgmac1106]
the worst is food recipes now
#
[fluffy]
there’s… legal reasons why recipes are the way they are
#
[jgmac1106]
similar issue....I want the cook time and I have to read about your journey to some market in Amalfi
#
[fluffy]
they’re not GOOD reasons, but they exist.
#
[tantek]
I would say never ascribe to legality what is explained by incompetence
#
[fluffy]
there’s legal precedent that recipes in and of themselves aren’t copyrightable, so people tack on these increasingly-complex stories as a way of adding copyrightable flavor.
#
[tantek]
[jgmac1106] right, if I was interested in some journey thing I'd go elsewhere for some Belairing
#
[fluffy]
and that led to it being A Thing People Do which then made it sort of a standard thing
#
[tantek]
the inverse of that are Tweet recipes
#
[tantek]
no journey, no nonsense
#
[tantek]
ingredients + instructions. done
[miklb] joined the channel
#
[jgmac1106]
old twitter design paradigm?
#
[miklb]
[eddie] or GWG - micropub is working again 🎉 but tried posting from Indigenous and my note was truncated at 140 characters. Any thoughts why?
#
[tantek]
do you have to post a few decent 140 character posts in order to "unlock" the ability to post more?
[eddie] joined the channel
#
[eddie]
Indigenous iOS or Android?
#
[eddie]
Hmm, shouldn't be Indigenous. I post > 140 all the time
#
[eddie]
I think it must be on the Micropub endpoint side
#
[miklb]
I tried posting something that was ~180 characters. It stripped the note after 140 on my WP site (data loss). Just curious where I would start.
#
[tantek]
that's harsh
#
[miklb]
I hadn’t been able to post micropub in some time, but after recent round of updates, I found a combination that works. what else would you call it?
#
[miklb]
for all I know it’s something I have configured wrong.
#
[miklb]
maybe in my syndication target
#
aaronpk
my recipes are pretty barebones https://aaronparecki.com/recipes
#
aaronpk
I do enjoy the series of pictures in those recipe blog posts, but the longwinded text is a little frustrating sometimes
#
[eddie]
miklb Looks like there is no logic in Indigenous for 140 characters. The title field appears after 280 characters, but since that's double 140 characters I can confirm you should definitely look at the Micropub endpoint
#
[eddie]
(I thought maybe the title field appearing was altering the data sent in some way, but doesn't seem like that's it)
#
[eddie]
Speaking of Indigenous/Remark, etc. I finally have a laptop with macOS 10.15 beta on it so I can move forward progress on that front
#
[fluffy]
yeah my recipe-recipes are pretty barebones too, http://beesbuzz.biz/food
#
[fluffy]
but not all of those are recipes, some of those are food blogging
[grantcodes] joined the channel
#
[grantcodes]
One of my next projects is going to be a sort of micro.blog equivalent for recipes 🙂
#
[fluffy]
Oh, I came across one of those not too long ago, because a friend of a friend had posted a variation on one of my recipes there
#
[fluffy]
let me see if I can find it again
#
[fluffy]
pepperplate.com
#
[grantcodes]
Oh that's cool. Looks like it's more for storing than publishing though?
#
[fluffy]
yeah probably
#
[miklb]
[eddie] looks like it's definitely in the syndicate side of the WP micropub ecosystem. Not sure why it wants to strip all incoming content over 140 though. I would get syndicating only 140 without a link.
#
[grantcodes]
Not tried it but this looks quite interesting for recipes too: https://www.copymethat.com/ (random name though)
#
[eddie]
miklb Yeah, how strange
#
[fluffy]
Wow that is an awkward name.
#
aaronpk
i use Paprika, and have been very tempted to make some sort of micropub export for it
#
[fluffy]
anyway I just use Publ to publish everything and it’s fine for recipes 🙂
#
[grantcodes]
I'm hoping to do something with git so you can copy other people's recipes, change them, and show the differences too
#
[miklb]
I have 2 different syndication targets for Twitter and tested them both. Same behavior.
#
[grantcodes]
Also for my recipes site I am planning on severely limiting the intro and outro junk that no one cares about (mentioned above) - maybe some language recognition to see if they mention "my family really loves xyz"
#
[grantcodes]
[miklb] It was only the twitter copy that was truncated or the content in wp as well?
#
[miklb]
the note was > 140 characters (I’ve since updated the post on WP) but micropub created the WP note with 140.
#
[miklb]
I’ll look at the logs shortly.
#
[miklb]
The note was 264 characters. I wrote it in Drafts first so I wouldn’t lose it, posted with Indigenous. Micropub created a 140 character note. I’m afraid I wasn’t being clear.
#
[grantcodes]
Ah got it now :thumbsup: and yeah that's not great
#
[miklb]
s/Micropub/wordpress-micropub/
#
[miklb]
I’ve been having pretty good luck just saving my WP admin to my home screen and posting natively. The admin on mobile is just fine now for it.
[pfefferle] joined the channel
#
[snarfed]
agreed, wordpress's native mobile apps and mobile web admin UI are all decent
#
[miklb]
mobile app is good if you don’t have custom fields in the posting UI. That’s been the biggest complaint I’ve seen. Which affects the post kinds plugin.
[tonz], jackjamieson and [KevinMarks] joined the channel
#
[KevinMarks]
Aaron has good h-recipe markup https://aaronparecki.com/recipes
[manton], [timothy_chamber and [fluffy] joined the channel
#
[fluffy]
Hmm, unmung doesn’t do a great job of formatting my site. Am I doing something wrong in my mf2? (probably!) http://www.unmung.com/indiecard?url=http%3A%2F%2Fbeesbuzz.biz%2Fblog%2F
#
[fluffy]
Notably it’s losing all of the paragraphs and formatting in the text.
#
[fluffy]
Ah, it’s because it’s just getting p-summary, I don’t have e-content on my index page.
#
[fluffy]
Which is purposeful because I didn’t want to deal with the below-the-cut thing just yet.
#
[fluffy]
and technically I should be using e-summary, not p-summary, I guess
#
Zegnat
If you want parsers to keep the HTML used within the summary: yes :)
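A simplified sketch of the difference Zegnat is pointing at: in microformats2 parsing, a p-* property yields plain text, while an e-* property yields both the HTML and a plain-text value. The helper below is an assumption-laden illustration (the function and class names are invented, and real parsers use a more careful text-extraction algorithm), not how any particular parser is implemented.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collect text nodes and ignore all tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def parse_property(prefix: str, inner_html: str):
    """Return what a p-* or e-* property would roughly carry for this markup."""
    stripper = TextOnly()
    stripper.feed(inner_html)
    stripper.close()
    text = "".join(stripper.parts)
    if prefix == "e":
        # e-* keeps the markup and also exposes a plain-text value.
        return {"html": inner_html, "value": text}
    # p-* (and anything else in this sketch) is plain text only.
    return text


# Hypothetical summary markup with two paragraphs.
summary_html = "<p>First.</p><p>Second <em>bit</em>.</p>"
as_p = parse_property("p", summary_html)
as_e = parse_property("e", summary_html)
```

With p-summary the paragraph boundaries are gone from the result, which matches the lost formatting [fluffy] saw; with e-summary a consumer can still render the original HTML.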
loicm, eli_oat1 and KartikPrabhu joined the channel
#
[KevinMarks]
I'll have a look, I shouldn't throw up on the user like that
#
sknebel
What is mod_security?
#
Loqi
mod_security is a web application firewall for the Apache web server https://indieweb.org/mod_security
#
sknebel
jackjamieson: ^^^ can you add a note to that if bluehost isn't mentioned already?
#
jackjamieson
sknebel - good call. Will do so now
#
[KevinMarks]
talking of hacks that went too far, there is way too much conditional code in the jinja templates in unmung
#
[KevinMarks]
it's doing an OK job on your recipes too, though not h-recipe parsing
[pfefferle] joined the channel
#
[fluffy]
Yeah, my recipes don’t do h-recipe
#
[fluffy]
also unmung isn’t properly absolutifying the image URLs.
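For reference, resolving a relative image URL against the page it came from is a one-liner with the standard library. The image paths here are made up; only the base URL comes from the chat, and this is not a claim about how unmung handles it.

```python
from urllib.parse import urljoin

# Base URL of the page being parsed (from the unmung link above).
page_url = "http://beesbuzz.biz/blog/"

# Hypothetical relative and root-relative image paths.
absolute = urljoin(page_url, "images/photo.jpg")
root_relative = urljoin(page_url, "/static/photo.jpg")
```

The first resolves under /blog/, the second against the site root, which is the distinction a parser has to get right when a feed is displayed on another domain.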
#
[fluffy]
oh also each card is just linking to the blog page rather than the individual entry.
#
[fluffy]
which I assume is also my fault somehow
#
[fluffy]
although my entries seem to be setting u-url correctly?
gRegorLove joined the channel
#
[KevinMarks]
I may be using the author url - I'll check later
#
jamietanna[m]
Spent the evening restructuring how my site stores data so it's easier to work with micropub - not yet there but will be awesome when it's done so I can then carry on with writing the micropub endpoint
jjuran, gRegorLove_, KartikPrabhu, [fluffy]1 and [KevinMarks]1 joined the channel