#microformats 2016-02-29

2016-02-29 UTC
fuzzyhorns joined the channel
# 01:01 
tantek edited /microformats2-parsing-issues (+342) "/* exclude style elements before parsing */ drop both style and script when parsing" (view diff)
# 01:02 
tantek greetings mf2 parser developers, I've made a proposal to resolve the style (and script) tag issue, please review and provide feedback: http://microformats.org/wiki/microformats2-parsing-issues#exclude_style_elements_before_parsing
# 01:02 
tantek tommorris, kylewm, aaronpk, KevinMarks, et al
# 01:06 
aaronpk http://microformats.org/wiki/index.php?title=microformats2-parsing&curid=8366&diff=65418&oldid=65356 also solves the camelCase issue on that page, right?
# 01:08 
aaronpk edited /microformats2-parsing-issues (+152) "/* exclude style elements before parsing */ +1" (view diff)
fuzzyhorns joined the channel
# 01:17 
tantek ah, yes it does!
# 01:17 
aaronpk I know I said I wasn't going to write actual code for the php mf2 parser, but I just broke that rule
# 01:17 
aaronpk but I have it removing script tags now, I think.
# 01:18 
aaronpk oh darn nevermind, I need to make it search recursively :(
# 01:26 
aaronpk ok yeah this is hard
# 01:39 
aaronpk I think I've got it
# 01:42 
aaronpk woo hoo all test are passing https://travis-ci.org/indieweb/php-mf2/builds/112496040
# 01:43 
aaronpk https://github.com/indieweb/php-mf2/pull/83
fuzzyhorns joined the channel
# 02:25 
@ProjectPeachUK We've #played with #microformats. Love the #idea of #marking up our #business data to #machines as well as #humans! #biznoticeUK #cpp (twitter.com/_/status/704130279326289920)
davidmead, fuzzyhorns, Zegnat, netweb, dogada, tantek, adactio and Garbee joined the channel
# 13:22 
@bittopper Microformats: Empowering Your Markup for Web 2.0 by John Allsopp - https://www.bittopper.com/item/82995851decb8bce65f154869d379ba4a3d79/ (twitter.com/_/status/704295722489815042)
fuzzyhorns, nitot, tantek, TallTed, misa_ and JohnBeales joined the channel
# 16:39 
tantek edited /microformats2-parsing-brainstorming (+1177) "/* Parse language information */ additionally id, and consider html-lang parsed property name" (view diff)
# 16:39 
misa__ hello channel, just found this line on the microformats wiki: microdata was explicitly dropped by the W3C (and therefore not part of W3C HTML5) due to a lack of interest by anyone to edit the spec and keep it up to date.
# 16:39 
kylewm edited /microformats2-parsing-issues (+21) "/* exclude style elements before parsing */ +1" (view diff)
# 16:40 
tantek misa__: welcome, and yes that's a summary result from the W3C HTML Working Group discussions on microdata
# 16:43 
misa__ I understand that, and just a note here, I am quite unexperienced regarding these concepts so I would appreciate some guidance from more experienced folks, just this seems a bit outdated as microdata blog seems to be quite a busy place...
# 16:44 
tantek misa__: it's up to date. there has been no further work on microdata at W3C so it's still just as dead/dropped from a W3C standards perspective
# 16:45 
tantek people work on all kinds of things outside W3C
# 16:45 
tantek which is fine too, it just means those things are not W3C standards
# 16:46 
tantek misa__: if you're interested in the latest work on microdata-like added markup, take a look at microformats2: http://microformats.org/wiki/microformats2
# 16:46 
@kykyorg Ð’Ð´Ð¾Ð³Ð¾Ð½ÐºÑƒ Ð¸ Ð²Ð¸Ð´ÐµÐ¾ Ð´Ð½Ñ: Ð›ÐµÐ¾ Ñ ÐžÑÐºÐ°Ñ€Ð¾Ð¼ Ð² Ñ€ÑƒÐºÐ°Ñ… Ñ€Ð°ÑÑÐºÐ°Ð·Ñ‹Ð²Ð°ÐµÑ‚ ÐºÐ¸Ð½Ð¾ÑÐ»Ð¸Ñ‚Ðµ Ð¿Ñ€Ð¾ Ð³Ð»Ð¾Ð±Ð°Ð»ÑŒÐ½Ð¾Ðµ Ð¿Ð¾Ñ‚ÐµÐ¿Ð»ÐµÐ½Ð¸Ðµ: http://kyky.org/microformats/2016-02-29/video (twitter.com/_/status/704346910224678913)
# 16:46 
tantek microformats2 is an even simpler replacement for microdata
# 16:46 
@kykyorg Ð˜ Ð²Ð¸Ð´ÐµÐ¾ Ð´Ð½Ñ: Ð›ÐµÐ¾ Ñ ÐžÑÐºÐ°Ñ€Ð¾Ð¼ Ð² Ñ€ÑƒÐºÐ°Ñ… Ñ€Ð°ÑÑÐºÐ°Ð·Ñ‹Ð²Ð°ÐµÑ‚ ÐºÐ¸Ð½Ð¾ÑÐ»Ð¸Ñ‚Ðµ Ð¿Ñ€Ð¾ Ð³Ð»Ð¾Ð±Ð°Ð»ÑŒÐ½Ð¾Ðµ Ð¿Ð¾Ñ‚ÐµÐ¿Ð»ÐµÐ½Ð¸Ðµ: http://kyky.org/microformats/2016-02-29/video (twitter.com/_/status/704347134519349248)
# 16:47 
misa__ yes I can see that, just deciding with what to get going is somehow overwhelming, then there is schema.org... could you care to share your thoughts in a brief on that?
# 16:48 
tantek misa__: schema.org is a Google run effort, with some contributions from Microsoft and Yandex. It's not an open standard.
# 16:49 
tantek I sympathize with the sense of being overwhelmed
# 16:49 
tantek hence why a lot of us have worked on simplifying things with microformats2
KartikPrabhu joined the channel
# 16:50 
misa__ so as far as indieweb is concerned this is not something i should care about?
# 16:51 
misa__ (google's shema.org)
# 16:54 
tantek correct, it's been pretty much completely ignored
# 16:54 
tantek there has been some use of OGP / Twitter Cards as a fallback for some use-cases, but schema is largely ignored for being overly complex and unnecessary
# 16:55 
tantek welcome KartikPrabhu !
# 16:56 
KartikPrabhu yo!
# 16:56 
KartikPrabhu was just checking logs
# 16:57 
KartikPrabhu misa__: you should decide what to use depending on what you'd like to use if for
# 16:57 
KartikPrabhu Schema.org has not been very useful for indieweb things mainly due to its complexity of parsing and publishing
# 17:05 
misa__ indieweb seems very right for me, I can comprehend most of its concepts, just this whole thing with microdata/microformats and the rest got me overwhelmed...
# 17:06 
tantek edited /microformats2-parsing-brainstorming (+141) "/* Parse language information */ first instance of id attributes only as a way to de-dup / uniqueify id attrs at parse time" (view diff)
# 17:06 
tantek misa__: microdata is ignorable for indieweb. no one is actively using it.
# 17:06 
KartikPrabhu misa__: I would suggest starting with microformats as they are the simplest ones to publish
# 17:07 
tantek there may be a few folks publishing a few microdata things experimentally, but it's never gotten any traction in the peer to peer independent web
# 17:07 
tantek misa__: glad to hear indieweb seems very right for you! come on by to #indiewebcamp and say hello to discuss indieweb concepts :)
# 17:07 
KartikPrabhu schema.org is so horribly over-thought that it is ignorable too
gRegorLove joined the channel
# 17:10 
tantek welcome gRegorLove !
# 17:18 
gRegorLove Howdy
# 17:20 
ben_thatmustbeme aaronpk: moving that over here
# 17:20 
aaronpk example URL: http://xray.p3k.io/parse?url=http%3A%2F%2F2015.aaronparecki.com%2Freplies%2F2015%2F10%2F07%2F1%2Famp
# 17:20 
KartikPrabhu amp;dr ;)
# 17:20 
aaronpk so notice how I made the value of in-reply-to just the URL, even though it's actually an h-cite on my site
# 17:21 
aaronpk and then moved the actual h-cite content outside the main entry
# 17:23 
aaronpk my goal with XRay (and jf2) is to reduce the number of exceptions you have to deal with when consuming a page
# 17:23 
ben_thatmustbeme hmmm, interesting
tantek joined the channel
# 17:24 
ben_thatmustbeme so if you had just an embedded object, would it also be just in "refs"
# 17:24 
aaronpk so rather than if(is object) {...} else if(is url) { ... } it's just always a URL, and if you want you can check if there's extra data about the URL
# 17:24 
aaronpk not sure about that case yet
# 17:24 
ben_thatmustbeme or is refs specific to thinks like in-reply-to like-of, etc
# 17:24 
aaronpk i'm taking the opposite approach we originally took with jf2
# 17:24 
aaronpk i'm explicitly adding things to the output when there's a reason to, rather than trying to map a complete mf2 document to this output
# 17:25 
tantek aaronpk, that's how mf2 JSON was built
# 17:25 
tantek "explicitly adding things to the output when there's a reason to"
# 17:25 
tantek hence the evolution of how the parsed rel values made it in there
# 17:26 
aaronpk this is one level above that though
# 17:26 
tantek so it will be interesting to see if you come to similar/different conclusions
# 17:26 
aaronpk at the author level
# 17:26 
tantek so was mf2 JSON
# 17:26 
aaronpk e.g. someone can put an h-card anywhere on the page, and it will end up who knows where in the mf2 JSON
# 17:26 
tantek started at the HTML author level
# 17:26 
ben_thatmustbeme i like the idea of pulling it all out, almost like the refs: section could be completely ignored since you have to fetch the content anyway
# 17:26 
aaronpk i'm only interested in that h-card if it has explicit meaning that I can consume
# 17:27 
tantek "put an h-card anywhere on the page" - then the h-card likely has different meanings, so it makes sense for it to show up different places in the mf2 JSON
# 17:27 
ben_thatmustbeme aaronpk: it looks like you are doing [] specifically for some values but not for others
# 17:27 
aaronpk actually tantek.com is a great example. There's an h-card as the last child object of the top-level Tantek h-card
# 17:27 
tantek aaronpk - that's not author-centric (as you claimed originally), that's *consuming* centric ("i'm only interested ... if it has explicit meaning that I can consume")
# 17:27 
aaronpk I have no idea what that means, so it's not going to show up in the XRay output
# 17:28 
aaronpk I didn't say author centric, I said author-level
# 17:28 
tantek but you're not doing author-level either, you're doing consuming-level
# 17:28 
aaronpk what was your intention of marking up that Rebecca Daniels h-card?
# 17:28 
tantek it's a reference to a person
# 17:28 
aaronpk (as a child h-card of your top-level one)
# 17:29 
tantek as a publisher, it makes sense to markup all your content that's meaningful with established microformats
# 17:29 
ben_thatmustbeme can we get back on to the point we were discussing?
# 17:29 
aaronpk there's no other references to it though, so it has no context. For example if there was some other object on the page with a u-url of rebeccadanielsphoto.com then I might know what it's for
# 17:30 
tantek right, no other context, and that's ok
# 17:30 
aaronpk and at that point it would show up in the "refs" list
# 17:30 
tantek all you know is, this is a person that is referenced on this page
# 17:30 
tantek that's it
fuzzyhorns joined the channel
# 17:30 
tantek so e.g. if you have a tool that shows you a list of people on a page, you can display them
# 17:30 
tantek (there are such tools like Operator FF add-on)
# 17:31 
tantek and that's useful because you can do things like keep a history of where people were mentioned
# 17:31 
tantek (e.g. in the browser)
# 17:31 
tantek histories of people mentioned are useful for things like search, auto-complete etc.
# 17:31 
ben_thatmustbeme *sigh*
# 17:31 
tantek plenty of applications for even minimal context like that
# 17:31 
tantek just maybe not your specific application today
# 17:32 
ben_thatmustbeme perhaps JF2 is evolving to a more specific use case of social rather than just general microformats
# 17:32 
tantek point is, if it doesn't make sense to your application, you can just ignore it
# 17:32 
ben_thatmustbeme microformats has a JSON representation already
# 17:32 
aaronpk ben_thatmustbeme: yeah that's kind of what I'm thinking
# 17:32 
tantek ben_thatmustbeme: that's how it starts, but as you add more use-cases, you'll likely end up making something very similar
# 17:32 
aaronpk the problem is when a property can be either an array or a string, then both cases end up needing to code exceptions for
# 17:33 
tantek e.g. every use-case I listed above for random h-cards on a page *IS* social
# 17:33 
aaronpk in mf2 json everything is an array, so most of the time you're doing [0] to get the first. but when jf2 makes a property a string if there's only one value, then you have to do a bunch of checks
# 17:34 
tantek aaronpk: precisely why that design decision was made for mf2 json
# 17:34 
ben_thatmustbeme what if we just make this one rule aaronpk, as soon as you hit somthing that has a specific URL outside the domain context (non-authoritative content) we move it over to refs.  Basically we could do that as the only change to the MF2 JSON and see what we get
# 17:34 
tantek so consuming code wouldn't have to do "bunch of checks" (or at least fewer)
# 17:34 
aaronpk i'm not saying that's bad, just what it is
# 17:34 
tantek right
# 17:34 
tantek which I'm happy to see the alternative being explored
# 17:34 
aaronpk so with the XRay output, I made it vocabulary-aware, so that it's easier to consume when you know what your'e consuming
# 17:35 
aaronpk e.g. "this is an h-entry. if there is a published date, it will always be a string. if it's a reply, you can find all the URLs it's in reply to in the array 'in-reply-to'"
# 17:35 
ben_thatmustbeme so, one of the biggest complaints i keep hearing is the need to check if something is an array, single item, or object
# 17:36 
aaronpk also the value of "in-reply-to" will never be an object with this, since if it was an object in the mf2 JSON, that object gets moved down to refs and the URL of the object is the value in the array
# 17:36 
ben_thatmustbeme shouldn't author: be a single array item then? couldn't you have multi-author posts?
# 17:36 
tantek where do you hear these complaints ben_thatmustbeme ?
# 17:36 
aaronpk some of them are from me
# 17:36 
tantek hah
# 17:37 
ben_thatmustbeme lol
# 17:37 
aaronpk but i've heard that from others as well
# 17:37 
ben_thatmustbeme i have heard others, i do not have citations right now, will try to keep them noted down from now on
# 17:37 
ben_thatmustbeme and i do rather agree with them, it is sort of annoying
# 17:38 
aaronpk it's very annoying. annoying enough that i'm encapsulating all this logic into XRay so I don't have to do it again
# 17:38 
aaronpk I need this for: readers, showing reply context, showing comments/reactions
# 17:39 
ben_thatmustbeme again, you are assuming only one author ever?
# 17:39 
aaronpk well so far i haven't seen any multi-author posts
# 17:39 
aaronpk and even if there was one, 99.9% of all posts i encounter are single author
# 17:39 
ben_thatmustbeme notices you aren't processing comments either
# 17:39 
ben_thatmustbeme is that just haven't gotten there yet?
# 17:39 
aaronpk no not yet. like I said I am only adding things when I want to consume them
# 17:40 
ben_thatmustbeme comments could likely just get reduced to a list of urls too
# 17:40 
ben_thatmustbeme unless they comment directly on the site
# 17:40 
ben_thatmustbeme i know some allow that
# 17:40 
aaronpk most likely I'm going to make the "comment" property a list of URLs, and the actual comment objects will live in the "refs" below
# 17:41 
ben_thatmustbeme exactly
# 17:41 
ben_thatmustbeme just for comments that don't have a URL, what do you do?
# 17:41 
aaronpk if there's no URL for a comment (including no fragment URL) then I'm just going to drop it, since nothing will be able to do anything with that comment anyway
# 17:41 
ben_thatmustbeme i don't think thats true, salmention would still work with a comment that doesn't have a url
# 17:41 
aaronpk stick a fragment URL on the inline comment and then it's useful again
# 17:42 
aaronpk ben_thatmustbeme: in practice, any consuming code trying to handle something that doesn't have a URL isn't going to end up with good results
# 17:42 
ben_thatmustbeme good point
# 17:42 
ben_thatmustbeme i feel like i'm responding just a moment too early to you
# 17:42 
ben_thatmustbeme :P
# 17:43 
aaronpk combine that with tantek's earlier suggestion of XRay returning the object inside the fragment identifier and then fragment comments act just like comments with their own URL
# 17:43 
aaronpk https://github.com/aaronpk/XRay/issues/8
# 17:43 
ben_thatmustbeme indeed, i'd love for the mf2 parser to be able to do that directly actually
# 17:44 
aaronpk i guess that's a totally fine job for the mf2 parser
# 17:44 
tantek yeah!
# 17:47 
ben_thatmustbeme still not totally sold on all the items that have been dropped, (location, shortlink, name)
# 17:48 
aaronpk name hasn't been dropped
# 17:48 
aaronpk but it's only there when it's actually a name
# 17:49 
ben_thatmustbeme ahhh
# 17:49 
aaronpk e.g. it gets removed if it is the same (or a subset) of the content
# 17:50 
ben_thatmustbeme or a subset? that seems... wrong.  As most people will reference the title of a post in the content
# 17:50 
aaronpk er, prefix
# 17:50 
ben_thatmustbeme okay
# 17:50 
aaronpk it's what's described on comments-presentation
# 17:51 
aaronpk now all of a sudden "name" is useful again
# 17:52 
aaronpk i'll add location soon
# 17:52 
aaronpk basically every property on http://microformats.org/wiki/h-entry should show up if present
# 17:53 
ben_thatmustbeme all seems to make sense, looks like uid and logo aren't carried over, but those aren't really needed / prefer url over uid and logo is just photo again
# 17:53 
aaronpk there is no "logo" on http://microformats.org/wiki/h-entry anyway
# 17:54 
ben_thatmustbeme indeed, so there is never any x- prefixes or anything parsed
# 17:54 
aaronpk yeah
# 17:55 
ben_thatmustbeme trying to think of a good argument for shortlink, i feel like it is needed as an authoriative alternate url
# 17:55 
aaronpk btw i'm not sure this is actually the best step for jf2, which is why i've just been building this as an API, but this is how I want to consume pages
# 17:55 
ben_thatmustbeme which is different from other redirects
# 17:55 
ben_thatmustbeme i know i made a bunch of optimizations with that
# 17:55 
ben_thatmustbeme some no, some yes
# 17:55 
ben_thatmustbeme i actually really like the refs: idea
# 17:56 
ben_thatmustbeme anything non-authoritative becomes SUPER easy to just throw away if you don't want to look at it
# 17:56 
aaronpk yeah, it's more like you have two options to find out about a URL that's in the in-reply-to or whatever
# 17:56 
aaronpk you can check the refs property, or go fetch the URL yourself
# 17:58 
ben_thatmustbeme indeed
# 17:59 
ben_thatmustbeme i may look at just applying that to a straight mf2 json output to see what it looks like
# 17:59 
ben_thatmustbeme keeping the "always an array" idea, and just cleaning up all that stuff
# 17:59 
aaronpk interesting idea
# 18:00 
aaronpk i think the rule would be if the object has a url property, replace the object with that URL and move the object to the refs array
# 18:02 
aaronpk btw gRegorLove do you have a sec to review my PR to the php parser? https://github.com/indieweb/php-mf2/pull/83
# 18:04 
gRegorLove Sure
# 18:08 
gRegorLove Looks good at a glance, without testing. I think the innerText method should remove the script and style, but I'm not aware of any problems explicitly removing them first, either.
# 18:10 
aaronpk i think innerText does
# 18:11 
aaronpk that's what's used for the "value" property
# 18:11 
gRegorLove Oh, you're stripping it from the 'html' value?
# 18:11 
aaronpk but the html property is built up with the calls to $node->C14N() which does not remove them
# 18:12 
aaronpk yes http://microformats.org/wiki/microformats2-parsing-issues#exclude_style_elements_before_parsing
# 18:12 
gRegorLove Ok
# 18:13 
gRegorLove Heh. Forgot my own bug report :)
# 18:21 
aaronpk oh ha
Calli, fuzzyhorns, Left_Turn, KartikPrabhu and Garbee joined the channel
# 19:48 
tantek edited /Template:MicroFormatCopyrightStatement (+53) "or already have submitted, updating purely for temporal prose accuracy" (view diff)
# 19:51 
tantek edited /rel-tag (+85) "note rel-tag incorporated into HTML5" (view diff)
tantek, fuzzyhorns, uf-wiki-visitor and mkaply joined the channel
# 21:21 
mkaply Has anyone else used microformats-shiv? I'm not seeing the results I expect and I'm trying to figure out what I'm doing wrong.
# 21:22 
mkaply I'm using it against this page.
# 21:23 
mkaply http://microformats.org/wiki/hcard-examples
# 21:23 
mkaply Getting no h-cards
# 21:23 
mkaply https://pastebin.mozilla.org/8861689
# 21:25 
aaronpk you might need to try with actual HTML, not just wiki text
# 21:25 
aaronpk also that page looks like it only contains microformats1 objects, not sure if the microformats-shiv library parses those or only microformats2
# 21:28 
mkaply It's supposed to parse 1. It's been so long since I've touched this stuff. I guess it's time to bring Operator back from the dead.
# 21:31 
aaronpk well if you're literally parsing the wiki URL, you probably won't find anything there, since it's all escaped HTML
# 21:32 
mkaply aaronpk: It's parsing the DOM directly, so it should be finding the nodes. Must be something else going on
# 21:33 
aaronpk that's what i'm saying though, the HTML on that page is stuff like class=&quot;vcard&quot;&gt;
# 21:33 
aaronpk it doesn't actually have any microformats in it
fuzzyhorns joined the channel
# 21:44 
mkaply I tried tantek's page too. same result. i must be doing something wrong. I'll keep looking
KartikPrabhu, MeanderingCode, fuzzyhorns and tantek joined the channel
# 22:47 
tantek mkaply: not sure. maybe ping mixedpuppy? I know he had tests working.
# 22:52 
mkaply tantek; i figured it out. I was passing a string as filters. I opened a bug against shiv to handle that.
# 22:54 
tantek were you able to get it to work without a filter?
fuzzyhorns and Chordachi joined the channel
# 23:00 
mkaply Yes. It's working now. Oddly if I add the filters: ["h-card"], it hangs the browser. But if I don't specify a filter, I get the h-card
# 23:02 
tantek that *is* odd. another bug?
# 23:07 
mkaply I'm debugging now. It's strange because it does work in our tests.
fuzzyhorns and KartikPrabhu joined the channel