#microformats 2017-04-30

2017-04-30 UTC
[eddie], davidmead and [kevinmarks] joined the channel
#
ben_thatmustbeme
woah, this one is weird
#
ben_thatmustbeme
apparently nokogiri will change <object class="h-card" data="jane.jpeg"/>Jane Doe</object>
#
ben_thatmustbeme
automatically to <object class="h-card" data="jane.jpeg"/></object>Jane Doe
#
ben_thatmustbeme
just realized the issue on that one
#
ben_thatmustbeme
and yes thats from a test
#
ben_thatmustbeme
two close tags, #broken
#
sknebel
is that an error in the test or a test that tests handling of broken html?
#
ben_thatmustbeme
well the output json from the test pretty much says its an error in the test
#
ben_thatmustbeme
man there are some tough bits in the test suite though
#
ben_thatmustbeme
!tell tantek should 'replacing any nested <img> elements with their alt attribute, if present; otherwise their src attribute, if present, adding a space at the beginning and end, resolving any relative URLs, and removing all leading/trailing whitespace. ' include video and audio tags as well ?
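The quoted rule can be sketched roughly as follows, using only Python's standard library. This is an illustration under stated assumptions, not how any existing mf2 parser does it: the names `TextWithImgAlt` and `text_value` are hypothetical, and the sketch reads "adding a space at the beginning and end" as applying to the `src` fallback only, which is one possible reading of the quoted sentence.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class TextWithImgAlt(HTMLParser):
    """Collect text content, substituting each <img> per the quoted rule."""

    def __init__(self, base_url=""):
        super().__init__(convert_charrefs=True)
        self.base_url = base_url
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # Only <img> is handled; <video> and <audio> are ignored,
        # which is exactly the gap raised in the question above.
        if tag != "img":
            return
        attributes = dict(attrs)
        if attributes.get("alt") is not None:
            self.parts.append(attributes["alt"])
        elif attributes.get("src") is not None:
            # Assumption: the space padding applies to the src fallback only.
            resolved = urljoin(self.base_url, attributes["src"])
            self.parts.append(" " + resolved + " ")

    def handle_data(self, data):
        self.parts.append(data)


def text_value(html, base_url=""):
    """Return the text of `html` with leading/trailing whitespace removed."""
    parser = TextWithImgAlt(base_url)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()
```

Called as `text_value('Photo:<img src="jane.jpeg">', "http://example.com/people/")`, the relative `src` is resolved against the base URL before it is substituted into the text.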
#
Loqi
Ok, I'll tell them that when I see them next
#
ben_thatmustbeme
!tell tantek that also doesn't seem to be specified when parsing for implied 'name' values etc, and is different for p- vs u- etc
#
Loqi
Ok, I'll tell them that when I see them next
#
ben_thatmustbeme
oh, e- value doesn't say to remove <script> / <style> tags either
#
gRegorLove
This appears to be a parser bug in mf2py as well: https://github.com/indieweb/php-mf2/issues/114
#
Loqi
[gRegorLove] #114 Fix u- parsing for <video> with poster attribute
#
ben_thatmustbeme
thats slightly different
#
ben_thatmustbeme
thats when parsing u-*
#
Loqi
[Tantek Çelik] microformats2 parsing specification
#
ben_thatmustbeme
almost wonder if it should just be any element with an alt tag should be replaced by its alt tag
#
gRegorLove
Sorry, separate from the earlier conversation.
#
gRegorLove
Just noting it here for mf2py people
#
gRegorLove
But on script and style, looks like it was intentional to leave it in e- parsed results, since that's HTML. http://microformats.org/wiki/microformats2-parsing-issues#exclude_style_elements_before_parsing
#
Loqi
microformats2-parsing-issues
#
ben_thatmustbeme
gRegorLove: it looks like that resolution wasn't applied to e- 's value: like it was supposed to be
#
ben_thatmustbeme
resolution makes sense, not reflected in spec page though
#
ben_thatmustbeme
okay, need sleep
#
gRegorLove
It's ambiguous at the top of that link, but it's proposal 2 that was accepted, to the best of my recollection. It was at IWS last year.
tantek joined the channel
#
gRegorLove
Added that <video> issue to mf2py as well, just linked to the php-mf2 issue: https://github.com/tommorris/mf2py/issues/76
#
Loqi
[gRegorLove] #76 Fix u- parsing for <video> with poster attribute
#
ben_thatmustbeme
Yeah, the other parts got fixed in the spec, e- html should include it, but it says value should leave them out too, the parsing spec didn't say that
[chrisaldrich], tantek, barpthewire and nitot_ joined the channel
#
schmarty
^^ more link spam :/
davidmead joined the channel
KartikPrabhu, [colinwalker] and tantek joined the channel
#
ben_thatmustbeme
don't know if you saw the logs from here or not
#
tantek
behind a bit on logs
#
Loqi
tantek: ben_thatmustbeme left you a message 14 hours, 58 minutes ago: should 'replacing any nested <img> elements with their alt attribute, if present; otherwise their src attribute, if present, adding a space at the beginning and end, resolving any relative URLs, and removing all leading/trailing whitespace. ' include video and audio tags as well ?
#
Loqi
tantek: ben_thatmustbeme left you a message 14 hours, 56 minutes ago: that also doesn't seem to be specified when parsing for implied 'name' values etc, and is different for p- vs u- etc
#
tantek
wow that's a good find
#
ben_thatmustbeme
also, another one came up last night too
#
tantek
I think in short it was only for img because that's where the use-cases were
#
tantek
rather than trying to generalize abstractly for other tags without actual use-cases
#
Loqi
microformats2-parsing-issues
#
tantek
oh? hmm - I thought I finished all the edits for that one
#
ben_thatmustbeme
fair point, i haven't seen it directly in the wild, but i haven't been looking
#
ben_thatmustbeme
e- value: doesn't mention anything about removing script and style tags
#
tantek
does it need to? I thought it was removed earlier
#
ben_thatmustbeme
yes, it needs it, the HTML should have it
#
tantek
ben_thatmustbeme: ok then I'm confused. what in the spec disagrees with the resolution?
#
ben_thatmustbeme
: Drop both <script> and <style> elements completely when parsing any property (except for e-* HTML values, which preserve all markup).
#
ben_thatmustbeme
was the resolution
#
tantek
ah yes that makes sense, I remember that discussion last year
[eddie] joined the channel
#
ben_thatmustbeme
its dropped from dt-* u-*, and p-* but not dropped from e-* html OR e-* value
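A minimal sketch of what the resolution seems to imply for e-* properties, assuming the `html` result keeps all markup while the plain-text `value` drops `<script>` and `<style>` content. The names `PlainTextWithoutScripts` and `parse_e_property` are hypothetical, and this uses Python's standard-library parser rather than any real mf2 implementation.

```python
from html.parser import HTMLParser


class PlainTextWithoutScripts(HTMLParser):
    """Collect text content while skipping <script> and <style> elements."""

    DROPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.DROPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.DROPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Text inside a dropped element is ignored entirely.
        if self.skip_depth == 0:
            self.parts.append(data)


def parse_e_property(inner_html):
    """Return html (markup preserved) and value (script/style dropped)."""
    parser = PlainTextWithoutScripts()
    parser.feed(inner_html)
    parser.close()
    return {"html": inner_html, "value": "".join(parser.parts).strip()}
```

With input such as `<p>Hello</p><style>p { color: red }</style> world`, the `html` key still contains the `<style>` element, while `value` contains only the visible text.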
#
ben_thatmustbeme
its actually been a fun challenge writing this parser
#
ben_thatmustbeme
i think all the mf2 ones i'm failing now are datetime related
#
tantek
is this a new parser? or updating phpmf2?
#
aaronpk
new ruby parser!
#
aaronpk
ben_thatmustbeme++
#
Loqi
ben_thatmustbeme has 5 karma in this channel (206 overall)
#
ben_thatmustbeme
rewriting it from scratch pretty much
tantek, gRegorLove, uf-wiki-visitor, davidmead, [kevinmarks] and [eddie] joined the channel