#microformats 2017-04-30
2017-04-30 UTC
[eddie], davidmead and [kevinmarks] joined the channel
#
ben_thatmustbeme woah, this one is weird

#
ben_thatmustbeme automatically to <object class="h-card" data="jane.jpeg"/></object>Jane Doe

#
ben_thatmustbeme apparently nokogiri will change <object class="h-card" data="jane.jpeg"/>Jane Doe</object>

#
ben_thatmustbeme ohh, hah

#
ben_thatmustbeme just realized the issue on that one

#
ben_thatmustbeme and yes, that's from a test

#
ben_thatmustbeme two close tags, #broken

#
ben_thatmustbeme well, the output json from the test pretty much says it's an error in the test

#
ben_thatmustbeme man there are some tough bits in the test suite though

#
ben_thatmustbeme !tell tantek should 'replacing any nested <img> elements with their alt attribute, if present; otherwise their src attribute, if present, adding a space at the beginning and end, resolving any relative URLs, and removing all leading/trailing whitespace. ' include video and audio tags as well ?
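
The rule quoted in the message above can be sketched roughly as follows. This is only an illustrative reading, not the behaviour of any existing parser: `TextWithImgFallback`, `text_value` and the `replace_tags` parameter are hypothetical names, and the sketch assumes the padding spaces and URL resolution apply to the src fallback. Passing extra tag names such as video or audio models the extension being asked about, which the quoted text does not currently cover.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class TextWithImgFallback(HTMLParser):
    """Collect text content, substituting alt or src for selected elements.

    Hypothetical helper for illustration; not taken from any mf2 parser.
    """

    def __init__(self, base_url, replace_tags=("img",)):
        super().__init__(convert_charrefs=True)
        self.base_url = base_url
        self.replace_tags = set(replace_tags)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag not in self.replace_tags:
            return
        attributes = dict(attrs)
        if attributes.get("alt") is not None:
            # alt wins when it is present
            self.parts.append(attributes["alt"])
        elif attributes.get("src") is not None:
            # assumption: the padding spaces apply to the resolved src fallback
            resolved = urljoin(self.base_url, attributes["src"])
            self.parts.append(" " + resolved + " ")

    def handle_data(self, data):
        self.parts.append(data)


def text_value(html, base_url, replace_tags=("img",)):
    """Return the text of an HTML fragment with leading/trailing whitespace removed."""
    parser = TextWithImgFallback(base_url, replace_tags)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()
```

With the default `replace_tags`, a nested video element contributes nothing to the text; calling `text_value(fragment, base, replace_tags=("img", "video", "audio"))` shows what the result might look like if the rule were widened.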

#
ben_thatmustbeme !tell tantek that also doesn't seem to be specified when parsing for implied 'name' values etc, and is different for p- vs u- etc

#
ben_thatmustbeme oh, e- value doesn't say to remove <script> / <style> tags either

#
gRegorLove This appears to be a parser bug in mf2py as well: https://github.com/indieweb/php-mf2/issues/114

#
ben_thatmustbeme that's slightly different

#
ben_thatmustbeme that's when parsing u-*

#
ben_thatmustbeme when parsing p-* http://microformats.org/wiki/microformats2-parsing#parsing_a_p-_property last line here

#
ben_thatmustbeme almost wonder if it should just be that any element with an alt attribute gets replaced by its alt text

#
gRegorLove ?

#
gRegorLove Sorry, separate from the earlier conversation.

#
gRegorLove Just noting it here for mf2py people

#
gRegorLove But on script and style, looks like it was intentional to leave it in e- parsed results, since that's HTML. http://microformats.org/wiki/microformats2-parsing-issues#exclude_style_elements_before_parsing

#
ben_thatmustbeme gRegorLove: it looks like that resolution wasn't applied to e- 's value: like it was supposed to be

#
ben_thatmustbeme resolution makes sense, not reflected in spec page though

#
ben_thatmustbeme okay, need sleep

#
gRegorLove It's ambiguous at the top of that link, but it's proposal 2 that was accepted, to the best of my recollection. It was at IWS last year.

tantek joined the channel
#
gRegorLove Added that <video> issue to mf2py as well, just linked to the php-mf2 issue: https://github.com/tommorris/mf2py/issues/76

#
ben_thatmustbeme Yeah, the other parts got fixed in the spec. e- html should include them, but the resolution says value should leave them out too, and the parsing spec doesn't say that

[chrisaldrich], tantek, barpthewire and nitot_ joined the channel
davidmead joined the channel
#
benthatmustbeme edited /citation-irc-notes-2006-04-09 (-50) "Undo revision 66281 by [[Special:Contributions/Cvkitssupplise|Cvkitssupplise]] ([[User talk:Cvkitssupplise|Talk]]) link spam"
KartikPrabhu, [colinwalker] and tantek joined the channel
#
ben_thatmustbeme hey tantek

#
ben_thatmustbeme don't know if you saw the logs from here or now

#
ben_thatmustbeme not*

#
Loqi tantek: ben_thatmustbeme left you a message 14 hours, 58 minutes ago: should 'replacing any nested <img> elements with their alt attribute, if present; otherwise their src attribute, if present, adding a space at the beginning and end, resolving any relative URLs, and removing all leading/trailing whitespace. ' include video and audio tags as well ?

#
ben_thatmustbeme also, another one came up last night too

#
ben_thatmustbeme based on this http://microformats.org/wiki/microformats2-parsing-issues#exclude_style_elements_before_parsing i think there is some text missing for e- value:

#
ben_thatmustbeme fair point, i haven't seen it directly in the wild, but i haven't been looking

#
ben_thatmustbeme e- value: doesn't mention anything about removing script and style tags

#
ben_thatmustbeme yes, it needs it, the HTML should have it

#
ben_thatmustbeme : Drop both <script> and <style> elements completely when parsing any property (except for e-* HTML values, which preserve all markup).

#
ben_thatmustbeme was the resolution
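
A minimal sketch of how that resolution might apply to an e-* property, assuming the reading given in this conversation: the html keeps all markup, while the plain-text value drops script and style content. `TextWithoutScriptStyle`, `plain_value` and `e_property` are hypothetical names used only for illustration, and the sketch ignores the other steps a real parser performs (img replacement, URL resolution, and so on).

```python
from html.parser import HTMLParser


class TextWithoutScriptStyle(HTMLParser):
    """Collect text content while skipping anything inside <script> or <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # keep text only while outside every skipped element
        if self.skip_depth == 0:
            self.parts.append(data)


def plain_value(html):
    """Text of the fragment with script/style content dropped."""
    parser = TextWithoutScriptStyle()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()


def e_property(inner_html):
    """Hypothetical e-* result: html preserves all markup, value does not."""
    return {"html": inner_html, "value": plain_value(inner_html)}
```

For a fragment such as `<p>Hello</p><style>p { color: red }</style>`, the html entry is returned unchanged and the value contains only the paragraph text.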

[eddie] joined the channel
#
ben_thatmustbeme it's dropped from dt-*, u-*, and p-*, but not dropped from e-* html OR e-* value

#
ben_thatmustbeme it's actually been a fun challenge writing this parser

#
ben_thatmustbeme i think all the mf2 ones i'm failing now are datetime related

#
ben_thatmustbeme rewriting it from scratch pretty much

tantek, gRegorLove, uf-wiki-visitor, davidmead, [kevinmarks] and [eddie] joined the channel