#microformats 2018-02-18
2018-02-18 UTC
tantek, [cleverdevil] and KartikPrabhu joined the channel
# KartikPrabhu updated mf2py for the link[title] and video[poster] parsing for p-* and u-*
tantek joined the channel
# KartikPrabhu now we need to figure out how to incorporate these changes into the actual mf2py code base
# KartikPrabhu I did push the new p-* and u-* rules
# KartikPrabhu for the implied name parsing I am waiting to see if it makes it into the spec
# KartikPrabhu yes
[mrkrndvs] and tantek joined the channel
# KartikPrabhu !tell [kevinmarks]: would be awesome if you could test https://github.com/kartikprabhu/mf2py/tree/implied-name-fix on the examples in https://github.com/microformats/microformats2-parsing/issues/6#issuecomment-366473390 for some reason my local install works correctly but not the installation on my server!
# KartikPrabhu and anyone else too ^
[eddie], [kevinmarks], [tantek], [mrkrndvs], nitot, [miklb], iwaim___, [jeremycherfas], 5EXAAPICA, barpthewire, tantek, [cleverdevil] and [squorch] joined the channel
# KartikPrabhu [kevinmarks]: would appreciate your help/insight on this https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366533803
# KartikPrabhu Zegnat: yeah not sure what is going on
# KartikPrabhu sorry updated comment. the double url is only in html5lib not in html.parser
# KartikPrabhu html5lib is supposed to be better but has some funny quirks
# KartikPrabhu html5lib works with the old rules but seems to be doing funny things with the new ones!
# KartikPrabhu html.parser works with both rules
# KartikPrabhu so I am very confused since the mf2py code is the same but the parsers are different
# [kevinmarks] Beautiful soup is on top of them both iirc.
KartikPrabhu joined the channel
# KartikPrabhu Zegnat: this is happening because the <a>-tag in your 3rd example is not closed!
# KartikPrabhu and html5lib is doing some crazy stuff with it!
# [kevinmarks] S/crazy/spec compliant/
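For illustration, here is a minimal sketch of why an unclosed <a> leaves room for parsers to disagree. The markup is a made-up stand-in, not the example from the issue, and TagLogger is a hypothetical helper. It uses only the standard library tokenizer behind html.parser to show that the token stream has one more <a> start tag than end tags. How that gap gets repaired into a tree is decided by each tree builder, which is presumably where html5lib and html.parser diverge.

```python
from html.parser import HTMLParser


class TagLogger(HTMLParser):
    """Hypothetical helper: record start/end tag events without building a tree."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))


# Assumed markup: the first <a> is never closed.
markup = (
    '<div class="h-card">'
    '<a href="/one">One <p>text</p>'
    '<a href="/two">Two</a>'
    "</div>"
)

logger = TagLogger()
logger.feed(markup)
logger.close()

a_starts = logger.events.count(("start", "a"))
a_ends = logger.events.count(("end", "a"))
print(f"<a> start tags: {a_starts}, </a> end tags: {a_ends}")
```

The tokenizer never synthesizes the missing end tag, so any extra or repeated <a> elements in a parsed tree come from the tree builder's error recovery rather than from the source markup.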
# KartikPrabhu [kevinmarks]: https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366536321 see the corrected tree outputs
# KartikPrabhu I have no idea why html5lib is supposed to be the "correct" spec compliant one
# KartikPrabhu no browser does it I think
# KartikPrabhu Zegnat: yes that is what I thought too
# KartikPrabhu sknebel: with the repeated <a> ?
# KartikPrabhu yeah FF has 4 elements <a>
# KartikPrabhu wow!
# [kevinmarks] I'm not sure why the spec would expect that, but html5lib is very compatible with what browsers do.
# KartikPrabhu [kevinmarks]: here it depends on which browser since Chrome and FF do different things
# KartikPrabhu :P
# KartikPrabhu oh ok
# KartikPrabhu still different from the html5lib output
# KartikPrabhu <shrug> who knows!
# KartikPrabhu so I guess html5lib is the correct one
# KartikPrabhu or "more" correct
# KartikPrabhu I wonder how other parsers handle this one
# KartikPrabhu mf2 parsers
# KartikPrabhu Zegnat: yes but it would be good to see the differences since the mf2 parsed output depends on that
# KartikPrabhu Zegnat: with the corrected example mf2py with html5lib also works
# KartikPrabhu Zegnat: try it here https://kartikprabhu.com/connection/mfparser
[chrisaldrich] joined the channel
# [kevinmarks] I expect that the js one that uses browser dom would match. Php will depend on lxml doing its own thing.
# [kevinmarks] I am wondering if we should force html5lib in mf2py
# [kevinmarks] It is a bit slower
# KartikPrabhu [kevinmarks]: could you test that branch with lxml?
# KartikPrabhu I am not sure how to install a C lib in virtual env
# KartikPrabhu sknebel: mf2py has the option to specify a parser
# KartikPrabhu which it basically passes on to BS
# KartikPrabhu yes
# KartikPrabhu sknebel: yeah that seems reasonable. html5lib default unless user specified. if html5lib is not installed defer to BS defaults
# KartikPrabhu no I think it goes for lxml first since that is faster
# KartikPrabhu is finding docs for that^
# KartikPrabhu lxml > html5lib > html.parser
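The selection rule discussed above could be sketched roughly as follows. This is only an illustration under stated assumptions, not mf2py's actual code: choose_parser and is_installed are hypothetical names, and the fallback order is the Beautiful Soup preference quoted in the log (lxml, then html5lib, then html.parser).

```python
from importlib.util import find_spec

# Beautiful Soup's preference order when no parser is named (as quoted in the log).
BS_DEFAULT_ORDER = ("lxml", "html5lib", "html.parser")


def is_installed(name):
    """Return True if the module backing the named parser can be found."""
    # html.parser ships with the standard library, so it is always available.
    if name == "html.parser":
        return True
    return find_spec(name) is not None


def choose_parser(user_choice=None, installed=is_installed):
    """Hypothetical helper: pick the parser name to pass on to Beautiful Soup."""
    if user_choice is not None:
        return user_choice  # an explicit user choice always wins
    if installed("html5lib"):
        return "html5lib"  # proposed default when available
    # Otherwise mimic Beautiful Soup's own fallback order.
    for name in BS_DEFAULT_ORDER:
        if installed(name):
            return name
    return "html.parser"


print(choose_parser())        # depends on what is installed locally
print(choose_parser("lxml"))  # explicit choice is returned unchanged
```

Passing the availability check in as a parameter keeps the rule easy to test without installing or uninstalling any parser.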
# KartikPrabhu now check lxml
# KartikPrabhu ok updated with all trees https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366536321
[eddie], [cleverdevil] and [kevinmarks] joined the channel
# KartikPrabhu ok whole new branch of mf2py with default html5lib and handling of redirect URLs along with implied-name-fixes https://github.com/kartikprabhu/mf2py/tree/parsing-into-bs4
# KartikPrabhu enough mf2py for this weekend (for now)
GWG, [pfefferle], [miklb], KartikPrabhu and [cleverdevil] joined the channel