#microformats 2018-02-18
2018-02-18 UTC
tantek, [cleverdevil] and KartikPrabhu joined the channel
#
KartikPrabhu updated mf2py for the link[title] and video[poster] parsing for p-* and u-*

tantek joined the channel
#
KartikPrabhu now we need to figure out how to incorporate these changes into the actual mf2py code base

#
KartikPrabhu I did push the new p-* and u-* rules

#
KartikPrabhu for the implied name parsing I am waiting to see if it makes it into the spec

#
KartikPrabhu yes

[mrkrndvs] and tantek joined the channel
#
KartikPrabhu !tell [kevinmarks]: would be awesome if you could test https://github.com/kartikprabhu/mf2py/tree/implied-name-fix on the examples in https://github.com/microformats/microformats2-parsing/issues/6#issuecomment-366473390 for some reason my local install works correctly but not the installation on my server!

#
KartikPrabhu and anyone else too ^

[eddie], [kevinmarks], [tantek], [mrkrndvs], nitot, [miklb], iwaim___, [jeremycherfas], 5EXAAPICA, barpthewire, tantek, [cleverdevil] and [squorch] joined the channel
#
KartikPrabhu [kevinmarks]: would appreciate your help/insight on this https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366533803

#
KartikPrabhu Zegnat: yeah not sure what is going on

#
KartikPrabhu sorry, updated the comment. the double url shows up only with html5lib, not with html.parser

#
KartikPrabhu html5lib is supposed to be better but has some funny quirks

#
KartikPrabhu html5lib works with the old rules but seems to be doing funny things with the new ones!

#
KartikPrabhu html.parser works with both rules

#
KartikPrabhu so I am very confused since the mf2py code is the same but the parsers are different

#
[kevinmarks] Beautiful Soup is on top of them both iirc.

KartikPrabhu joined the channel
#
KartikPrabhu Zegnat: this is happening because the <a>-tag in your 3rd example is not closed!

#
KartikPrabhu and html5lib is doing some crazy stuff with it!

#
[kevinmarks] S/crazy/spec compliant/
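
(Editor's sketch, not from the channel.) One way to see why parsers can disagree here: the token stream for an unclosed <a> contains a start tag with no matching end tag, so every tree builder has to apply its own recovery rule. The snippet below uses only the tokenizer in Python's standard-library html.parser. The SAMPLE markup and the names TagLogger and tag_events are made up for illustration; this is not Zegnat's third example.

```python
from html.parser import HTMLParser


class TagLogger(HTMLParser):
    """Records start/end tag events without building a tree."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))


def tag_events(markup):
    """Return the list of (kind, tag) events seen in the markup."""
    logger = TagLogger()
    logger.feed(markup)
    logger.close()
    return logger.events


# Hypothetical markup: the <a> is never closed.
SAMPLE = '<p><a href="/one">One</p><p>Two</p>'

events = tag_events(SAMPLE)
opened = sum(1 for kind, tag in events if kind == "start" and tag == "a")
closed = sum(1 for kind, tag in events if kind == "end" and tag == "a")
print(events)
print("a opened:", opened, "a closed:", closed)
```

As far as I understand the HTML parsing rules, a spec-following tree builder reopens an unclosed formatting element such as <a> inside the blocks that follow it, which is one way repeated <a> elements can end up in the tree, while a simpler tree builder may just nest everything under the first <a>.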

#
KartikPrabhu [kevinmarks]: https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366536321 see the corrected tree outputs

#
KartikPrabhu I have no idea why html5lib is supposed to be the "correct" spec compliant one

#
KartikPrabhu no browser does it I think

#
KartikPrabhu Zegnat: yes that is what I thought too

#
KartikPrabhu sknebel: with the repeated <a> ?

#
KartikPrabhu yeah FF has 4 elements <a>

#
KartikPrabhu wow!

#
[kevinmarks] I'm not sure why the spec would expect that, but html5lib is very compatible with what browsers do.

#
KartikPrabhu [kevinmarks]: here it depends on which browser since Chrome and FF do different things

#
KartikPrabhu :P

#
KartikPrabhu oh ok

#
KartikPrabhu still different from the html5lib output

#
KartikPrabhu <shrug> who knows!

#
KartikPrabhu so I guess html5lib is the correct one

#
KartikPrabhu or "more" correct

#
KartikPrabhu I wonder how other parsers handle this one

#
KartikPrabhu mf2 parsers

#
KartikPrabhu Zegnat: yes but it would be good to see the differences since the mf2 parsed output depends on that

#
KartikPrabhu Zegnat: with the corrected example mf2py with html5lib also works

#
KartikPrabhu Zegnat: try it here https://kartikprabhu.com/connection/mfparser

[chrisaldrich] joined the channel
#
[kevinmarks] I expect that the js one that uses browser dom would match. PHP will depend on lxml doing its own thing.

#
[kevinmarks] I am wondering if we should force html5lib in mf2py

#
[kevinmarks] It is a bit slower

#
KartikPrabhu [kevinmarks]: could you test that branch with lxml?

#
KartikPrabhu I am not sure how to install a C lib in virtual env

#
KartikPrabhu sknebel: mf2py has the option to specify a parser

#
KartikPrabhu which it basically passes on to BS

#
KartikPrabhu yes

#
KartikPrabhu sknebel: yeah that seems reasonable. html5lib default unless user specified. if html5lib is not installed defer to BS defaults
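
(Editor's sketch, not from the channel.) The policy described in the line above could look roughly like this: an explicit user choice wins, html5lib is used when it is importable, and otherwise nothing is specified so Beautiful Soup falls back to its own default. The function name pick_parser and its is_installed hook are hypothetical and are not taken from the mf2py code base.

```python
from importlib.util import find_spec


def _importable(name):
    """True if a module with this name can be found on the current path."""
    return find_spec(name) is not None


def pick_parser(user_choice=None, is_installed=_importable):
    """Return a parser name for Beautiful Soup, or None to use its default.

    user_choice  -- parser name given by the caller, if any
    is_installed -- availability check, injectable so it can be tested
    """
    if user_choice:
        return user_choice          # the user's choice always wins
    if is_installed("html5lib"):
        return "html5lib"           # proposed default when available
    return None                     # defer to Beautiful Soup's own default


print(pick_parser("lxml"))
print(pick_parser(is_installed=lambda name: False))
```

The result would then be handed to Beautiful Soup as its features argument, where None means "pick for me".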

#
KartikPrabhu no I think it goes for lxml first since that is faster

#
KartikPrabhu is finding docs for that^

#
KartikPrabhu lxml > html5lib > html.parser
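
(Editor's sketch, not from the channel.) The preference order above can be modelled as "first importable name in a fixed list". This only imitates the documented ranking; it is not Beautiful Soup's actual lookup code, and default_parser and PREFERENCE are illustrative names.

```python
from importlib.util import find_spec

# Ranking quoted in the chat: fastest first, built-in parser last.
PREFERENCE = ("lxml", "html5lib", "html.parser")


def _importable(name):
    """True if a module with this name can be found on the current path."""
    return find_spec(name) is not None


def default_parser(is_installed=_importable):
    """Return the first parser in PREFERENCE that is available."""
    for name in PREFERENCE:
        if is_installed(name):
            return name
    return None


print(default_parser())
```

Because html.parser ships with Python, the last entry is always available, so which tree you get depends on whether lxml or html5lib happens to be installed.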

#
KartikPrabhu now check lxml

#
KartikPrabhu ok updated with all trees https://github.com/kartikprabhu/mf2py/issues/58#issuecomment-366536321

[eddie], [cleverdevil] and [kevinmarks] joined the channel
#
KartikPrabhu ok whole new branch of mf2py with default html5lib and handling of redirect URLs along with implied-name fixes https://github.com/kartikprabhu/mf2py/tree/parsing-into-bs4

#
KartikPrabhu enough mf2py for this weekend (for now)

GWG, [pfefferle], [miklb], KartikPrabhu and [cleverdevil] joined the channel