2015-06-08 UTC
tantek, KevinMarks__, KevinMarks_, fuzzyhorns, KartikPrabhu, danielfilho, KevinMarks and krijnhoetmer joined the channel
Soopaman, fuzzyhorns, gRegorLove, Left_Turn, KevinMarks__, ChiefRA, kez, KevinMarks_, eschnou, glennjones, KartikPrabhu, chiui, adactio and csarven joined the channel
Left_Turn and KevinMarks_ joined the channel
KartikPrabhu and benborges joined the channel
fuzzy_horns and KartikPrabhu joined the channel
TallTed, KevinMarks__, KartikPrabhu, KevinMarks_ and KevinMarks___ joined the channel
glennjones and gRegorLove joined the channel
KevinMarks_, KevinMarks__, tantek, KevinMarks, KartikPrabhu, eschnou and GWG joined the channel
Zegnat joined the channel
# 19:24 gRegorLove Perhaps because the simplest case is to cite just the text, sans markup. The example given there is "short text notes"
# 19:24 tantek I suppose the question there is whether a repost should be using h-cite at all
# 19:24 tantek since a repost is intended as *post* of the original content
# 19:30 tantek kylewm: that is, asking for e-content is solving the wrong problem IMO - it may be expedient, but it's not actually helping the markup be correct
# 19:31 kylewm I've gone through a couple iterations of my own markup with reposts actually
# 19:32 kylewm right now i have an u-repost-of h-cite INSIDE the e-content of the post itself
# 19:32 kylewm that way you can include the permalink of the original post as well as the url of your repost
# 19:37 kylewm (I have both right now... so like h-entry > e-content > u-repost-of h-cite > e-content
chiui joined the channel
# 19:41 tantek that's e-content *inside* h-cite, not e-content *on* h-cite
KevinMarks joined the channel
# 20:02 Loqi glennjones: KevinMarks left you a message 6 days ago: said text was OK as a single string, but kyle's issue of pointing to the same URL twice may suggest we need an array. Are there in the wild exampels fo linking to the same URL wiht multiple types? I think that is contrived, but linking to the same URL with multiple texts is quite plausible
# 20:06 KevinMarks reading now - I thought trimming whitespace was expected in general, but I may have over-interpreted
# 20:09 glennjones As far as I can see its only mention one on the parsing page in the “implied name rules” ie “drop leading & trailing white-space from…”
# 20:10 KevinMarks IMO, strip leading/traing universally is always good, and collapse interior whitespace in implied name makes sense
# 20:10 tantek hmm - KevinMarks that issue doesn't make much sense
# 20:11 tantek so it is "premature to say we are going with the tenor of"
# 20:12 tantek sorry - example is not broken, just confused where the p-name was
# 20:13 tantek KevinMarks: and white-space collapsing in general for implied names
# 20:15 tantek would be preferable since I don't even know of any h-review-aggregate posts yet
# 20:16 glennjones I have add it as an option to my parser just in case we move to all property text at some stage, but at moment I am trying to get the test to match current rules
# 20:18 glennjones The test are a real mixture of old real world examples, a few taken from wiki examples and a hand full made up to help parser authors
# 20:20 glennjones Happy to see them replaced over time with indieweb patterns of use, but I need to get the current set in working order first
KartikPrabhu joined the channel
# 20:26 tantek glennjones - get the current set in working order first makes sense
# 20:30 glennjones KevinMarks: you OK with me updating your changes? I known you must of put a lot of time into the pull request.
# 20:31 KevinMarks Is there any pushback from anyone on leading/trailing stripping for p- properties?
# 20:31 KevinMarks 'cos I'd rather make that 1 line change to the parsing spec at least
# 20:32 tantek and what about implied-name whitespace normalizing?
# 20:33 tantek glenn what is your opinion of trim leading/trailing whitespace on p-* properties?
# 20:34 tantek KevinMarks: right now we don't really have consensus - we have lack of opinions
# 20:34 glennjones Yes I have it not just p-* but all properties - but yes behind a falg switch off by default
# 20:34 tantek the only people agreeing explicitly are you (proposer of issue/solution), and me (spec editor)
# 20:35 tantek would prefer to have at least one more parser dev opinion
# 20:35 tantek glennjones, could you explain why "all properties" is better than just for p-* ? (honestly curious)
# 20:36 tantek KevinMarks: proposal in the wiki that you and I +1 is " keep as is but have mf2 parser trim leading/trailing whitespace" which does not limit to p-* properties
# 20:38 kylewm is there ever any such thing like "first name: <span class="p-name">Kyle </span> last name: <span class="p-name">Mahan</span>"
# 20:39 tantek e.g. should be more like "given name: <span class="p-give-name">Kyle </span> family name: <span class="p-family-name">Mahan</span>"
# 20:39 kylewm I can +1 stripping leading/trailing whitespace for all properties, but I agree with Kevin's comment that ot dpesm
# 20:39 kylewm that it doesn't help with our test case failures
# 20:39 tantek well if we can at least get that resolved we can move forward with spec edit and implementation updates
# 20:40 glennjones I think I did across all properties just to safe guard against me adding leading/trailing single spaces by mistake. Not sure I work out all the possible impacts, but it was how my parser has work for last two years with any noticalbe problem
# 20:40 tantek all the same arguments apply - in terms of having to put markup on separate lines from the thing being marked up
# 20:40 kylewm <pre> -- that's what i meant rather than e-content
KartikPrabhu joined the channel
# 20:42 KevinMarks I meant if you had <div class="e-content">\n<pre>\n\nhello\n</pre></div> you are ok losing the \n before the <pre>
# 20:44 tantek KevinMarks: more likely that that \n is extra from the publishing system that you have no control over
elux joined the channel
# 20:44 tantek if you really must include an extra return like that, you can
# 20:45 tantek so I'm leaning towards glennjones position of all properties
# 20:48 kylewm i got fewer failures on 3.4, but the number of failures was consistent with/without unmung
# 20:49 kylewm (i'm intrigued/concerned why 3.4 is getting ~10 fewer failing tests than 2.7, but haven't looked into it yet)
# 20:49 KevinMarks I'll adda check fro which parser is being used, as I think we want html.parser to DIAF
KartikPrabhu and glennjones_ joined the channel
# 21:04 kylewm KevinMarks: i don't think html.parser should have a different result unicode-wise though, does it??
# 21:11 kylewm with python 2.7.6 and html.parser I get 64 failures with and without unmung :/
# 21:12 kylewm what is the different result you were getting?
tantek joined the channel
netweb joined the channel
# 21:22 kylewm oh huh, yeah it's possible i was using lxml ... really thought i was not though
# 21:22 kylewm well... we can pick a parser to go with for the test cases without limiting the parsers that you can use in general use, maybe
glennjones joined the channel
# 21:23 kylewm that's what i mean, i thought i was getting them with html5lib, but it's possible i had lxml on the path and didn't realize it
# 21:25 KevinMarks I mean I get missing unicode with lxml and not with html.parser or html5lib
# 21:29 KevinMarks no, lxml is doing it on purpose: "In Python 2, lxml's API returns byte strings for plain ASCII text values, be it for tag names or text in Element content."
# 21:32 tantek in general if there are problems with using html5lib we really should work on resolving them rather than avoiding using html5lib
# 21:34 KevinMarks I do get different numbers of fails with the 3 parsers though
KartikPrabhu joined the channel
glennjones joined the channel
# 21:48 tantek KevinMarks: interesting that they're looking at such structured information - that's perhaps a good chance to get them to look at adding microformats2 support
# 21:48 tantek do we know anyone there that would be good to ask about that?
# 21:50 tantek also interesting that their example used recipe
kez joined the channel
csarven joined the channel
tantek joined the channel
# 23:52 KevinMarks it seems to compress whitespace to a single char, but preserve an \n if there is one
# 23:59 KevinMarks html.parser also fails test_parser.test_multiple_root_classnames