#gRegorLoveI don't think the parser should do anything additional in that case
[pfefferle] joined the channel
#ZegnatIt should do what it says there, normalise http://www.southside.de] based on how the HTML spec wants you to do, right? So how does HTML handle these faulty URLs?
#KartikPrabhuZegnat: I don't think there is any URL validation inside HTML
#sknebelZegnat: that's what I'm trying to figure out. I think it returns an error case, which has to be handled upstream somehow, but I'm not all that familiar with this stuff
#sknebelgRegorLove: not doing anything additional means what? return the url unmodified? abort processing the property?
#KartikPrabhusknebel: I would return it as is without doing anything
#ZegnatI wouldn’t surprise me if HTML says that it isn’t relative (because it has a scheme) and just return it as is as an already absolute URL that needs no processing
#gRegorLoveYeah, I would return "http://www.southside.de]"
#ZegnatBut I am not 100% sure at what HTML spec says. And can’t D&D and read spec at the same time.
#gRegorLoveThe HTML has <a href="http://www.southside.de]" so the URL normalization shouldn't be a factor
#gRegorLoveI mean, every URL is being run through the normalization method I'm sure; mf2py just needs to not die if it's an invalid URL
#sknebelthat's the question. what's the "normalized absolute URL" of something that isn't a URL
#ZegnatPer spec the question is what HTML resolves "http://www.southside.de]" to. I imagine HTML resolves it to that literal string, so the parser should return the literal string as well.
#gRegorLovemf2py should see it starts with http:// or https:// and if it does, just return the value
#sknebelbut ok, general votes towards "just pass crap through", I guess a consumer has to expect that anyways since other steps don't care about the URL-ness of things
[miklb] joined the channel
#gRegorLoveGood point, hadn't considered other normalization, like making sure scheme is lowercase.
#Zegnatyeah, this is the oposite of dt-. vcp of u- never gets normalised, vcp of dt- normalises more than other ways to provide the value ...
#ZegnatDid we find out that vcp for dt- validation returned "" (empty string) if no values were found? Would that be expected for failed u- validation?
#gRegorLovemf2py and php-mf2 don't normalize scheme capitalization. That could be a nice-to-have.
#gRegorLoveI don't think the *lack* of those causes problems for consumers, though
#sknebelyeah. I think since you can't rely on the output, just passing it through and letting the consumer blow up if it doesn't handle it probably is acceptable
#ZegnatI wonder if that should get clarified in the mf2 spec. Change away from “return the normalized absolute URL of it” to something like “if the URL does not start with a scheme, apply the containing document's language's rules for resolving relative URLs, else return the gotten value”?
#sknebelpossibly. gotta compare what exactly the various parsers do and don't do normalization wise. and of course read the HTML stuff again when I'm more awake
#ZegnatI don’t think the original intent of the mf2 spec was to have to study the HTML spec though. So might be easier to clarify within mf2 spec.
#gRegorLoveAgreed, clarification between normalization and absolutizing would be good for the mf2 spec.