#microformats 2018-08-07
2018-08-07 UTC
[kiai], CGML2, [jgmac1106], [matpacker], mnemonic and [jgarber] joined the channel
# gRegorLove PR to fix the implied photo issue today: https://github.com/microformats/php-mf2/pull/191
# gRegorLove sknebel++ for catching that
[jgarber] joined the channel
# gRegorLove That was a php-mf2 bug not a spec change, but yeah I am working on adding some tests for implied properties -- specifically negative tests, to cover some instances when they shouldn't be implied.
# [jgarber] I do have a couple of newbie questions about https://github.com/microformats/tests:
[kiai] joined the channel
# gRegorLove I'm new to it as well. I've wanted to get php-mf2 using it for a while but kept working on other issues :)
# gRegorLove I don't know node very well, so not sure about 1.
# gRegorLove 2, yeah composer.json is PHP package control, so you can include it in other libs. php-mf2 uses that to load in the tests
# gRegorLove Not sure / no strong opinion on 3
# gRegorLove Submodule could work. With composer it's nice that we can specify microformats for development only: https://github.com/microformats/php-mf2/blob/master/composer.json#L19
# gRegorLove So if you just want to put php-mf2 in a project and don't need all the tests, you don't have to get them.
# gRegorLove Ohh. Yeah, composer.json and package.json make pulling them into other projects easier. Composer lets you specify a git repo, but we can just use "mf2/tests" instead.
# gRegorLove So I think they're fine there
# gRegorLove Gotcha
# gRegorLove knows very little Ruby :)
[xavierroy], Sove, ketralnis, MylesBraithwaite, [Serena] and [kevinmarks] joined the channel
# Loqi Zegnat rebase and rerun https://github.com/microformats/php-mf2/pull/163 . see how we’re doing
samouy15, Davnit18, Tux10, disi14, rolig and [pfefferle] joined the channel
[jgmac1106], adactio, Guest29805, [pfefferle], aaronpk and [kevinmarks] joined the channel
# [kevinmarks] I'm thinking that we should refactor the tests repo, as there are a lot of odd synthetic tests there, and not many spot tests for the parsing algorithm.
# [kevinmarks] The basic idea of parallel html and json files is good and should be usable by multiple parsers; the actual content is a bit annoying.
# [kevinmarks] The original idea of sending the same html through multiple parsers and comparing the json is a good one, but that bit isn't really maintained.
# [kevinmarks] Would it make sense to use some kind of existing CI tool for this?
[jgarber] joined the channel
# [kevinmarks] That each parser is spawning parallel spec tests instead is a marker of failure.
# [kevinmarks] What a CI is good at is running code in multiple configurations
# [kevinmarks] So showing the responses of all known parsers is good.
# [kevinmarks] Doing it with the current tests may not be helpful, as they have a lot of cruft there, but starting with a simple subset and then merging them over could be good.
# [kevinmarks] Each time I sit down to look at them with mf2py I get bogged down in whitespace and awkward json comparisons.
# [kevinmarks] It would be, but that would be helpful in spotting if the test or the parsers are wrong.
# Zegnat Maybe we can pull a variety of real-world examples from indiemap? Maybe store them in a folder structure like https://indieweb.org/IndieArchive#Storage ?
# sknebel right now I can think of 3 areas where parsers might differ somewhat intentionally: a) e-* serialisation (although one could test if the resulting HTML parses the same) b) url resolving (depending on what steps specific libs take/not take) c) dt-parsing (there's a long-standing proposal to imply the dat part of a datatime always, not just in VCP, and some parsers already implement that)
thk127 joined the channel
# [kevinmarks] So compile examples of these from the parsers existing unit tests, and put the current ones in a historic directory?
# [kevinmarks] The current structure makes some sense for mf1, where you do need to know all the properties possible, but very little sense for mf2, agreed.
# [jgarber] This may be a tangent or a dead end, but… Would http://json-schema.org be at all helpful with the test suite?
# Zegnat [jgarber], http://microformats.org/wiki/microformats2-json actually links to two independent JSON Schema implementations for validation
# [kevinmarks] Json schema is overkill perhaps. There are other json issues - unicode vs UTF-8 for example
# [kevinmarks] (I dislike \u in utf8)
beaky10 joined the channel
# Zegnat [kevinmarks], if we decide to go with RFC 7493 JSON (https://github.com/microformats/microformats2-parsing/issues/23) we would be saying all mf2 JSON is always utf-8 encoded. Else it breaks spec.
simon_-_28 joined the channel
# [kevinmarks] Yes, and I uni we should require not using \u escaping too (it can be hard to test as language json parsers may convert both to native unicode strings)
# [kevinmarks] We should also maybe have some html in other encodings to turn into utf8 mf2.
jalcine, barpthewire, tantek, K0HAX4, bradenslen, tantek__ and snarfed joined the channel
# snarfed FYI all mf2py users! candidate for the next release (1.1.2) is available, including whitespace bug fixes and performance improvements. please try it out and let us know if you hit any problems! you can install with: pip install -e git+https://github.com/microformats/mf2py.git#egg=mf2py
[Serena], beaver24, ForexTrader, [jgmac1106], tantek, TallTed, Demp10, [chrisaldrich], adactio_, Monkeh13, cooled, jackjamieson, [johnjohnston], KartikPrabhu, [grantcodes] and [cleverdevil] joined the channel
# @metbril I was wondering if your #indieweb #microformats2 #wordpress mf2_s theme repo is still under active development? It has not been updated for a year. https://github.com/dshanske/mf2_s (twitter.com/_/status/1026873802519662593)
snarfed and kloeri24 joined the channel
# snarfed hey sknebel, thanks again for the mf2py optimization PR, https://github.com/microformats/mf2py/pull/123 . mind rebasing it on current master now that the revert is merged?
# snarfed the urljoin ValueError PR looks great too. mind adding a test? looks like you have a test case in https://github.com/microformats/mf2py/issues/79
[jgmac1106] joined the channel
# Zegnat Just opened a PR for more implied property fixes: https://github.com/microformats/php-mf2/pull/192 - feels like we also need more creative tests for all of these ...
loppy2 joined the channel
# gRegorLove Thanks for the xpath cleanup Zegnat! I was trying to simplify that but didn't think to try traversal. Appears to work in local testing.
# gRegorLove There's a couple other locations using that sibling-counting method we can probably simplify.
# gRegorLove Cool, will do
snarfed joined the channel
have joined the channel
# snarfed sknebel: huh, odd. are you on the new UI? ie https://console.cloud.google.com/bigquery
# snarfed (sknebel also https://github.com/microformats/mf2py/pull/124 needs a resolve. thanks in advance!)
# gRegorLove Zegnat: Ok, moved those into the xpath. I'm thinking we should add a check for `false` returned from xpath->query() as well
# gRegorLove Oh, I should use resolveUrl in there as well
israfel11 and tantek joined the channel
# gRegorLove Ok, final changes pushed
snarfed joined the channel
# KartikPrabhu sknebel: snarfed: maybe before you guys release new mf2py it might be good to do some clean up e.g. https://lgtm.com/projects/g/tommorris/mf2py/alerts/?mode=list
Cisien27 joined the channel
VoidWhisperer joined the channel
# gRegorLove aaronpk, mf2/tests is showing "abandoned" on packagist https://packagist.org/packages/mf2/tests
# gRegorLove haha :shrug:
# gRegorLove Sounds good
# gRegorLove Any objection to parseImpliedPhoto() always returning an absolute URL (or false, if none)?
iamtakingiteasy joined the channel
# gRegorLove I don't remember exactly. Mostly aesthetics, probably, since parseH is so long
# KartikPrabhu sknebel: can we collect a large repo of in the wild examples to do this stress test before live deplyment to bridgy?
[cleverdevil] joined the channel
# snarfed yeah indiemap has those examples if we really want them, 5.8M pages worth :P https://indiemap.org/docs.html#crawl
# KartikPrabhu we probably could pick about 100/1000 at random of differnet page sizes
# KartikPrabhu snarfed: both
# KartikPrabhu it would be useful for future testing
# KartikPrabhu I have no idea how to set such things
# KartikPrabhu sure
# sknebel I see a few things: a) performance - we can repeat measurements against different versions, and a representative sample means profiling gives more representative results for various cases (e.g. I have a commit here that I think should help in some cases, but in my test example it doesn't do much since the example markup is "too nice")
# @FusedMind ↩️ @jenchan_atl @apply_imagine How does structured markup (RDFa, JSON-LD, and microformats) affect SERP's? (twitter.com/_/status/1026909466292310016)
# Zegnat gRegorLove: https://github.com/microformats/php-mf2/pull/193 :) That should clear up that part of the parser I suppose
# gRegorLove No more exceptions, woo!
# gRegorLove I'll take a closer look later. I'm sure it's good though.
# gRegorLove Zegnat++
yawkat1, barpthewire and Contessa joined the channel
bradenslen joined the channel
snarfed, [eddie], Brace18, [jgmac1106], [kevinmarks], mist25 and [pfefferle] joined the channel
nOgAnOo joined the channel
jackjamieson, KartikPrabhu, barpthewire1, snarfed, snarfed1, SailorHaumea24, Guest29805, Shanmugamp723, tantek__ and LooCfur joined the channel
KindOne7, [shaners], [kevinmarks], KartikPrabhu, [jgmac1106] and [matpacker] joined the channel