#microformats 2021-04-08

2021-04-08 UTC
# 00:09 
KartikPrabhu btrem: MathML can go in e-content
# 00:10 
btrem Like in h-entry.
# 00:10 
KartikPrabhu yeah
# 00:10 
btrem Right. I was writing about using elements *inside* svg in my post.
# 00:11 
KartikPrabhu can you use MathML in SVG? does it render in any browser?
# 00:11 
btrem Well, I discussed both: using e-logo for a container wrapping svg, but also p-name on a `<text>` element.
# 00:11 
btrem No. We're crossing messages.  :/
[KevinMarks] joined the channel
# 00:12 
KartikPrabhu oh, I guess we are agreeing on the MathML as e-content but there a separate question on elements inside SVG
# 00:13 
btrem !@#!@#!@ I hate this web/chat window! Just lost my whole message.  @$E#@
# 00:13 
sknebel I mentioned MathML for completeness/context because it and SVG are the formats that are explicitly called out for parsing in the HTML5 parsing spec after Zegnat asked if a HTML parser would put them in the DOM
# 00:14 
btrem Right. My real world use case is adding `p-name` on a `<text>` element for an `h-card`.
# 00:15 
btrem I don't see that coming up with MathML.
# 00:15 
KartikPrabhu if the svg is directly in the HTML then the HTML parsers  should pick it up
# 00:15 
KartikPrabhu if it is in an <img> tag then they won't
# 00:15 
sknebel and given that its explicitly mentioned I would expect an HTML parser to produce a DOM for it, and thus a microformats parser to find classes and everything to work
# 00:16 
btrem So far, the parsers I've tested have picked them up.
# 00:16 
btrem Using the python and php microformats validators linked from the mf wiki.
# 00:16 
KartikPrabhu yes
# 00:17 
sknebel (there is always some risk that not all parsers are always using HTML5 parsers and might do different things, but it answers "what do I expect" to me)
# 00:17 
btrem There's a screenshot of the results in the article, using the python one.
# 00:17 
KartikPrabhu yes that risk will always be there for any mf2 parsing
# 00:18 
sknebel right, and very much in "bug"/"you choose this tradeoff, we tell you to do otherwise" fields.
# 00:18 
btrem Well, it *appears* that Google is doing something useful with it. It does show the street-address, locality, etc. Those are on html elements. It also shows the name of the business that is definitely coming from the svg text element.
# 00:19 
KartikPrabhu oh that is interesting
# 00:19 
btrem Whether that's due to microformats h-card classed, or just Google using that text, is hard to say.
# 00:19 
sknebel might be worth testing with PHP and python when they are not using HTML5 parsers - I'd vote for explicitly making them support SVG if they happen to fail it in that case
# 00:19 
btrem Yes, the search results for Google and DuckDuckGo are very encouraging.
# 00:20 
btrem the python validator does work with the html.parser option selected.
# 00:21 
sknebel thats good, because that's truly terrible :D
# 00:21 
btrem haha
# 00:22 
btrem Until you posted that, I didn't even realize what that meant!  :-D
# 00:22 
btrem That is truly terrible! SVG and MathML and html is a real mess.
# 00:23 
btrem I'm lmao.
# 00:27 
btrem Hmm, I did just notice something odd, though: the logo is an svg inside `<h1 class="e-logo">`. So in the JSON there's logo.0.html is the embedded svg markup, as it should; and logo.0.value, which is "Greenbank Plant Nursery  Greenbank  Plant Nursery".
# 00:27 
btrem Because the SVG has a title set to "Greenbank Plant Nursery", and there's also a `text` element with "Greenbank Plant Nursery."
# 00:30 
sknebel right, and the algorithm doesn't care about the tags its in, it just uses the text content of all the tags
# 00:31 
btrem That's correct, according to the parsing rules.
# 00:31 
btrem But not ideal.
# 00:32 
btrem The `title` element does offer accessibility. But maybe it isn't needed in this case, since the text is visible and accessible to e.g. a screen reader.
# 00:32 
sknebel right, not sure about best practices here
# 00:34 
btrem Probably not worth solving right now, but it might be worth keeping in the back of the mind, if svg starts getting used more.
# 00:35 
btrem The rule now is drop <script> and <style>. Maybe drop <script>, <style>, and <title> instead.
# 00:36 
btrem Because the <title> does not add to the text content of an html document (does it?). So maybe it shouldn't for svg, as well.
j12t joined the channel
# 02:55 
@aaronpk ↩️ Nice! The thing that does most of the work for me is all in a library XRay:

https://github.com/aaronpk/xray

It normalizes content from Microformats feeds as well as Twitter and other stuff. Then it's a matter of reformatting the resulting JSON into a nice feed using some templates (twitter.com/_/status/1379990018911993856)
KartikPrabhu, Seirdy, mxd, [tw2113_Slack_], [jeremycherfas], hendursaga, [Murray], [KevinMarks], [snarfed], [scojjac], [tantek], [kimberlyhirsh], [schmarty], tomlarkworthy, [Rose], [aciccarello], IWSlackGateway and indy joined the channel