gRegor, wackycity[d], jeremycherfas and [snarfed] joined the channel
#@SamRose↩️ In the beginning of all of these centralized services: they offered RSS feeds, used microformats (so their pages could be crawled and data extracted), they used OpenID, they offered freely available API's. Until bigger capital got involved ... (twitter.com/_/status/1476044514255155203)
#jessealama[d]very impressive Loqi! I am indeed in 🤣
#jessealama[d]not sure if this is the right place to ask this kind of thing, but I'll fire away:
#jessealama[d]I've got a Schema.org puzzle I can't get my head around
#jessealama[d]I sell ebooks on my website (entirely handmade) and would like to represent the sales page as both a Book and as a Product. There's also an Offer (capitalized words refer to schema.org entities). The page is arguably *both* a Book and a Product. But it seems I need to pick one or the other. Anyone know how to cut this knot?
#aaronpkYou have found the problem with schema.org 😉
#aaronpkMost of Schema.org is theoretical, and all that really matters is what you want someone (probably google) to get out of it, so your best bet is to look at what you want from the end result and read those docs
#[tantek]There's a theory that Schema-org's complexity/theoretical nature is a giant nerdsnipe/honeypot. Like semantic HTML debates of old of em vs strong, or b vs i vs em vs strong, except 10x worse.
#jessealama[d]I guess the countertheory would be that adding good metadata helps make people more likely to find your stuff. Of course this comes with the caveat that if your site sucks or if you have very little content, it's a waste of time
#aaronpkIt only helps people find the stuff is there is something helping them find the stuff
#jessealama[d]I ask myself whether search engines really *need* the extra help that schema.org allows you to give them. Whether natural language techniques combined with hierarchical analysis of HTML documents (and possibly knowledge of who links to whom) is enough
#aaronpkin other words, without a consumer of the data it's useless
#jessealama[d]the theory to refute would be: investing time into nailing your schema.org metadata is totally worthless (even for someone who genuinely has something to offer on their site)
#[tantek]other way around. burden of proof (of relevance/utility/benefit) is always on the proposal to do extra work (in this case to do the work of figuring out schema-org nerdsnipe questions, and then adding it to your site)
#[tantek]"adding good metadata helps make people more likely to find your stuff." <-- needs proof that there is/are implementations that actually "helps make people more likely to find your stuff" for any particular specific "good metadata".
#[tantek]lacking that proof/evidence, there is no basis for that theory
#[tantek]closed implementations (like big search engines) aren't consistently testable/verifiable (as scientific experiments would require)
#[tantek]thus making it even more difficult to substantiate any such theory
#jessealama[d]agreed. It makes the whole investigation really unsatisfactory
[fluffy] and ur5us_ joined the channel
#PK[d]Schema is gaining popularity in SEO circle, but it’s difficult to create it plus implement it for a non tech person. It needs a team in an organization usually. I find different datapoints it has for any entity like restaurant, business helpful to create content around.
#PK[d]Ive been thinking about a browser plug-in that would highlight visible micro data on a web page. Like the plug-in will just highlight the h card entries on the page.