#microformats 2015-01-26

2015-01-26 UTC
KartikPrabhu, prtksxna, KevinMarks_, gRegor`, kez_, tantek and csarven joined the channel
#
csarven
Can someone point me to why microformats2? For instance, what was at stake with microformats1 in for example hCard that needed a revisit for parsing? I'm sure this is documented somewhere. Would appreciate a pointer.
#
csarven
I should clarify: IIRC, in microformats, the storyline was to simplify authoring, and think of scripts later. In microformats2, however, the story appears to have changed a little i.e., authoring is slightly more complex or involved (depending on how you look at it), in order to improve how machines parse the information.
#
tantek
csarven: yup - all documented at the "obvious" place :)
#
tantek
csarven - the "storyline" for microformats, was authors before parsers
#
tantek
for microformats2, the basic question we asked was, could we make thing simpler for BOTH authors and parsers
#
tantek
and that's what we ended up doing
#
tantek
the first part of the microformats2 page is more like a spec now rather than "story"
#
tantek
but the background is still there - let me get a fragment
#
tantek
your questions "what was at stake with microformats1" - if you mean what were the problems - are documented there
#
tantek
HTH and definitely let me know if you have any specific follow-ups - happy to improve the documentation accordingly
#
tantek
(but am calling it a night soon - willl check the logs - or I'll be back in the morning PST)
adactio joined the channel
#
csarven
tantek Thanks!
#
csarven
Perhaps that was wording. I didn't necessarily mean "problems". It was more about the cause/initiative to move towards microformats2.
#
csarven
re: "all microformats are simply an object with a set of properties with values." from http://microformats.org/wiki/microformats2#Background . That's pretty much EAV model. Which is used by RDF as well.
#
tantek
nah - RDF complicates the model unnecessarily with basing it on "triples" http://microformats.org/wiki/triples
#
tantek
there's no "pretty much" about it
#
tantek
more like "ugly much"
#
csarven
As well as /triples
#
csarven
In fact, it is very opinionated.
#
tantek
nope - such challenges are based on tons of experience
#
csarven
"unnecessarily complicated" ? C'mon
#
tantek
takes longer to explain = unnecessarily complicated
#
tantek
anyway - no interest in arguing about RDF because that's a useless waste of time - since it's unnecessary
#
tantek
it does seem to float some plumbing boats, but so does plenty of backend futzing
#
csarven
First of all, the problem space for LD is completely different than mf. It is misleading to suggest that LD/triples is "unnecessarily complicated" and that mf should be preferable.
#
tantek
csarven: no who does LD bothers to actually document specific problem spaces with open research, so I categorically reject your statement
#
csarven
tantek This is not about arguing. We are discussing. I have championed mf for a long time and still do. So, I don't like being positioned somewhere as if I'm fighting against mf.
#
csarven
Supporting LD doesn't mean that I don't support mf.
#
tantek
not saying you're fighting mf
#
tantek
I'm saying that LD is a waste of time, that's a separate problem
#
tantek
plenty of people support both, that's their hobby
#
csarven
Well, I disagree. LD solves problems that mf can't.
#
csarven
Neither is it that mf is intended to solve those problems either.
#
tantek
csarven - well, when you find some actual scientific documentation of such problems then let me know
#
tantek
because they're usually framed in terms of abstractions and what ifs, and but if I want tos
#
csarven
I and many would argue that SW/LD is more "scientific" than mf :)
#
csarven
It is not at all about what ifs.
#
tantek
so that's the essence of the problem. I ask you for "research" (e.g. URLs pointing to), and you say you would "argue"
#
csarven
We have all sorts of data. Data is not only bound to what exists on Web pages that should be easy to author for Web developers. That's a very narrow POV.
#
tantek
you offering to "argue" is "what ifs"
#
csarven
Well, how about we back up and try to back up the statement "unnecessarily complicated" scientifically?
#
csarven
Can you provide surveys?
#
tantek
again, you're arguing hypotheticals, like I said, let me know when you have documented research about specific problems at a public URL, until then, you're wasting time with handwaving
#
csarven
I'm simply asking for documentation on "unnecessarily complicated".
#
tantek
yup - microformats are useful (plenty of specific use-cases on the wiki), microformats solve those problems without any need for triples
#
tantek
ergo, no need for concepts of triples for those use-cases
#
csarven
It is a strong claim. I'd like to know what type of research went into concluding that.
#
tantek
ergo any use of triples would add *unnecessary* complexity
#
tantek
triples vs. property: value
#
csarven
No, you are unfairly comparing the problem.
#
tantek
nope, I'm comparing a documented problem
#
tantek
vs. no documented problems that you need triples for
#
tantek
the burden of proof is on needing triples, not on not needing them
#
csarven
What you are saying is that, given the problem space of mf, triples/LD complicates the problem. I'm saying that, well, that's not an accurate picture.
#
tantek
I'm saying you have no picture
#
tantek
you have no documented research of specific problems
#
tantek
you have handwavings about what a picture could be
#
tantek
like I said, let me know when you have URLs to specific documentation about specific problems / use-cases
#
tantek
until then - you're just wasting time arguing
#
csarven
Documented problem? Based on what? Information based on the microformats wiki about LD? And that you conclude based on that documentation, LD is complicated?
#
csarven
Ok
#
tantek
no - real world problems for users
#
csarven
You are repeating yourself. DRY.
#
csarven
You need to revisit your axioms.
#
csarven
"real world problems for users"
#
tantek
occam's razor - triples/LD unnecessary
#
tantek
until proven otherwise. hence burden of proof
#
csarven
I see. So, you arbitrarily come up with a simple view of what triples/LD is, .. then go ahead and document that in the wiki and call it a victory for mf?
#
tantek
no it's more a defense against wasting time
#
tantek
victories for mf havee nothing to do with LD being good or not
#
csarven
Why bother with the documentation on the alternative in the wiki any way?
#
tantek
victories come from solving real world use-cases
#
csarven
mf is victorious in its own right.
#
tantek
right
#
csarven
There is no need to bash the alternatives.
#
tantek
the /triples etc. documentation is because people keep bringing them up
#
tantek
like an FAQ
#
tantek
so it's a summary answer
#
tantek
and it's usually sufficient
#
csarven
Well, I appreciate your POV.
#
csarven
I agree, it is sufficient for many.
#
tantek
there is actually a need to filter out crap
#
tantek
in everything
#
tantek
and filter out inefficiencies
#
csarven
But I disagree on the approach taken "against" LD
#
tantek
it's a trivial debunking, that's all
#
tantek
if you disagree - you can provide research that substantiates LD
#
tantek
until then - there is no point to it
#
csarven
That's trivial. Data exists outside of Web pages that are not "common".
#
tantek
if it's so trivial, point me to a URL to research
#
tantek
barring that, the research doesn't exist, because it's not trivial, or the problems don't actually exist that *require* LD
#
csarven
You want me to point you to some research that says "data exists everywhere... not only on web pages"?
#
tantek
no to specific such data
#
tantek
that somehow has a specific aspect that *requires* LD
#
tantek
point to actual research, not meta research
#
csarven
LD is a pretty good candidate. How about that? If there is an alternative approach (and often there and being argued) that can be compared.
#
csarven
That's an axiom.
#
tantek
nah - you have no problems being solved, so it's just theoretical handwaving
#
tantek
it's philosophy, not science
#
csarven
RDF is a a good candidate for the problem space. And it is based on EAV. .. Just as mf2. The fact that they differ on syntax/namespaces or not.. or whatever, it is a very minor
#
csarven
Are you serious?
#
tantek
you sound like you're actually asking for extensible vocabulary though, not triples, by your referencing "data exists outside"
#
csarven
Do you expect CERN to output their data from LHC into Web pages?
#
csarven
(I'm not arguing about LD here.. but that data exists elsewhere and that needs to be captured and modelled..)
#
tantek
so now you're approaching a problem statement - so that's better
#
tantek
can you point to a URL documenting the specific problems of CERN needing to output the data from the LHC?
#
csarven
Uhm.. they already do! http://opendata.cern.ch/
#
csarven
And there is more to it. One can't expect all roads to lead to mf.
#
tantek
that's a strawman
#
tantek
no one said all roads
#
tantek
I'm just saying I don't accept any "this is a solution!" statements without documentation of the problem
#
csarven
mf is not intended to deal with all those "problems". And that is perfectly fine. Just because mf can't, it doesn't mean that others are irrelevant or are "unnecessarily complex".
#
tantek
still don't see any documentation of any such problems
#
tantek
sorry - you're not providing *any* actual problem documentation
#
csarven
Well, if you want an occam's razor, then EAV, RDF are good candidates.
#
tantek
therefore you can't argue about it
#
csarven
We are discussing!
#
csarven
You want me to address all your issue with URLs on the spot?
#
tantek
nope, occam's razor is property:value works, don't need triples
#
csarven
Especially when you leave an unscientific statement like "unnecessarily complex" up on the wiki?
#
csarven
But then go ahead and argue for something scientific?
#
tantek
if you can't back up your claims about problems with documentation of specific problems, your arguments are baseless
#
csarven
.. for LD?
#
csarven
C'mon.
#
tantek
unnecessarily complex -> occam's razor
#
tantek
already answered, quit asking same question
#
csarven
I've already explained to you that data exists everywhere. That's trivial. That's an axiom. Can we not agree to that?
#
tantek
nope. document a speciifc problem.
#
tantek
not some handwaving about data everywhere
#
csarven
We have a lot of data, and we want to "connect" this data with each other so we can have a interesting insights about societies, build better systems, make better decisions...
#
csarven
Ok.
#
csarven
That's *good enough*
#
tantek
again you're speaking in generalities
#
csarven
Not at all.
#
tantek
stop describing, and start providing URLs to documentation of specific research
#
csarven
Very concrete.
#
csarven
Did you skip over the whole Data Science trend nowadays?
#
tantek
don't care. specific URL or stop talking.
#
csarven
You are asking me to justify the problem for the users for CERN's data.
#
csarven
.. practically.
#
csarven
:)
#
tantek
"socieites", "systems", = generic
#
tantek
nothing specific
#
csarven
Okay, lets leave it at that.
#
csarven
The moment you are tellin gme to stop talking ... well, there is no discussion.
#
tantek
right, no point in any discussion since you cannot provide a specific URL to specific research about a specific problem
#
csarven
I think TimBL made a pretty good case about "linking data" 25 years ago aka Web.
#
tantek
barring that, no need for tripls/RDF etc.
#
csarven
Do we need to revisit that?
#
tantek
the web didn't need RDF/LD
#
tantek
and succeeded without it
#
tantek
more occam's razor
#
tantek
thanks for the proof
#
csarven
Web didn't need HTML5+JS+Flash... either
#
csarven
Web succeeded because of HTML.
#
csarven
More generally about linking documents.
#
tantek
yup - and the features added to HTML5 were all added one at a time based on documented use-cases
#
csarven
Linking "things" is a specialization of that.
#
tantek
web succeeded because HTML was *simple*
#
csarven
Agreed.
#
tantek
TimBL said so himself
#
csarven
Yes, and that he decided on HTML instead of something like TeX
#
tantek
generalizing and building abstractions without a problem to solve is philosophy not sceince
#
csarven
But the point is that, HTML opened up the idea for linking stuff across the globe. I fyou have some data and put it up somewhere, we can link to it.
#
tantek
here's the difference
#
tantek
HTML5 audio and video tags - clear documented use-cases
#
tantek
LD/RDF abstractions - no clear documented use-cases
#
csarven
I'm sorry to say but, I strongly dislike your position on mf being somehow "scientific", but that upper-case SW or LD is not.
#
tantek
science involves documenting your problems, and research
#
tantek
SW/LD advocates don't actually bother with that - they just invent stuff and prescribe it
#
tantek
no homework, no showing of steps
#
tantek
and frankly, there were areas where we didn't do enough documentation with microformats (classic) either
#
tantek
and most of those failed
#
csarven
I will entertain your idea for a moment. But, have you heard of "stamp collecting"?
#
tantek
we were not *strict* enough
#
tantek
with asking for documented research
#
tantek
the irony of LD advocates - they can't provide links to back up their statements
#
csarven
If mf was so "scientific", I'd expect a proper methodology. Starting from hypothesis and null hypothesis, and moving up. Certainly that's not the case. Did mf reject a null hypothesis somewhere? Is that in the wiki?
#
csarven
mf is "stamp collecting" just as much as SW/LD/RDF
#
csarven
Information Science.
#
csarven
Where is the hypothesis?
#
tantek
you start with not needing anything
#
tantek
and then documentation is the first step - of the problem etc.
#
tantek
you're leading with hypothesis and that's your problem
#
tantek
with science, you lead with *observation*
#
tantek
i.e. research
#
tantek
then you document it
#
csarven
That's an axiom. I'm looking for a hypothesis. And that at some point, mf rejected the null hypothesis and went along with the altnerative. Where is that mentioned clearly?
#
tantek
only after you have documented observations do you go to a hypothesis
#
tantek
that's scientific method 101
#
csarven
".... hence, we reject the null hypothesis "
#
tantek
LD/RDF advocates skipped the observation and documentation steps
#
tantek
so thus, unscientific
#
csarven
What you are talking about is stamp collecting. Not some brute force testing.
#
csarven
tantek Like I said, where is the blurb on rejecting the null hypothesis in the mf wiki?
#
tantek
scientific method doesn't need reject null hypothesis
#
tantek
thus we don't need it
#
tantek
we document existing real world user problems through observation
#
csarven
"All science is either physics or stamp collecting" -- Lord Rutherford
#
csarven
Thanks. So, again, mf is as "scientific" as SW/LD.
#
tantek
nope because we document our problems
#
tantek
and require it in our method
#
csarven
Unless you want to show me that hypothesis, than we can classify mf taking on the "hard-science" approach.
#
tantek
whereas SW/LD folks make up vocabularies first, then try to apply them
#
csarven
Ok. I stand by my position. I don't think we are disagreeing.
#
tantek
it's ok - eventually made-up stuff without documented problems / use-cases whithers and falls by the wayside
#
csarven
They didn't come up with a vocab out of thin air. Surely that's based on observing patterns or needs. You may argue that their documentation sucks (and I won't necessarily disagree with that). However, it is wrong to suggest that they are somehow doing something that's not scientific.
#
tantek
why are you assuming "based on observing patterns or needs"?
#
tantek
that's your flaw
#
tantek
I'm asking for proof in the form of a URL to documentation of observing patterns or needs
#
tantek
but you're willing to accept it on faith
#
csarven
No, that's your flaw. Just because yo udon't know it, it doesnm't mean that it doesn't exist.
#
tantek
so without that documentation I say it's a waste of time
#
csarven
That's a clear distinction to be made.
#
tantek
it doesn't exist until evidence is provided
#
csarven
Personally, I am a pragmatic.
#
csarven
I don't see a flaw in there :)
#
tantek
you're taking it on faith
#
tantek
I'm saying I don't believe it until you give me a URL to the documentation
#
csarven
We all start with axioms.
#
tantek
not how you do science sorry
#
tantek
you start with observation and documentation
#
tantek
philosophers start with axioms
#
tantek
hence my point about SW/LD being philosophy, not science
#
csarven
You may not believe it because you haven't seen a documentation, yet, you come up with a belief that something is "unnecessarily complex" because that's ... occam's razor?
#
csarven
Do you realize how absurd that sounds?
#
tantek
no that's the default
#
tantek
without evidence, something is unnecessary
#
tantek
do you know how absurd it is to suggest otherwise?
#
tantek
to suggest you need something without evidence?
#
tantek
that's called marketing
#
csarven
The fact that there are "observable" 65 billion triples + across ... is not some "philosophy". It exists. Deal with it.
csarven joined the channel
#
csarven
Sorry, ran out of battery :) ... And you need to go to bed :)
#
csarven
(if have not already)
#
csarven
Any way.. I appreciate the chat regardless
#
tantek
csarven, again, I'll leave you with, why is it so hard for LINKed data advocates to actually provide LINKs to substantiate their arguments? ;)
#
csarven
I tried to explain.. but I probably didn't do a good job. I'm fairly certain that you are quite aware of the SW/LD position. I suspect that issues are not due to technical differences. Some of the arguments against SW/LD (from the mf position) has different roots - some of which I'm aware but that's not the point.
#
tantek
this is not unique to SW/LD btw
#
csarven
So, when a debate arises, it is not essentially about the technical differences. It gets philosophical.
#
tantek
most standards (web or otherwise) don't provide documentation of their problems and use-cases
#
tantek
which means they get bloated and political
#
csarven
I agree.
#
tantek
instead of simple and pragmatic
#
tantek
SW/LD is just one example
#
tantek
a specific example
#
tantek
but there are many (most?) others
#
csarven
That's all valid. But, poor communication on that front doesn't equate to problem existing. Communicating well is an art.
#
csarven
So, don't let the SW/LD "research" "papers" get in the way.
#
tantek
not even asking for good documentation
#
tantek
just *some* real world documentation
#
tantek
yeah the research paper problem
#
csarven
IMO, this is a solid documentation as it gets: http://www.w3.org/History/1989/proposal.html
#
tantek
the fact that they're not publishing on the web at stable URLs with open acccess
#
csarven
In there, I can see SW/LD/mf... all coexisting, and they do!
#
tantek
can coexist doesn't mean must
#
tantek
that's the point
#
tantek
from a pragmatic minimalist viewpoint, everything must be justified
#
tantek
not just by political statemetns like "coexist"
#
csarven
I think that's the point. What you just said.. Many see SW/LD as sufficiently justified.
#
tantek
you're right that there's a lot of specific problems described in http://www.w3.org/History/1989/proposal.html
#
tantek
every standard / spec developed is seen as sufficiently justified by "many"
#
tantek
that doesn't mean they are actually justified, by documented research
#
tantek
it would be an interesting exercise to extract the specific problems mentioned/described in http://www.w3.org/History/1989/proposal.html and document them at their own URLs
#
csarven
You know, many in the LD community disagree on what LD is too. There is an RDF-only camp vs. RDF is one of many.
#
tantek
is that a syntax argument? or a model argument?
#
tantek
edited /to-do (+316) "more documentation and research, extract from TimBL's 1989 proposal"
(view diff)
#
tantek
csarven thanks for the reminder about and URL for TimBL's paper and his documentation or at least referencing of specific use-cases
#
csarven
Essentially syntax on the surface but I would say both. Some view HTML/mf/Microdata to all belong to the LD goal.
#
csarven
All "linked data". As opposed to whatever "Linked Data" is.
#
csarven
TimBL's http://www.w3.org/DesignIssues/LinkedData originally didn't mention RDF/SPARQL. There was an update to include them. So, some confusion arises from that as well.
#
tantek
yeah - most use of microformats2 is in a very lowercase "linked" data way - with emphasis on URLs
#
tantek
except without all the formalities with URLs for predicates
#
tantek
links for the *data* that is, not the predicates/relationships/vocab
#
csarven
Yeap. All URIs are welcome in RDF, but for LD, HTTP is most *useful*
#
tantek
oh that distinction. yeah the URN thing was hilarious.
#
tantek
Urns are what you put dead things into ;)
#
csarven
Pretty much. Just a string. Essentially good as any other unique srying
#
tantek
right - and thus not as valuable / useful as an actual *link*
Acidnerd joined the channel
#
@BloggingStart
Author hReview Coupon – Add Star Ratings And Boost Search Engine Results With Author hReview… http://www.bloggingstart.com/author-hreview-coupon/
(twitter.com/_/status/559682500012670976)
Musk joined the channel
#
@SEMrushcoupon
Author hReview Coupon – Add Star Ratings And Boost Search Engine Results With Author hReview: One of the bigge... http://www.bloggingstart.com/author-hreview-coupon/
(twitter.com/_/status/559694653021118464)
krijnhoetmer, ChiefRA, prtksxna, pfefferle, TallTed, pfefferle_, AndrewBC__, gRegor`, kez_ and tantek joined the channel
#
@Polo_Seo
@kmaaouni lá ils ont celui lá en place hreview-aggregate
(twitter.com/_/status/559766170971430914)
voxpelli, prtksxna, chiui, pfefferle, KartikPrabhu, csarven, Musk, Phae, benward______, tommorris, bret and twisted` joined the channel