2012-04-19 UTC
tantek, Chief|AFK, jkridner_ and voxpelli joined the channel
chiui_ and csarven joined the channel
# 08:13 tot ra created /xrate (+1420) "New page: == Problem == Explicit content is a media that affects human psyche and disturbs it. Formatting violent, pornographic and other content should allow control with browser plugins, search cr..." (
view diff )
rossputin and danbri joined the channel
nonge_ joined the channel
weavejester joined the channel
andr3 joined the channel
MacTed, tantek, Rasifiel, csarven, danbri and barnabywalters joined the channel
# 16:54 Loqi Got it! There are now 30 spammers blacklisted
jkridner_, patcito, rossputin, weavejester, barnabywalters and tantek joined the channel
# 19:21 rossputin Hi, we have an idea for a sitemap protocol microformat which we have not seen under existing, draft or exploratory-discussions
# 19:21 rossputin We would therefore like to ask here on IRC whether anyone has started any such effort or whether it has been previously dismissed/abandoned ?
# 19:22 rossputin Is it permissible to post a link to an informal document detailing our idea ?
# 19:22 tantek have you found interesting examples / patterns of publishing sitemaps?
# 19:23 rossputin yes, I ticked off the previously mentioned steps so far
# 19:23 rossputin and thought IRC was next ?
# 19:23 rossputin I am working through that page this evening
# 19:23 tantek so you've started documenting examples and previous formats?
# 19:23 rossputin have not done the interesting examples step yet
# 19:24 tantek then you haven't ticked off the previously mentioned steps before jumping to detailing an idea
# 19:24 tantek ideas/brainstorming must always come *after* research into existing publishing examples and previous formats
# 19:25 rossputin Hi apologies...
# 19:25 rossputin I interpreted the 'Why?' section a little different
# 19:26 weavejester Isn't microformats.org a fairly complete list of all known microformats?
# 19:26 rossputin so now I should be looking at 'Why examples first?' under 'Document Current Behaviour' ?
# 19:26 weavejester i.e. if an equivalent isn't on microformats.org, can we assume that one doesn't exist?
tantek joined the channel
# 19:28 tantek rossputin - you did the right thing to come to IRC to talk about it
# 19:29 rossputin just moving on to 'Why examples first' now
# 19:29 tantek from the Why section … what's the real world use case?
# 19:29 tantek right - always good to discuss the use case first
# 19:29 weavejester barnabywalters: Ah good.
# 19:30 weavejester I've been meaning to submit a microformat draft for documenting forum posts (followup, author, in-reply-to, etc.)
# 19:30 weavejester Though I haven't got around to writing it up yet.
# 19:30 tantek weavejester - good thing, because submitting a draft is premature without first researching the area and documenting your research etc.
# 19:31 rossputin ok so, sorry for the confusion but do I state a use case in here first before writing anywhere else ?
# 19:31 tantek why is it useful to markup a sitemap in HTML?
# 19:32 tantek (if so, which sites, etc. and have you documented any limitations on the respective issues page?)
# 19:32 rossputin we think machines could generate a sitemap.xml by reading the ever present nav on so many sites… if a little semantic information was provided for priority/changereq etc
# 19:33 rossputin we have a requirement to do it ourselves in a product we are working on
# 19:33 weavejester tantek: hatom seemed to be linear last time I looked.
# 19:33 rossputin but think it is useful outside of our toolsets
# 19:33 tantek ok, so automatic generation of sitemap.xml from existing HTML, rather than having to hand-author sitemap.xml - that's a decent use case.
# 19:33 barnabywalters I thought sitemap.xml was intended to tell search engines about sites that they weren't likely to find out about via crawls?
# 19:33 rossputin currently not done AFAIK
# 19:33 rossputin it is intended to provide ancillary info to crawlers
# 19:34 rossputin such as how often is the site updated...
# 19:34 weavejester tantek: I was thinking of a microformat to semantically document a hierarchical set of messages, such as a forum, or newsgroup archive.
# 19:34 rossputin and how important does the author think certain pages are etc
# 19:34 rossputin sort of tailoring the crawl process a little
# 19:38 tantek weavejester - ok I think I understand the publishing distinction (nesting, threads?). now what's the use-case? how would marking up a hierarchical set of messages, such as a forum, or newsgroup archive be useful?
# 19:39 rossputin ok, I will do
# 19:39 rossputin thank you for the feedback and guidance guys
# 19:40 tantek rossputin - no problem - I'll be back in a bit
# 19:41 tantek also - this conversation is helping to improve the process - I'm going to edit it to make the next steps clearer
# 19:41 tantek (i.e. including what I just noted above about starting with the comment page as an example)
# 19:42 weavejester tantek: Well, personally speaking I want to be able to store a forum archive in a format that is both human readable and parsable by a machine. By adding in information like rel="followup", a parser can understand the structure of the archived posts, so that it doesn't confuse a link referencing a previous post, with a link to the response of a post. If that makes sense.
# 19:42 weavejester Ah, he/she left.
# 19:45 weavejester I might implement it in code first, so I'm sure it's useful, before publishing a draft. If I use it in anger, I have a better idea of which elements of the design work and whether it's useful.
chiui_ joined the channel
# 19:49 weavejester barnabywalters: One post could have many followup links.
# 19:50 barnabywalters yep, but are all the responses considered responses to the original post, or can responses be directed at other responses?
# 19:50 weavejester Ah, no, I'm assuming that there's one post per page.
# 19:51 weavejester Yes, if there are multiple messages on one page, it's difficult to use links to structure them
# 19:51 weavejester And for the job I need it for, each message will have its own URL.
# 19:52 weavejester Most forums have "permalinks" to posts anyway.
# 19:53 weavejester It might be that the scheme I have in mind has limited use, so perhaps I should just implement it for the use I have in mind first.
tantek joined the channel
# 19:54 weavejester The hard part is doing that without making it more complex :)
# 19:55 barnabywalters my publishing system currently has a pertaining_to field, which is pretty abstract
# 19:57 barnabywalters how about marking up <blockquote> elements with links back to the posts they originated from?
# 19:59 barnabywalters so you could link each response to the original post, link any blockquoted material back to the post it's from, and order the whole shebang by the publish date to get a linked-up thread
# 20:00 weavejester barnabywalters: Perhaps. It's starting to get beyond the itch I want to scratch though ;)
# 20:00 weavejester barnabywalters: Linking back blockquoted material is a good idea, though...
# 20:06 rossputin created /sitemap (+309) "New page: <entry-title>Sitemap</entry-title> Enable the automatic generation of sitemap.xml from existing HTML by specifying a minimal amount of semantic information. Per the microformats [[proces..." (
view diff )
# 20:07 weavejester barnabywalters: Essentially web site A wants to be able to import threaded messages from web site B
# 20:09 weavejester barnabywalters: Yes. I can also assume that each post is its own resource.
# 20:10 barnabywalters so site A needs to be able to discover all possible replies to a given resource, and order them
# 20:11 weavejester Right. And also any parents. I was thinking links for "followup" and "in-reply-to". Also "author" (already in use) and "recipient".
# 20:12 weavejester The subject of the message would just be the title
Rasifiel joined the channel
# 20:12 weavejester The body would, well, be the body.
# 20:12 barnabywalters so, given any thread, Site A needs to traverse both up and down the thread and discover all of the posts related to the one it's given
# 20:13 weavejester barnabywalters: Yes… I hadn't thought much about transversing up, but I guess it should do.
# 20:13 weavejester barnabywalters: I was going to sanitize the HTML and cache the result.
# 20:15 weavejester I'll have to see how it works in practise
# 20:15 weavejester The other way is to just use JSON, but...
# 20:16 weavejester Semantic HTML seems a better solution.
# 20:16 barnabywalters look at it this way: if you do an implementation yourself, you'll have a really good example if you want to make a threading-examples page on the wiki ;)
# 20:16 weavejester barnabywalters: Hah! Yes, that's true. :)
# 20:22 weavejester barnabywalters: Ultimately my plan is to make plugins for this.
# 20:22 weavejester barnabywalters: Effectively my plan is to send POST requests with a single "href" parameter that signifies a new or updated resource
# 20:22 weavejester barnabywalters: So two forums could effectively stay in sync via an open protocol.
# 20:23 weavejester barnabywalters: I wasn't aware of activitystreams. I'll take a closer look.
# 20:23 weavejester Though at first glance it seems more complex.
# 20:24 weavejester I wanted something where I could write: curl -X POST -d "href=http://..."
# 20:24 weavejester And for that to be basically the entire protocol :)
# 20:25 weavejester Plus some microformat on the resource itself.
# 20:27 barnabywalters well, I think that using in-reply-to and responses(/whatever ) is going to be the simplest and most solid solution
danbri_, globbot, danbri, krijnh, KevinMarks and andr3 joined the channel