#dev 2016-08-13

2016-08-13 UTC
KevinMarks, KevinMarks_, doesntgolf and ChrisAldrich joined the channel
# 03:59 
loqi.me created /Robert_Scoble_Effect (+240) "prompted by ChrisAldrich and dfn added by ChrisAldrich"
miklb, KevinMarks_ and snarfed joined the channel
# 06:10 
snarfed hey aaronpk, for indieauth's auth flow, if i don't include a state param when i redirect, but it gives me a state param value back in the callback URL, is that expected?
# 06:10 
snarfed did that maybe change recently?
gRegorLove joined the channel
# 06:19 
snarfed interesting, doesn't always happen. looks like this was an exception
# 06:19 
snarfed happy to send details if you want
KevinMarks, tantek_, tantek and cmal joined the channel
# 10:45 
tantek hello briefly #indieweb-dev!
# 10:52 
voxpelli hello tantek!
# 11:00 
cmal hello people:)
# 11:52 
cmal hmmmm, it seems surprisingly tricky to insert data in the middle of a file
# 11:52 
cmal I'm genuinely surprised, out of all the bottlenecks I was expecting… this, I was really not expecting :D
# 11:53 
@franckpaul @elpep make use of the warranty! Get yourself refunded or, better yet, publish a post and we'll do webmentions … #ohwait (twitter.com/_/status/764429685526827008)
# 11:53 
cmal so if anyone has pointers (pun intended) regarding a filesystem or just about anything that could allow me to inject binary data at a certain point without overwriting the data already present, and without requiring to read/copy the entire file content… :-/
# 11:56 
cmal (full disclosure: I was just trying to implement some binary search to find entries (works fine) and insert entries (doesn't) in huge ordered collections)
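A minimal in-memory sketch of the find/insert pair cmal describes, assuming entries are keyed by a sortable timestamp. The function names are hypothetical and this is not cmal's code. The list insert still shifts every later element, which is the same cost the on-disk file runs into.

```python
import bisect

def find_entry(timestamps, ts):
    """Return the index of ts in the sorted list, or None if it is absent."""
    i = bisect.bisect_left(timestamps, ts)
    if i < len(timestamps) and timestamps[i] == ts:
        return i
    return None

def insert_entry(timestamps, ts):
    """Insert ts so the list stays sorted; return the index it was placed at."""
    i = bisect.bisect_right(timestamps, ts)
    timestamps.insert(i, ts)  # O(n): every later element moves one slot
    return i

entries = [100, 200, 400]
pos = insert_entry(entries, 300)
```

Finding the position is logarithmic either way; only the insert itself needs the surrounding data to move.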
# 12:05 
cmal or I could just give up on the time-ordering, but then, if I'm adding an article to a collection (say, a tag), it will appear at the top of the collection (which is expected in some collections for reshare purposes, but I would not expect this behaviour in streams as defined by ActivityPub)
# 12:15 
cmal the closest I've found so far would be FALLOC_FL_INSERT_RANGE (supported by ext4 and xfs), but the practical limitation is that the size of the inserted data must be a multiple of the logical block size (to keep block-alignment)
# 12:19 
cmal so that would mean inserting data in the middle of the file would require allocating N blocks and rewriting the data from the following blocks there until we reach alignment, which is probably less horrible than rewriting all the data until EOF, but is tricky when the size of the data doesn't really fit in the block (for example, with 200-byte blocks, inserting 199 bytes of data would be terrible)
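The alignment arithmetic behind that constraint can be sketched as follows. This only computes sizes and does not call fallocate; the function name and the 4096-byte block size are assumptions for illustration.

```python
def insert_range_plan(data_len, block_size):
    """Return (blocks_to_insert, leftover_bytes) for a block-aligned insert.

    leftover_bytes is the gap between the inserted range and the payload,
    which has to be padded or filled by shifting neighbouring data.
    """
    if data_len <= 0 or block_size <= 0:
        raise ValueError("data_len and block_size must be positive")
    blocks = -(-data_len // block_size)  # ceiling division
    leftover = blocks * block_size - data_len
    return blocks, leftover

plan_small = insert_range_plan(199, 200)    # cmal's example sizes
plan_large = insert_range_plan(5000, 4096)  # assumed 4 KiB blocks
```

Whenever the leftover is non-zero, something has to account for those bytes, which is the awkward part cmal points at.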
mindB joined the channel
# 12:33 
cmal or my collections could just be comprised of symlinks to smaller indexes which could then be rewritten as needed, I guess?
# 12:45 
sknebel cmal: maybe there is something tricky you can do with sparse files, at least for a while.
# 12:46 
sknebel cmal: also, look into how real databases structure their data on disk for inspiration
# 12:47 
cmal sknebel: I'll check this out, thanks :)
# 12:47 
sknebel an external index structure doesn't have to be so bad
# 12:50 
cmal yeah I'm not really convinced by sparse files, mostly because the program running will just see a bunch of empty bytes (which somewhat defeats my attempt to move throughout the file using fseek)
# 12:51 
sknebel yeah, you'd need to store offsets somewhere. and then you could just have random order and keep the data-offsets in an external file...
# 12:53 
cmal not sure I need them at all, I could just implement binary search on top of these chunks, first looking into the middle of the middle-chunk, etc…
# 12:54 
cmal or I could indeed store the offset along with the chunk reference in the main index and that would make finding the right timespan even easier I guess
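One way the main-index idea could look, assuming the index keeps each chunk's first timestamp next to the chunk reference. The index contents, chunk names and function name are all hypothetical.

```python
import bisect

# Hypothetical main index: (first_timestamp, chunk_name), sorted by timestamp.
main_index = [
    (1000, "chunk-0000"),
    (2000, "chunk-0001"),
    (3000, "chunk-0002"),
]

def chunk_for(index, ts):
    """Return the name of the chunk whose time span should contain ts."""
    starts = [start for start, _ in index]
    i = bisect.bisect_right(starts, ts) - 1
    if i < 0:
        i = 0  # older than every chunk: it belongs in the first one
    return index[i][1]
```

Once the chunk is known, only that small file has to be searched or rewritten, so an insert never touches the rest of the collection.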
doesntgolf joined the channel
# 12:55 
cmal yeah I don't know, I'll do some thinking, some digging around and some experiments and we'll see :D
# 12:55 
cmal thank you so much for the ideas sknebel :-)
doesntgolf, KevinMarks, ChrisAldrich and KevinMarks_ joined the channel
# 16:35 
KevinMarks_ If you're doing your own block management you're recreating the file system, but without the hardware coupling. You're probably better off just rewriting the file
# 16:41 
aaronpk yeah that sounds like a path i never want to go down :)
# 16:42 
aaronpk in my QuartzDB i only ever append to the file. i made a "maintenance task" that I can run periodically to re-sort the file if I'm worried that some data was inserted out of order.
# 16:42 
aaronpk that task really just re-creates the file from scratch
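A rough sketch of that append-then-resort pattern. This is not QuartzDB's actual code: it assumes one record per line, each starting with an ISO 8601 UTC timestamp so that string order equals time order, and the function names are hypothetical.

```python
import os
import tempfile

def append_record(path, timestamp, payload):
    """Append one 'timestamp payload' line; never rewrites existing data."""
    with open(path, "a", encoding="utf-8") as f:
        f.write(f"{timestamp} {payload}\n")

def resort_file(path):
    """Maintenance task: re-create the file from scratch in timestamp order."""
    with open(path, encoding="utf-8") as f:
        lines = f.readlines()
    lines.sort(key=lambda line: line.split(" ", 1)[0])
    # Write to a temporary file in the same directory, then swap it in.
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w", encoding="utf-8") as tmp:
        tmp.writelines(lines)
    os.replace(tmp_path, path)
```

This version reads the whole file into memory, which is fine for modest files; a larger file would need an external sort.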
# 16:50 
KevinMarks_ Right, which is pretty much what databases do too
# 16:50 
aaronpk yeah
# 16:51 
KevinMarks_ Defragmentation
# 16:51 
KevinMarks_ Though on ssd that is moot really
# 16:52 
aaronpk well i want the file in sort order so it's easier to seek through in my code
# 16:52 
aaronpk i just do a simple iteration over each line
ChrisAldrich joined the channel
# 17:42 
cmal I'm actually running down the path of a collection comprised of different chunks (each one in a file) and I think it's not such a bad way to deal with the issue at all
# 17:42 
cmal I mean, let's talk about it again when I'm done implementing and I have actual code to show :)
# 17:43 
cmal (hopefully just a few hours/days of messing around ^^)
# 19:04 
sknebel cmal: since I don't think I mentioned it: my solution for time-ordered posts is a directory full of symlinks, named after the timestamp ;)
# 19:08 
cmal I've thought about this, but then depending on your filesystem your folder may end up full
# 19:09 
cmal also I'm curious, how do you retrieve the latest articles?
# 19:09 
aaronpk the main limitation i've found with working with filesystems is having a single folder filled with tons of files
# 19:10 
aaronpk so i always break mine up, usually by year/month/day
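A small sketch of a year/month/day layout like the one described. The root folder, the slug and the ".json" extension are assumptions, not aaronpk's actual structure.

```python
from datetime import datetime, timezone
from pathlib import PurePosixPath

def post_path(root, published, slug):
    """Build root/YYYY/MM/DD/slug.json so no single folder grows unbounded."""
    return (
        PurePosixPath(root)
        / f"{published:%Y}"
        / f"{published:%m}"
        / f"{published:%d}"
        / f"{slug}.json"
    )

example = post_path(
    "data",
    datetime(2016, 8, 13, 19, 10, tzinfo=timezone.utc),
    "css-thumbnails",
)
```

Zero-padded components keep directory listings in chronological order when sorted by name.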
# 19:13 
sknebel cmal: right now I don't have that many posts, once that becomes an issue I'll probably add directory layers like aaronpk said. Or replace the directory with sqlite, or ...
# 19:13 
cmal fair enough :-)
# 19:13 
sknebel I can always get the timestamp from the post metadata, this is just for easy lookup
# 19:15 
sknebel (and yes, listing all files in a directory is a relatively slow operation, but I'm sure there are a lot of relatively slow operations in my setup ;))
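A possible answer to the "latest articles" question for a symlink directory, assuming the link names are timestamps that sort lexicographically (for example 20160813T190400Z). The function names are hypothetical and this is not sknebel's code.

```python
import os

def link_post(index_dir, timestamp, target):
    """Create a symlink named after the timestamp, pointing at the post."""
    os.symlink(target, os.path.join(index_dir, timestamp))

def latest_posts(index_dir, count):
    """Return the resolved paths of the newest `count` posts, newest first."""
    names = sorted(os.listdir(index_dir), reverse=True)  # lists the whole directory
    return [os.path.realpath(os.path.join(index_dir, n)) for n in names[:count]]
```

The full listing on every call is the slow step sknebel mentions; adding directory layers would limit how much has to be listed.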
# 19:25 
cmal :)
doesntgolf, KevinMarks, ChrisAldrich and cmal joined the channel
# 20:05 
aaronpk wow neat SVG tricks, i bet KevinMarks would like this http://codepen.io/tylersticka/pen/NAojkB?editors=1100
# 20:05 
aaronpk that was in response to my CSS post https://aaronparecki.com/2016/08/13/4/css-thumbnails
# 20:05 
Loqi [Aaron Parecki] Centered and Cropped Thumbnails with CSS https://aaronparecki.com/2016/08/13/4/cropped-thumbnails.jpg
# 20:10 
KevinMarks Can't you just use object-fit:cover?
# 20:12 
KevinMarks Might have to polyfill for ie http://caniuse.com/#search=Object-fit
# 20:12 
aaronpk this has better browser support
# 20:20 
cmal aaronpk: there's also https://github.com/imazen/imageflow, but I'm not sure whether the project has stopped or is just on summer vacation :)
# 20:20 
cmal it's not much at the moment, but the goals and means expressed are very interesting
# 20:21 
aaronpk that's overkill :)
# 20:21 
aaronpk and holy crap complicated
# 20:21 
cmal depends for what precisely
# 20:21 
cmal yeah okay that's a fair point :D
# 20:22 
aaronpk also doesn't mention cropping to squares
# 20:23 
aaronpk for all their talk about efficiency, their website is super slow https://www.imageflow.io/
# 20:24 
cmal that's just a jekyll site, doesn't reflect the code :P
# 20:24 
aaronpk uh huh
# 20:24 
aaronpk it does look like a neat project though
# 20:24 
cmal agreed
# 20:25 
cmal as I said, it's the goals and means that are interesting more than the implementation itself (at least for the moment, we'll see what it turns out to be a year from now)
# 20:25 
aaronpk "face-aware cropping" cool
# 20:26 
voxpelli Face-aware seems fairly standard nowadays :)
# 20:26 
aaronpk well i'll keep an eye on that and come back to it in a year or so when i finally get around to starting my shoebox project
# 20:26 
voxpelli Been deploying face-aware cropping on sites for the last 3-4 years I think, including on my current newspaper job
# 20:27 
aaronpk nice, what tools?
# 20:27 
voxpelli Tricky thing with image flow is their Affero license – I won't go near anything with that
# 20:28 
voxpelli aaronpk: Cloudinary at first, now Imgix, both hosted solutions
# 20:28 
aaronpk ah
# 20:28 
voxpelli But there are face-detection scripts one can employ oneself
# 20:29 
aaronpk i'm pretty pleased with my flickr-like autolayout that avoids needing to crop images at all
# 20:29 
aaronpk i'm almost ready to launch it on my site actually
# 20:34 
miklb I looked at Imgix as they have a Jekyll plugin
# 20:36 
aaronpk imgix is kind of expensive. not sure i would use that for a personal site
# 20:36 
voxpelli With both Cloudinary and Imgix, the API is the URL one loads the image through, so they're easy to use anywhere
# 20:36 
miklb that was my final thought, though maybe because I've never served any images, they haven't asked for money after the trial :)
# 20:37 
cmal voxpelli: what's wrong with the AGPL?
# 20:37 
voxpelli I used Cloudinary on personal stuff mainly because they let me upload and store at their place as well
# 20:37 
voxpelli Which I needed due to hosting stuff at Heroku
# 20:38 
miklb voxpelli did you move your blog from gh?
# 20:39 
voxpelli cmal: the way it can taint code and force one to release it wherever one uses it
# 20:39 
voxpelli miklb: no, meant personal as in not my day job, eg. the freelance work I did at the site for The Conference
# 20:39 
miklb ah
# 20:40 
voxpelli My day job uses Imgix
# 20:40 
cmal and what's the problem with releasing code? (I'm sorry, unless it's a pro-proprietary argument I don't see your point)
# 20:41 
voxpelli cmal: when it comes to GPL the web world has had a very easy ride as the server-client pattern meant they never "distributed" any code
# 20:42 
voxpelli So only with AGPL does one get the true feeling of GPL
# 20:42 
voxpelli And I'm no big fan of forceful freedom, I'm more a fan of freedom to be free
# 20:43 
cmal still, I don't see the problem with releasing server code (that's even a common practice nowadays)
# 20:43 
cmal but I get the political argument
# 20:44 
miklb don't see many new projects adopt GPL for those reasons IMO
# 20:44 
aaronpk hey this looks pretty neat https://github.com/thoas/picfit
# 20:45 
voxpelli risk assessments etc. make it a lot harder to include AGPL code than MIT/BSD code. I barely know where to start to get an approval for that kind of code inclusion
# 20:46 
voxpelli I'd much rather have everyone be able to use my code and to contribute to it, no matter if their legal or business teams find the risk of AGPL to be acceptable or not
# 20:47 
cmal although, I'd say a freedom-enforcing mechanism like copyleft, which only places constraints on those trying to privatize stuff, is actually the closest you get to "freedom to be free" in the context of systemic oppressions (like standard copyright when it comes to code)
# 20:49 
voxpelli aaronpk: Nice, I think Will Norris has one as well: https://willnorris.com/go/imageproxy
# 20:49 
Loqi [Will Norris] imageproxy is a caching image proxy server written in go.  It features: 
basic image adjustments like resizing, cropping, and rotation 
access control using host whitelists or request signing (HMAC-SHA256) 
support for jpeg, png, and gif image f...
# 20:51 
voxpelli cmal: the classic case of MIT/BSD vs GPL :) And funnily enough those using AGPL the most are those who want to keep their open stuff as private as possible ;)
# 20:51 
cmal yeah, long story
# 20:52 
voxpelli I've only seen AGPL used when one also sells a proprietary license for the same software, like Neo4j and this Imageflow thing
# 20:52 
miklb wait until you throw Apache Software License into the mix ;-)
# 20:52 
cmal anyway I'll settle for saying this is just legalese bullshit anyway and once we've abolished the State we won't have to talk about such boring lawyering anymore, deal?
# 20:52 
voxpelli miklb: that one actually does some good stuff ;) It handles patents
# 20:53 
miklb yes. I don't disagree with its use. Just adds another layer to the discussion.
# 20:53 
voxpelli Indeed it does :P Happy that software patents don't exist here in Sweden
# 20:53 
miklb We chose ASL for Habari
ben_thatmustbeme joined the channel
# 20:55 
miklb partly because we thought about getting into the Apache Incubator, but mostly for the freedoms it affords
# 20:55 
voxpelli I think Google picks ASL and I guess that can have its benefits over eg Facebook that picked a standard BSD for React, but with a custom patents clause that has caused some drama
# 20:56 
miklb truthfully my eyes glaze over license discussions after a bit :-)
# 20:57 
voxpelli Yeah :P
# 20:57 
cmal :-)
# 20:57 
voxpelli At the end of the day I just want to code and not get sued for doing so :)
# 20:57 
aaronpk whoa http://code.flickr.net/2015/06/25/real-time-resizing-of-flickr-images-using-gpus/
# 20:58 
cmal as long as you share your code under any kind of free license and build blocks to tear down GAFAMs I think everyone will be okay with it ;)
# 20:59 
voxpelli aaronpk: I think Imgix uses GPUs as well, at least I hope so considering their servers: http://photos.imgix.com/racking-mac-pros ;)
# 21:01 
aaronpk holy cow
# 21:03 
sknebel that Imgix stuff always confused me (MacOS APIs can't be *that* superior to make that worth it, can they?), but that's the kind of thing GPUs are really good at, so if you have to do it a lot...
# 21:04 
aaronpk okay that was a fun rabbit hole
# 21:04 
aaronpk now back to what i was actually doing...
ChrisAldrich joined the channel
# 21:23 
aaronpk aaand got it
# 22:10 
aaronpk i really wish css pseudo elements could contain html
# 22:11 
aaronpk although if that were possible i may have just coded an infinite loop in css
# 22:27 
aaronpk omg i think i'm going to be able to launch photo albums
doesntgolf and ChrisAldrich joined the channel