2021-10-06 UTC
# 3 weeks ago LAION-400M dataset (now a billion+), first Image-Alt-text pair dataset of this scale was released. @vinayprabhu, @MannyKayy & I dug into it https://arxiv.org/abs/2110.01963 Long tread 1/ Warning: paper contains NSFW content that may be disturbing, distressing &/or offensive https://pbs.twimg.com/media/FBA9JQvUYAoUeHe.png ( twitter.com/_/status/1445723482231173120)