• #dev 2022-11-26
  • Prev
    Next
  • #indieweb
  • #dev
  • #wordpress
  • #meta
  • #stream
  • #microformats
  • #known
  • #events
#dev ≡
  • ←
  • →
2022-11-26 UTC
# 09:47
[KevinMarks]
Corlaez: depends what you're trying to do. Beautiful Soup will do a good job of letting you pull specific bits out of a page, but you are going to have to maintain a lot of mappings. There's granary.io which parses a lot of places into a common format, and there's https://github.com/postlight/parser which works on a lot of news sites. There's https://indieweb.org/XRay too.