2022-11-26 UTC
# [KevinMarks] Corlaez: depends what you're trying to do. Beautiful Soup will do a good job of letting you pull specific bits out of a page, but you are going to have to maintain a lot of mappings. There's granary.io which parses a lot of places into a common format, and there's https://github.com/postlight/parser which works on a lot of news sites. There's https://indieweb.org/XRay too.