Extracting content from web articles and displaying it nicely

I am trying to build something that lets people enter the URL of an article, for example one from The Verge. It should read the article at that URL and display it in a clean way, like Readability does. But I am really stuck: I can't find information anywhere on how to do it. Is there an API for this? Essentially, instead of scanning a whole RSS feed, it would process only a single article.

You are looking for boilerpipe. It should do exactly what you want. There is even a web API. You can also download the module and use it from a Python script.

You can test it out on an article of your choice here: http://boilerpipe-web.appspot.com. Just select ArticleExtractor as the extractor.

Simple HTML DOM, a PHP library, should be the easiest way: http://simplehtmldom.sourceforge.net/

You can target elements with selectors, much as you would with CSS or jQuery.
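To illustrate the selector idea without relying on any particular library, here is a minimal Python sketch using only the standard library. It is not Simple HTML DOM itself. The sample page, the `article` class name, and the function `select_article_paragraphs` are all made up for the example, and `xml.etree.ElementTree` only accepts well-formed markup, which real pages often are not.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample page (an assumption for this sketch).
PAGE = """<html><body>
<div class="nav"><p>Home | About</p></div>
<div class="article"><h1>Sample title</h1><p>First paragraph.</p><p>Second <b>paragraph</b>.</p></div>
</body></html>"""


def select_article_paragraphs(markup):
    """Return the text of each <p> directly inside <div class="article">."""
    root = ET.fromstring(markup)
    # Roughly the CSS selector "div.article > p" (exact class match only).
    nodes = root.findall(".//div[@class='article']/p")
    # itertext() also collects text inside nested tags such as <b>.
    return ["".join(p.itertext()).strip() for p in nodes]


if __name__ == "__main__":
    for text in select_article_paragraphs(PAGE):
        print(text)
```

The path expression plays the role of the selector: change it and you target a different part of the page.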

You can do this quick-and-dirty with regular expressions, or you can parse the page into a DOM. Note that a solution that works for one website is very unlikely to work for another without changes, whether you use regex or properly parse the DOM.
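As a rough sketch of the two approaches, the Python snippet below extracts paragraph text once with a regular expression and once with the standard library's `html.parser`. The sample markup, the assumption that the article lives inside an `<article>` tag, and the names `paragraphs_regex`, `ArticleParagraphs` and `paragraphs_parser` are all hypothetical. A different site would most likely need different rules.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample markup (an assumption for this sketch).
SAMPLE = (
    '<article><h1>Title</h1><p>Hello <a href="#">world</a>.</p>'
    '<p class="x">Bye.</p></article><p>Footer</p>'
)


def paragraphs_regex(html):
    """Quick and dirty: grabs every <p>...</p>, wherever it appears."""
    raw = re.findall(r"<p\b[^>]*>(.*?)</p>", html, flags=re.S | re.I)
    # Strip any tags left inside each paragraph.
    return [re.sub(r"<[^>]+>", "", chunk).strip() for chunk in raw]


class ArticleParagraphs(HTMLParser):
    """Collects the text of <p> elements that sit inside <article>."""

    def __init__(self):
        super().__init__()
        self.article_depth = 0
        self.in_paragraph = False
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self.article_depth += 1
        elif tag == "p" and self.article_depth:
            self.in_paragraph = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "article" and self.article_depth:
            self.article_depth -= 1
        elif tag == "p":
            self.in_paragraph = False

    def handle_data(self, data):
        if self.in_paragraph:
            self.paragraphs[-1] += data


def paragraphs_parser(html):
    parser = ArticleParagraphs()
    parser.feed(html)
    parser.close()
    return [text.strip() for text in parser.paragraphs]


if __name__ == "__main__":
    print(paragraphs_regex(SAMPLE))
    print(paragraphs_parser(SAMPLE))
```

On this sample the regex version also picks up the footer paragraph, while the parser version can skip it because it tracks which element it is inside. Both still depend on the site-specific `<article>` assumption.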