Diffbot automatically extracts content from any web page into structured content – without rules or training. Products like Instapaper, Digg, Longform, Adobe, and Cisco rely on Diffbot to provide clean text and HTML for articles, discussion threads, image