DiffBot icon
DiffBot icon

DiffBot

*Get data from web pages automatically: Diffbot's computer vision APIs turn the web into your database.

*AUTOMATIC APIs: Extract Automatically Get structured content from articles, products, and other familiar page types.

Image content detection: show all info of the page and image on http://alternativeto.net

Cost / License

  • Subscription
  • Proprietary

Platforms

  • Online
-
No reviews
5likes
0comments
0news articles

Features

Suggest and vote on features
No features, maybe you want to suggest one?

 Tags

DiffBot News & Activities

Highlights All activities

Recent activities

Show all activities

DiffBot information

  • Developed by

    Unknown
  • Licensing

    Proprietary and Commercial product.
  • Pricing

    Subscription ranging between $299 and $3999 per month.
  • Alternatives

    59 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

Development
DiffBot was added to AlternativeTo by Jpotato on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is DiffBot?

Why Diffbot?

We're focused exclusively on getting you better web data. Some of the reasons hundreds of customers make (hundreds of) millions of calls every month:

#The Web's Best Content Extractor:

Diffbot works automatically—without rules or training. There's no better way to extract data from web pages. See how Diffbot stacks up to other content extraction methods: Feature Comparison Text-Extraction Quality Shootout

#Identify Pages Automatically:

Use the Analyze API to automatically find and extract all products, articles, discussions or images while crawling any site. Analyze API

#Detailed product data:

The Product API automatically returns complete product info, including all pricing data, product IDs, brand and full specifications tables. Product API

#Clean text and html:

Articles, discussion threads, product descriptions and image captions are returned in pure text and sanitized HTML. Start testing today

#Structured Search:

Search structured content from any crawl on-the-fly using our Search API, returning only the matching results.

Plus...

¤ All APIs execute Javascript so content is parsed like a regular browser. ¤ Works on most non-English pages thanks to visual processing. ¤ Date normalization: Datestamps are normalized and presented in RFC 1123 (HTTP/1.1) standard format. ¤ Multipage articles are automatically joined together in a single API response. ¤ Entity extraction: automatic tagging identifies major topics and entities within article text. ¤ Fix any issues realtime with the API Toolkit. ¤ Bulk API allows the extraction of hundreds to hundreds-of-thousands of pages. ¤ Access Crawlbot and Bulk job data in full JSON or CSV formats. ¤ Optionally crawl using a diverse array of IP addresses.