Diffbot
Diffbot provides a suite of products to turn unstructured data from across the web into structured, contextual databases. Our products are built off of cutting-edge machine vision and natural language processing software that's able to parse billions of web pages every day.
Our Knowledge Graph product is the world's largest contextual database comprised of over 10 billion entities including organizations, people, products, articles, and more. Knowledge Graph's innovative scraping and fact parsing technologies link up entities into contextual databases, incorporating over 1 trillion "facts" from across the web in nearly live time.
Our Enhance product provides information about organizations and people you already hold some information on. Enhance let's users build robust data profiles about opportunities they already hold some data on.
Our Extraction APIs can be pointed to a page you want data extracted from. This can be product, people, article, organization page, or more.
Learn more
Microlink
Microlink is a fast, scalable, and reliable high-level API that controls a headless browser as a service, turning any web page into structured data, screenshots, PDFs, metadata, link previews, and performance metrics. It offers dedicated endpoints for metadata extraction, full-page and element screenshots, PDF generation, SDK-powered link previews, Lighthouse-based performance insights, and favicon retrieval, all accessible via a simple, declarative RESTful interface with interactive documentation. Built on optimized, serverless infrastructure and a global CDN of over 240 edge nodes, Microlink delivers consistent 99.9% uptime, built-in caching, request isolation, and automated proxy resolution without shared browser instances. Customizable features include configurable time-to-live, custom HTTP headers, and seamless scaling from free trials to millions of requests per month. Security compliance is ensured through isolated browser sessions per request.
Learn more
Decodo
Decodo (formerly Smartproxy) offers advanced proxy infrastructure and web scraping solutions to streamline web data collection for businesses and developers. With over 125 million ethically sourced IP addresses (residential, mobile, datacenter, and static residential proxies), Decodo helps users efficiently bypass geo-restrictions, CAPTCHAs, and other web access barriers. Decodo's intuitive APIs enable effortless, structured data scraping from websites, eCommerce platforms, search engines, and social media, supporting outputs in HTML, JSON, and CSV formats. The platform includes the Universal Scraper for easy real-time data extraction and an upcoming AI-powered Parser to minimize tedious manual data processing. Ideal for price aggregation, SEO monitoring, ad verification, multi-account management, AI training, and private browsing. Decodo also offers comprehensive documentation, responsive support, and transparent policies, including a 3-day trial and clear refund guidelines.
Learn more
OpenGraphr
We have prepared this API with the most advanced scraping techniques so that you can focus on your product while we handle the open graph data scraping. Our scraping engine uses Chromium under the hood, so it's also prepared to scrape JavaScript-based websites without hassle. We frequently improve our scraping algorithms so that you only worry about your business. Powered by Chromium under the hood, we support the extraction of OG tags of JS-powered websites (i.e. Angular, VueJS, React) Most websites will not be prepared for the Open Graph protocol, but we are smart enough to extract the information even in those cases. We work hard on making our scraper undetectable by using proxies and other evasion techniques. We are integrated with TailGraph and we can generate open graph images when the website does not comply with the OG protocol. We have a free-forever plan with 100 requests each month, no card required.
Learn more