OpenAI’s latest natural language processing (NLP) model, GPT-3, is an astonishing feat. The tool can generate poems, short stories, songs, and technical specs that pass as human creations.
But as cool as it is, GPT-3 doesn’t actually understand what it’s creating. AI needs to demonstrate a deeper level of comprehension to gain our trust.
Enter Diffbot
To address this issue, the machine-learning company Diffbot is building an AI that reads every page on the entire public web, in multiple languages, extracting as many facts as it can.
Rather than using this info to train a language model like GPT-3, Diffbot turns it into a series of 3-part factoids that relate one thing to another: subject, verb, and object.
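To make the idea concrete, here’s a minimal sketch (in Python, and not Diffbot’s actual code or schema) of how facts stored as subject-verb-object triples might be represented and queried. The relation names and query helper are illustrative only, drawn from facts mentioned in this article:

```python
from collections import namedtuple

# A hypothetical 3-part factoid: subject, verb (relation), object.
Fact = namedtuple("Fact", ["subject", "verb", "obj"])

# A tiny, illustrative "knowledge graph" built from such triples.
facts = [
    Fact("Diffbot", "was_founded_by", "Mike Tung"),
    Fact("GPT-3", "was_created_by", "OpenAI"),
    Fact("DuckDuckGo", "uses", "Diffbot"),
]

def query(subject=None, verb=None, obj=None):
    """Return every fact matching whichever fields were specified."""
    return [
        f for f in facts
        if (subject is None or f.subject == subject)
        and (verb is None or f.verb == verb)
        and (obj is None or f.obj == obj)
    ]

# Example: everything this toy graph "knows" about Diffbot.
print(query(subject="Diffbot"))
```

Because every fact is a discrete triple rather than text buried in a statistical model, it can be inspected, corrected, and traced back to a source.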
This approach creates a more accurate knowledge graph
Beyond the subject-verb-object paradigm, Diffbot founder Mike Tung tells The Hustle that his startup is building an AI system that consumes information the way humans do.
Among other parameters, it weighs: 1) trustworthiness (“Did this come from an official source, or social media?”) and 2) recency (“Is this information stale?”), and it shows where on the web each fact it surfaces came from.
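Here’s a hedged sketch of how that kind of provenance metadata might be attached to each factoid. The field names, placeholder URL, and staleness rule are assumptions for illustration, not Diffbot’s actual schema:

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class SourcedFact:
    """A subject-verb-object factoid plus illustrative provenance fields."""
    subject: str
    verb: str
    obj: str
    source_url: str           # where on the web the fact was extracted
    is_official_source: bool  # e.g. company site vs. social media
    last_seen: date           # used to judge whether the fact is stale

fact = SourcedFact(
    subject="GPT-3",
    verb="was_created_by",
    obj="OpenAI",
    source_url="https://example.com/article",  # placeholder, not a real source
    is_official_source=True,
    last_seen=date(2020, 9, 1),
)

def is_stale(f: SourcedFact, today: date) -> bool:
    """Crude staleness check: flag facts not re-confirmed in the past year."""
    return (today - f.last_seen).days > 365
```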
The startup already has ~400 paying customers
Diffbot is the only US company (aside from Google and Microsoft) crawling the entire web, and the knowledge graph it’s building is being deployed across various industries:
- DuckDuckGo uses it to create Google-like answer boxes
- Snapchat uses it to extract highlights from news pages
- Adidas and Nike use it to find counterfeits
What’s next for Diffbot?
Making it easy to pull information from its knowledge graph into popular business tools like Excel, Google Sheets, and Salesforce.