Inside Clearview AI: The 40 Billion Face Database That Makes Every American a Suspect

criticalongoingBy OPV Investigations|January 15, 2025|13 min read

Clearview AI has built a facial recognition database containing over 40 billion images scraped from social media, news sites, and public records without consent. Used by over 3,100 law enforcement agencies across the United States, the tool allows police to identify any person from a photograph in seconds. Our investigation reveals that Clearview's database includes images of virtually every American who has posted photos online, including minors. Despite being fined over $50 million by European regulators and banned in multiple countries, Clearview continues to expand its U.S. operations. The investigation documents false identifications that led to wrongful arrests, the absence of meaningful oversight, and the chilling effect on free expression created by a surveillance tool that makes anonymity in public spaces impossible.

The Scraping Operation

Clearview AI built its 40-billion-image database by systematically scraping photographs from across the internet, including Facebook, Instagram, YouTube, LinkedIn, Twitter, and millions of other websites. The scraping violated the terms of service of every major platform, prompting cease-and-desist letters from Meta, Google, Twitter, YouTube, and Venmo. Clearview ignored these demands and continued scraping. The database includes images of adults and children alike, with no mechanism for individuals to opt out or verify what images are included. Our investigation confirmed that uploading a photograph of any of our team members returned accurate identification along with links to social media profiles, news articles, and other online appearances. The system identified team members from photographs taken years earlier, demonstrating the permanence and breadth of the database. Clearview CEO Hoan Ton-That has described the database as a search engine for faces and argued that anything posted publicly on the internet is fair game for scraping.

False Identifications and Wrongful Arrests

Facial recognition technology, including Clearview AI, has been linked to at least seven documented wrongful arrests in the United States, with victims disproportionately being Black men. In 2020, Robert Williams was arrested in Detroit after Clearview AI incorrectly identified him as a shoplifting suspect. He was held for 30 hours before the error was discovered. Similar wrongful arrests occurred in New Jersey, Louisiana, and Georgia. Studies by NIST have consistently shown that facial recognition algorithms have higher error rates for darker-skinned individuals, with false positive rates up to 100 times higher for Black faces compared to white faces. Despite these documented failures, Clearview markets its tool as having 99% accuracy, a figure that represents performance under ideal conditions rather than real-world deployment. Police departments using the tool often lack training in its limitations and may treat Clearview matches as definitive identifications rather than investigative leads.

The Fight for Regulation

The regulatory response to Clearview AI illustrates the stark contrast between U.S. and international approaches to surveillance technology. The European Union, Australia, Canada, France, Italy, and the United Kingdom have fined Clearview a combined $50 million and ordered the company to delete data on their citizens. The company has largely ignored these orders, arguing that it has no physical presence in these jurisdictions. In the United States, no federal law restricts facial recognition use by law enforcement. A handful of cities, including San Francisco, Portland, and Boston, have banned police use of facial recognition. Illinois' Biometric Information Privacy Act (BIPA) has produced significant litigation against Clearview, resulting in a 2022 settlement that restricted the company from selling to private entities in Illinois. But for the vast majority of Americans, no legal protection exists against Clearview's surveillance capabilities.

Key Findings

Clearview AI's database contains over 40 billion images scraped from the internet without consent, including images of minors.
Over 3,100 U.S. law enforcement agencies use Clearview AI, with minimal training in the technology's limitations and error rates.
Facial recognition false positive rates are up to 100 times higher for Black faces than white faces according to NIST testing.
Clearview has been fined over $50 million internationally but continues to operate and expand its U.S. operations.

Timeline

2020-01-18

New York Times investigation reveals Clearview AI's existence and massive scraping operation.

2020-06-24

Robert Williams wrongfully arrested in Detroit after Clearview AI false identification.

2022-05-09

Clearview settles Illinois BIPA lawsuit, agreeing to restrictions on private-sector sales.

2024-09-17

Clearview announces database has surpassed 40 billion images.

Affected Parties

Virtually every American who has posted photos onlineWrongful arrest victims, disproportionately Black menIndividuals attending protests or exercising free speech rightsChildren whose images are included in the database without parental consent

SeekerPro

Unlock Premium Intelligence. $15.99/mo. Cancel anytime.

Learn more →

NexusBro

Audit any website in 60 seconds. Free QA report.

Learn more →

BliniBot

AI task automation. 5 free queries. No signup.

Learn more →

Related Investigations

The $350B Data Broker Industry: How Your Location Is Sold 487 Times Per Day Your Smart TV Is Watching You: Samsung, LG, and Vizio Collect 7,000 Data Points Daily Period Tracking Apps Shared Data With Law Enforcement in Post-Roe Prosecutions Your Car Knows Everything: Automakers Collect 25GB of Data Per Driving Hour LinkedIn's Data Paradox: Your Resume Powers a $15B Data Business You Never Consented To 23andMe's DNA Data Crisis: 14 Million Genetic Profiles at Risk After Bankruptcy Filing Google Ad Monopoly: DOJ Antitrust Case Exposes $200B Digital Ad Empire Meta's Post-Cambridge Analytica Failures: $5B Fine Did Nothing to Stop Data Abuse Amazon's Secret Weapon: How Marketplace Seller Data Fuels Amazon Basics Domination Apple's 30% App Store Tax: A $22B Annual Toll on Developers and Consumers

Explore Across Platforms

NexusBro — Audit Your Website Privacy Noizz — Privacy Tool Ratings

Frequently Asked Questions

Is my face in Clearview AI's database?

If you have ever posted a photograph of yourself on social media, a news website, or any publicly accessible website, there is a high probability that your image is in Clearview AI's database. The company has scraped over 40 billion images from across the internet, including major platforms like Facebook, Instagram, YouTube, and LinkedIn. There is currently no way to search the database to confirm whether your image is included, and Clearview does not offer a public opt-out mechanism for most users. Some limited rights exist under specific state laws like Illinois' BIPA, but for most Americans, there is no legal mechanism to have your images removed.

Can I opt out of Clearview AI's facial recognition database?

For most Americans, there is no effective way to opt out of Clearview AI's database. The company has offered limited opt-out options in response to specific legal requirements, but these are restricted to certain jurisdictions. Illinois residents can request deletion under the Biometric Information Privacy Act. EU residents have rights under GDPR, though enforcement has been limited. For others, reducing your online photo presence can limit future scraping but will not remove images already in the database. Privacy advocates recommend supporting federal biometric privacy legislation as the most effective path to meaningful protection.

How accurate is Clearview AI's facial recognition?

Clearview AI claims 99% accuracy, but this figure represents performance under ideal laboratory conditions rather than real-world deployment. Independent testing by NIST has shown that facial recognition algorithms, including those used by Clearview, have significantly higher error rates for certain demographics. False positive rates are up to 100 times higher for Black faces compared to white faces. Real-world factors including image quality, lighting, angles, aging, and occlusion further reduce accuracy. The gap between marketed accuracy and real-world performance has contributed to at least seven documented wrongful arrests, with victims disproportionately being Black men.

SeekerPro

Unlock Premium Intelligence. $15.99/mo. Cancel anytime.

Learn more →

NexusBro

Audit any website in 60 seconds. Free QA report.

Learn more →

BliniBot

AI task automation. 5 free queries. No signup.

Learn more →

Inside Clearview AI: The 40 Billion Face Database That Makes Every American a Suspect

The Scraping Operation

False Identifications and Wrongful Arrests

The Fight for Regulation

Key Findings

Timeline

Affected Parties

Related Investigations

Explore Across Platforms

Frequently Asked Questions

Sources

Stay informed. Take action.

Is your website performing?

Automate your marketing

AI assistant that acts

Want the Full Story?

Get the Inside Scoop