Reddit sues Perplexity over alleged data scraping for AI training

The lawsuit also accuses Oxylabs, AWMProxy, and SerpApi of helping Perplexity collect data by hiding their identities and mimicking normal user behaviour.

October 23, 2025

Reddit’s lawsuit is part of a broader industry dispute over the use of publicly accessible content to train large language models. Credit: gguy/Shutterstock.com.

Reddit has initiated legal proceedings against AI company Perplexity in New York federal court, citing unauthorised scraping and use of user-generated content from its platform for AI model training.

The lawsuit also names Oxylabs, AWMProxy, and SerpApi, alleging that these companies aided Perplexity’s data collection by concealing their identities and using techniques designed to imitate typical user behaviour.

Access deeper industry intelligence

Experience unmatched clarity with a single platform that combines unique data, AI, and human expertise.

Find out more

Perplexity, which has built an AI-driven search service, has rejected the allegations.

In a statement posted on Reddit, the AI company asserted it does not train models on the social media platform’s content but instead provides summaries and citations of public discussions. The company added that “it is ‘impossible’ to sign a licence agreement” for this reason.

The statement further read: “A year ago, after explaining this, Reddit insisted we pay anyway, despite lawfully accessing Reddit data. Bowing to strong arm tactics just isn’t how we do business.”

Perplexity characterised the lawsuit as “a show of force in Reddit’s training data negotiations with Google and OpenAI.”

SerpApi stated to CNBC that it “strongly disagrees” with Reddit’s claims and will defend itself in court. CNBC did not receive responses from Oxylabs or AWMProxy.

Reddit’s legal action is part of a wider industry conflict regarding the use of publicly available content in training large language models.

The company previously filed a similar suit against Anthropic in June 2025. According to the complaint, Perplexity increased its referencing of Reddit content by 40 times following receipt of a cease-and-desist letter from the latter.

Reddit claims posts from its platform are a frequent source for AI-generated responses on Perplexity’s service.

With more than 100,000 communities, Reddit is a major source of publicly available user conversations.

Researchers have previously noted that Reddit’s volume and moderation provide a valuable training dataset for generating more conversational AI outputs.

The social media company has pursued data licencing strategies with enterprises such as OpenAI and Alphabet, restricting AI-related access to its data to those who have paid for specific agreements.

Earlier in 2025, Reddit COO Jen Wong told Adweek that AI licencing deals with Google and OpenAI account for close to 10% of the firm’s revenue.

In a statement provided to CNBC, Reddit chief legal officer Ben Lee said: “AI companies are locked in an arms race for quality human content,” describing the process as fuelling an “industrial-scale ‘data laundering’ economy.”

Reddit sues Perplexity over alleged data scraping for AI training

Go deeper with GlobalData

ChatGPT Trailblazers - How Startups Democratize Generative Artificial Intelligence (AI)

Innovation in Artificial Intelligence: AI-assisted OCR

Data Insights

Access deeper industry intelligence

ChatGPT Trailblazers - How Startups Democratize Generative Artificial Intelligence (AI)

Innovation in Artificial Intelligence: AI-assisted OCR

Go deeper with GlobalData

Carriers swap discounts for targeted promos to boost ARPU, convergence, and retention

Uber taps Amazon custom chips to support computing and AI training

Intel to join Elon Musk’s Terafab AI chip initiative

Salesforce nails collaboration with new Slackbot features but security is a concern

Sign up for our daily news round-up!

Sign up to the newsletter: In Brief

Go deeper with GlobalData

Data Insights

Access deeper industry intelligence

Sign up for our daily news round-up!

Give your business an edge with our leading industry insights.

Go deeper with GlobalData

Go deeper with GlobalData

Access deeper industry intelligence

Sign up for our daily news round-up!

Sign up to the newsletter: In Brief

I would also like to subscribe to:

Thank you for subscribing