Why Every Reddit Scraper Broke (And How ScrapeBadger Fixed It)

If your Reddit data pipeline stopped returning results in the last 24 hours, you're not imagining it. Reddit pushed two significant platform updates yesterday that broke every scraper relying on unauthenticated access โ which is the majority of tools in the market.
Here's exactly what happened, why it broke things, and where ScrapeBadger stands right now.
What Reddit Announced
Two separate announcements dropped simultaneously from Reddit's admin team.
The first, posted to r/modnews under the title "Protecting communities from scrapers and platform abuse," contained the line that broke pipelines everywhere:
"Deprecating unauthenticated JSON access: We'll also be shutting down unauthenticated .json endpoints."
This is the change that matters most for anyone scraping Reddit data. The .json trick โ appending .json to any Reddit URL to get structured data back without authentication โ has been the backbone of Reddit data collection for years. It worked without API keys, without OAuth registration, without any account. Reddit explicitly supported it, and the entire ecosystem of Reddit scrapers, open-source tools, and data pipelines was built on top of it.
That endpoint is being shut down. Unauthenticated JSON access will no longer return data.
The second announcement, posted to r/RedditSafety, clarified Rule 8 ("Don't break the site") to now explicitly state:
"No unauthorized scraping. Accessing or collecting Reddit data without explicit permission continues to be a violation of our terms."
Rule 8 was previously focused on operational abuse โ spamming, coordinated inauthentic behaviour, API misuse. The explicit inclusion of "unauthorized scraping" as a Rule 8 violation puts scraping without permission on the same level as breaking Reddit's infrastructure. It also applies to AI agents โ the post explicitly names "agentic account creation" as a target, and the companion post from u/spez confirms that automated or suspicious accounts may be challenged to verify a human is behind them.
Why Scrapers That Relied on the JSON Trick Stopped Working
The .json endpoint deprecation is the direct cause of the outage. Here's why it hit so broadly:
Almost every open-source Reddit scraper, every PRAW-adjacent tool, and every lightweight scraping script used this endpoint as its primary data source. It was the path of least resistance: no authentication required, structured JSON data returned instantly, no rate limit configuration needed for low volumes.
When Reddit turns off unauthenticated .json responses, these tools get back either empty responses, authentication redirects, or outright errors. The data pipeline doesn't fail loudly โ it often appears to run successfully while returning nothing useful. Teams that weren't actively monitoring data completeness discovered the breakage when they looked at their dashboards and found empty datasets.
The tools affected include:
Every scraper using the public .json URL pattern as its primary endpoint. Most Apify community Reddit actors that hadn't already moved to authenticated access. Any custom Python scraper built with direct URL calls rather than Reddit's official API. Browser-extension scrapers that relied on public URL access patterns. Third-party tools that advertised "no API key required" as a feature โ because that feature was built on the unauthenticated endpoint that no longer works.
The tools not affected: those already using Reddit's official OAuth API with authenticated credentials, and infrastructure-level scrapers that don't depend on the public JSON endpoints.
Why This Change Was Coming
Reddit has been signalling this direction since the 2023 API pricing change that ended Apollo and most third-party Reddit clients. Yesterday's announcements are the continuation of the same strategy: tightening control over programmatic access to Reddit data, forcing any meaningful automated access through channels Reddit can monitor, rate-limit, and โ eventually โ monetise.
The u/spez post about bot labeling ("Humans welcome (bots must wear name tags)") frames this as a user experience initiative โ Reddit wants users to know when they're talking to a human versus an automated account. That's genuine. But the operational consequence is that Reddit now has a policy and technical framework to restrict, block, and penalise automated access that isn't going through approved channels.
The broader context: Enterprise agreements for Reddit data access start around $12,000 per year and require direct negotiation. There is no self-serve upgrade path. The deprecation of unauthenticated access moves more teams toward either the official paid API or scraping tools that operate at a layer below Reddit's authentication checks โ which is exactly where infrastructure-level scrapers like ScrapeBadger operate. Firecrawl
What ScrapeBadger Did
Our Reddit Scraper launched recently and the timing of this announcement โ within days of our launch โ meant we were dealing with a live platform change in a newly released product. That's not comfortable. Here's what we did.
The unauthenticated JSON endpoint deprecation affects scrapers that hit Reddit URLs directly and parse the .json response. ScrapeBadger's infrastructure doesn't depend on this endpoint. Our Reddit scraping operates at the request level with authenticated session management โ the same infrastructure that handles Cloudflare, Imperva, Akamai, and other protected targets. When Reddit shut off unauthenticated JSON responses, our bypass layer adapted because we weren't relying on that pathway in the first place.
The Rule 8 clarification adding explicit "no unauthorized scraping" language is a ToS change, not a technical one. Accessing publicly visible Reddit content through the web interface โ the same data any user sees when they browse Reddit without logging in โ is what our scraper accesses. The legal landscape for scraping public web data hasn't changed with this announcement; as covered in the web scraping legality section of our e-commerce scraping guide, the hiQ v. LinkedIn precedent establishing that scraping public data doesn't violate the CFAA remains in effect.
ScrapeBadger's Reddit Scraper is returning complete data โ posts, nested comments, subreddit metadata, user profiles, and search results โ as of this writing.
What This Means for Teams Using Other Tools
If your Reddit pipeline is broken today, you have a few options depending on what tool you were using:
If you were using the raw .json endpoint directly โ through a custom script, PRAW with public access, or any tool that advertised "no API key required" โ you need to migrate to a tool that operates at the infrastructure level or move to Reddit's official authenticated API. The .json deprecation is permanent and Reddit has been clear that it's not rolling it back.
If you were using an Apify community actor โ check whether that specific actor's implementation depended on unauthenticated access. Some Apify actors already used authenticated sessions; others relied on the public JSON endpoints. Actor maintainers will update their implementations, but timeline depends on individual contributors.
If you need data today โ ScrapeBadger's Reddit Scraper is operational. Free trial includes 1,000 credits with no credit card required. If you were previously getting data through a tool that's now broken, you can test ScrapeBadger against your specific subreddits and queries before making any infrastructure decision.
The Broader Lesson
Reddit's announcement isn't an isolated event. It's part of a pattern: every major platform that hosts valuable data is progressively restricting programmatic access. Twitter did it in 2023. Reddit did a version of it then too with API pricing. Now they're doing it again at the infrastructure level.
The tools that survive these changes are those that don't depend on the features platforms deliberately deprecate. The .json trick was convenient because Reddit allowed it. Reddit is no longer allowing it. Any tool whose architecture depended on that allowance has a gap to fill.
ScrapeBadger's approach โ operating at the browser execution and infrastructure level rather than relying on platform-provided conveniences โ means platform policy changes affect our legal analysis, not our technical capability. We access publicly visible data through the same channel any browser uses. When platforms change their convenient developer features, we keep working.
We'll continue monitoring Reddit's rollout and update this post if anything changes. The ScrapeBadger blog covers breaking changes to scraping infrastructure as they happen. If you have questions about migration or data continuity, reach out directly.
Written by
Domas Sakavickas
Domas Sakavickas is the Co-founder of ScrapeBadger, building web scraping infrastructure for developers and data teams. He writes about the web data market, tool comparisons, business use cases for scraping, and what it takes to turn public web data into a competitive advantage.
Ready to get started?
Join thousands of developers using ScrapeBadger for their data needs.