How to Scrape Zillow, Redfin & Rightmove Without Getting Blocked (Best APIs Compared)

Thomas Shultz
18 min read

You try to scrape Zillow with Python requests, and you get a 403 Forbidden error in under a second. You switch to Redfin, and it works perfectly for 50 requests before going completely silent. You try Rightmove, and you immediately hit an impenetrable Cloudflare wall.

If you have ever tried to extract property data, you already know the truth: real estate is one of the most aggressively protected categories on the web. The reason is simple. Real estate data is worth billions of dollars. Platforms like Zillow, Redfin, and Rightmove have built their businesses on proprietary datasets, and they deploy enterprise-grade anti-bot systems so that competitors, aggregators, and third-party developers cannot extract that data for free.

This is the complete guide to scraping real estate data without getting blocked. We will cover exactly why these platforms are so difficult to scrape, what specific anti-bot systems they use to detect you, and which scraping APIs actually work in 2026 at scale.

We have scraped millions of real estate listings across 12 countries. Here is what we have learned about bypassing the hardest protections on the web.

⚡ TL;DR: If you just want the answer: ScrapeBadger is the fastest, most reliable API for scraping real estate data at scale. It automatically handles Imperva, Cloudflare, and PerimeterX. Start your free trial here.

Why Real Estate Websites Are So Hard to Scrape

Real estate data is proprietary and commercially sensitive. Zillow, Redfin, and Rightmove do not just display data; they monetize it through lead generation, advertising, and proprietary valuation models like the Zestimate. They do not want competitors, aggregators, or third parties extracting their core asset for free. To protect this data, they invest millions annually in enterprise-grade web application firewalls (WAFs) and anti-bot systems.

The Anti-Bot Systems Each Platform Uses

Different platforms rely on different security vendors. Understanding what you are up against is the first step to bypassing it.

| Platform | Anti-Bot System | Difficulty |
| --- | --- | --- |
| Zillow | Imperva Incapsula | 🔴 Very Hard |
| Redfin | Custom rate limiting + JS | 🟡 Medium |
| Rightmove | Cloudflare | 🟠 Hard |
| Realtor.com | PerimeterX (HUMAN) | 🔴 Very Hard |
| Zoopla | Cloudflare + custom | 🟠 Hard |
| Trulia | Imperva (shares Zillow infra) | 🔴 Very Hard |
| Idealista | Cloudflare | 🟠 Hard |

What Exactly Gets You Blocked

When you send a request to a protected real estate site, the anti-bot system evaluates your request across three distinct detection layers. Failing any single layer results in an immediate block or a CAPTCHA challenge.

IP Reputation: The most basic check is the origin of your IP address. Datacenter IPs (like those from AWS, DigitalOcean, or Google Cloud) are blacklisted within seconds on Zillow and Trulia. To pass this layer, you must route your requests through high-quality residential IPs that look like real home internet connections.

TLS and Browser Fingerprinting: Modern anti-bot systems like Imperva check the JA3/JA4 TLS signatures of your request. A standard Python requests call has a completely different cryptographic fingerprint than a real Google Chrome browser. If your TLS fingerprint does not match the User-Agent header you are sending, you are instantly flagged as a bot.

Behavioral Analysis: Even if you have a residential IP and a perfect browser fingerprint, you can still be blocked based on your behavior. Anti-bot systems analyze request rates, scroll patterns, mouse movements, and the timing between requests. Bots move too uniformly and too fast. If you request 100 pages at exactly 1.5-second intervals, the system will recognize the non-human pattern and block the session.
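To make the TLS fingerprinting layer more concrete, here is a minimal sketch of how a JA3 hash is derived: five ClientHello fields are joined into one string and hashed with MD5. The numeric values below are made up for illustration and are not captured from any real browser or HTTP library, and the sketch skips details such as dropping GREASE values.

```python
import hashlib


def ja3_fingerprint(tls_version, ciphers, extensions, curves, point_formats):
    """Build a JA3 string and its MD5 digest from ClientHello fields.

    Each list is joined with "-" and the five fields are joined with ",".
    """
    fields = [
        str(tls_version),
        "-".join(str(c) for c in ciphers),
        "-".join(str(e) for e in extensions),
        "-".join(str(g) for g in curves),
        "-".join(str(p) for p in point_formats),
    ]
    ja3_string = ",".join(fields)
    digest = hashlib.md5(ja3_string.encode("ascii")).hexdigest()
    return ja3_string, digest


# Two hypothetical clients: same TLS version, different cipher order
# and extension lists. The values are illustrative only.
client_a = ja3_fingerprint(771, [4865, 4866, 4867], [0, 23, 65281, 10, 11], [29, 23, 24], [0])
client_b = ja3_fingerprint(771, [4866, 4867, 4865], [0, 11, 10, 35], [29, 23, 24, 25], [0])

# Even small differences in the handshake produce unrelated hashes.
print(client_a[1])
print(client_b[1])
```

Because the hash depends on the order and content of the handshake, changing the User-Agent header alone does nothing to it, which is why a mismatch between the two is such an easy signal to detect.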

Why DIY Solutions Break

Rolling your own scraper for real estate sites is a recipe for endless frustration. DIY solutions fail at scale because residential proxy costs spiral out of control quickly. Furthermore, browser fingerprints go stale as Chrome updates, and sites update their anti-bot rules weekly. Maintaining the bypass logic required to scrape Zillow or Realtor.com is a full-time engineering job. This is why serious data teams abandon DIY scripts and move to purpose-built scraping APIs.

Platform-by-Platform Breakdown: What You're Up Against

Every real estate platform has a unique architecture and a different security posture. Here is exactly what you need to know to scrape the major players.

How to Scrape Zillow (Without Getting Blocked)

Zillow is the largest real estate platform in the US, and it is notoriously difficult to scrape.

Data available: Listing price, Zestimate, price history, beds/baths, square footage, days on market, agent info, and high-resolution photos.

Protection: Zillow uses Imperva Incapsula. This system deploys aggressive JavaScript challenges, behavioral fingerprinting, and strict IP blacklisting.

Key technical detail: Zillow embeds its core listing data in a __NEXT_DATA__ JSON blob directly within the page HTML. If you can successfully retrieve the raw HTML without triggering a block, parsing the data is incredibly straightforward because it is already structured as JSON.

What you need to get through: You absolutely must use premium residential proxies. Furthermore, you need a real browser fingerprint (including proper TLS and HTTP/2 negotiation) and robust session management to handle cookies correctly.

Without an API: Scraping Zillow manually is possible but highly fragile. It requires near-constant maintenance as Imperva updates its detection models.

With ScrapeBadger: You send one API request and receive clean JSON back, with no configuration required — it just works.
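To illustrate the __NEXT_DATA__ detail, here is a minimal sketch of pulling that JSON blob out of a page once you already have the HTML. The sample HTML and the `listing` key are hypothetical; the real key layout inside the blob will differ and can change without notice.

```python
import json
import re

# Matches the contents of <script id="__NEXT_DATA__" ...> ... </script>.
NEXT_DATA_RE = re.compile(
    r'<script[^>]*\bid="__NEXT_DATA__"[^>]*>(.*?)</script>',
    re.DOTALL,
)


def extract_next_data(html):
    """Return the parsed __NEXT_DATA__ JSON, or None if the tag is absent."""
    match = NEXT_DATA_RE.search(html)
    if match is None:
        return None
    return json.loads(match.group(1))


# Made-up page for demonstration. The "listing" structure is an assumption.
sample_html = """<html><body>
<script id="__NEXT_DATA__" type="application/json">
{"props": {"pageProps": {"listing": {"price": 450000, "beds": 3}}}}
</script>
</body></html>"""

data = extract_next_data(sample_html)
listing = data["props"]["pageProps"]["listing"]
print(listing["price"], listing["beds"])
```

Returning None when the tag is missing is deliberate: a block page usually has no __NEXT_DATA__ script at all, so a None result is a cheap way to detect that you were served a challenge instead of a listing.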

How to Scrape Redfin

Redfin is generally considered slightly easier to scrape than Zillow, but it still requires careful handling at scale.

Data available: Listing price, Redfin Estimate, days on market, price history, HOA fees, lot size, and school ratings.

Protection: Redfin's protection is less aggressive than Zillow's. They rely heavily on custom rate limiting and JavaScript challenges rather than a strict enterprise WAF. Interestingly, Redfin has a semi-accessible internal API (/stingray/api/gis), but it rate-limits very quickly if you attempt to pull data at scale.

Key technical detail: When you hit Redfin's internal endpoints, they return JSON prefixed with {}&&. This is a security measure to prevent JSON hijacking, but it is easy to strip away and parse once you are past the rate limiting.

What you need: The key to Redfin is session rotation combined with proxy rotation to stay under their strict rate limit thresholds.

With ScrapeBadger: The API automatically rotates sessions and proxies to keep your Redfin extraction running smoothly without hitting rate limits.
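Stripping the JSON-hijacking guard is a one-liner in practice. The sketch below assumes the response body starts with `{}` followed by one or more ampersands and removes that prefix before parsing. The sample payload is invented for the example and does not reflect Redfin's actual response schema.

```python
import json


def parse_guarded_json(body):
    """Parse JSON that may be prefixed with a '{}&&'-style guard."""
    text = body.lstrip()
    # Only strip when the empty object is followed by at least one '&',
    # so a genuine empty-object response still parses as {}.
    if text.startswith("{}&"):
        text = text[2:].lstrip("&")
    return json.loads(text)


# Hypothetical response body, for illustration only.
raw_body = '{}&&{"version": 1, "payload": {"homes": [{"price": 525000}]}}'

parsed = parse_guarded_json(raw_body)
print(parsed["payload"]["homes"][0]["price"])
```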

How to Scrape Rightmove

Rightmove is the undisputed leader in the UK property market, making its data highly coveted.

Data available: Listing price, property type, bedrooms, tenure, agent details, photos, floor plans, and EPC ratings.

Protection: Rightmove is protected by Cloudflare. They deploy JavaScript challenges, browser fingerprint checks, and aggressive CAPTCHAs on any session deemed suspicious.

Key technical detail: Rightmove is the UK's most visited property portal, but it offers no official public API for its listing data. You must scrape the frontend.

What you need: You need Cloudflare-capable proxies, browser-level rendering to execute the JS challenges, and proper header management.

With ScrapeBadger: ScrapeBadger's built-in Cloudflare bypass handles the JS challenges automatically, returning the raw property data instantly.

How to Scrape Realtor.com

Realtor.com is arguably the hardest target on this list.

Protection: It uses PerimeterX (now HUMAN Security), which is one of the most sophisticated anti-bot systems in existence. It relies heavily on behavioral biometric analysis.

Data: It pulls directly from the MLS, meaning it often has the most accurate and up-to-date active listing data in the US.

Requirements: Scraping Realtor.com requires full browser-level automation with advanced stealth patching to evade PerimeterX's biometric checks.

How to Scrape Zoopla & Rightmove (UK Market)

Zoopla: Zoopla's public API was retired, and access now requires a strict commercial agreement. They use Cloudflare combined with custom session validation.

Strategy: For comprehensive UK market coverage, scrape Zoopla alongside Rightmove rather than relying on either portal alone.

Other Key Platforms Worth Scraping

Trulia: Trulia shares Zillow's infrastructure and Imperva protection. If you build a scraper for Zillow, you can generally scrape Trulia simultaneously using the same bypass techniques.

Idealista: The dominant portal in Spain, Italy, and Portugal. It is Cloudflare-protected and offers a gated partner API that is difficult to access.

OnTheMarket: The UK's third-largest portal. It has lighter protection than Rightmove and is generally easier to scrape.

Apartments.com / CoStar: The giant of the US rental market. They offer a private partner API only, meaning scraping is absolutely necessary to access public rental data at scale.

The Best APIs for Scraping Real Estate Websites in 2026

Rolling your own scraper for Zillow or Rightmove isn't a weekend project — it's a month of engineering, then a month of maintenance. These APIs do it for you. Here's how they actually stack up.

1. ScrapeBadger — Best for Real Estate Scraping Overall ⭐

ScrapeBadger is a purpose-built scraping API with dedicated real estate endpoints — not a generic wrapper. It handles Imperva, Cloudflare, PerimeterX, and DataDome automatically, which are the four systems that protect the major real estate platforms.

The API routes requests through a massive residential proxy pool with geo-targeting, which is critical for Zillow because it serves different data based on location. It returns clean, structured JSON containing addresses, prices, beds/baths, square footage, price history, and agent info, meaning no parsing logic is required on your end.

ScrapeBadger uses real browser fingerprinting (TLS + HTTP/2 + behavioral simulation) to pass Imperva's most aggressive challenges. It supports both synchronous and async modes, allowing you to get real-time data or run large batch jobs.

Coverage: All major US platforms (Zillow, Redfin, Realtor.com, Trulia, Apartments.com) + UK (Rightmove, Zoopla, OnTheMarket) + EU (Idealista, Immowelt).

Pricing: Transparent pricing with no credit multiplier surprises. View pricing here.

Support: Fast support and clear documentation.

| ✅ Pros | ❌ Cons |
| --- | --- |
| Purpose-built for real estate | Newer platform compared to legacy providers |
| Automatic anti-bot bypass | Advanced AI extraction costs extra credits |
| Clean JSON output | |
| Transparent pricing | |
| Geo-targeted residential proxies | |

Best for: Developers, prop-tech companies, investment analysts, price monitoring tools, and listing aggregators.

2. Bright Data — Enterprise Power, Enterprise Complexity

Bright Data boasts the world's largest proxy network (72M+ IPs across 195 countries). These residential IPs pass even Imperva's most aggressive fingerprint checks.

They have a 98.44% success rate in independent benchmarks, which is industry-leading. They even achieved 100% on Zillow in Scrape.do's 2026 test. They offer a dedicated Real Estate Scraper with pre-built extractors for Zillow, Redfin, Rightmove, and Zoopla, providing structured output out of the box. They are also GDPR/CCPA/ISO 27001 compliant, which is important for enterprise real estate data teams.

The catch is the complexity and cost. Their Web Scraper IDE starts at $499/month. Pay-per-record billing gets complex at scale, and there is a steep learning curve. They are commonly cited for billing confusion in user reviews.

| ✅ Pros | ❌ Cons |
| --- | --- |
| Massive proxy network | Very expensive starting price ($499/mo) |
| Industry-leading success rates | Complex billing structure |
| Pre-built extractors | Steep learning curve |

Best for: Large enterprises with dedicated data engineering teams and strict compliance requirements.

3. Oxylabs — Strong Infrastructure, Proxy-First Mindset

Oxylabs has a 100M+ IP pool across 195 countries and a Real Estate Scraper API with dedicated endpoints for Zillow, Redfin, and Zoopla.

They achieved an 85.82% success rate in Proxyway's 2026 benchmark, which is solid but slightly below Bright Data. Their OxyCopilot AI assistant generates scraping code from natural language prompts, which is useful for non-technical teams.

However, their pay-per-GB bandwidth pricing model (around $9.40/GB for Web Unblocker) is unpredictable for high-volume real estate scraping where pages are large. Furthermore, their scraping API is a secondary product built on top of their proxy infrastructure, meaning they can be slower to adapt when anti-bot systems update.
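A quick back-of-the-envelope calculation shows why per-GB billing is hard to budget. The $9.40/GB figure is the one quoted above; the page sizes and the monthly volume are assumptions chosen only to show how much the bill swings with page weight.

```python
def bandwidth_cost_usd(pages, avg_page_mb, price_per_gb):
    """Estimate spend when billing is per gigabyte of transferred data."""
    total_gb = pages * avg_page_mb / 1000  # decimal gigabytes
    return round(total_gb * price_per_gb, 2)


PRICE_PER_GB = 9.40  # figure quoted in the article
PAGES = 100_000      # hypothetical monthly volume

# Assumed average page weights: a lean listing page vs. a photo-heavy one.
light = bandwidth_cost_usd(PAGES, 1.5, PRICE_PER_GB)
heavy = bandwidth_cost_usd(PAGES, 4.0, PRICE_PER_GB)

print(f"1.5 MB pages: ${light:,.2f}")
print(f"4.0 MB pages: ${heavy:,.2f}")
```

Under these assumptions the same 100,000 pages cost about $1,410 at 1.5 MB each and about $3,760 at 4 MB each, so the bill depends on something you do not control: how heavy the target site makes its pages.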

| ✅ Pros | ❌ Cons |
| --- | --- |
| Huge IP pool | Unpredictable pay-per-GB pricing |
| AI code generation | Slower to adapt to anti-bot changes |
| Dedicated real estate endpoints | |

Best for: Technical teams already invested in the Oxylabs proxy infrastructure.

4. Scrapfly — Developer-Focused, High Success Rate

Scrapfly is a purpose-built scraping API (not just a proxy add-on) with a 98% success rate on protected sites per Scrapeway benchmarks.

Their ASP (Anti-Scraping Protection) parameter uses a single flag to handle Cloudflare, DataDome, PerimeterX, and Imperva automatically. They offer pre-built real estate data extraction models, so no custom selectors are needed.

It starts at $30/month, which is roughly 39% below the $49/month entry plan of a competitor like ScrapingBee. The main limitation is that it is a newer platform with a smaller community and documentation base than Bright Data or Oxylabs, and some advanced real estate-specific features are still maturing.

| ✅ Pros | ❌ Cons |
| --- | --- |
| High success rate (98%) | Smaller community/docs |
| Simple ASP parameter | Real estate features still maturing |
| Affordable starting price ($30/mo) | |

Best for: Developer teams who want high success rates with minimal configuration overhead.

5. ScrapingBee — Simple API, General Purpose

ScrapingBee offers a clean, easy-to-use API with good documentation that is fast to integrate.

They achieved an ~84% average success rate in Proxyway's 2026 benchmark. However, they have no dedicated real estate endpoint, meaning you must write the parsing logic yourself.

Their credit multiplier system is a significant drawback for real estate: stealth proxies cost 75 credits per request. A $49/month plan evaporates very fast when scraping Zillow or Rightmove.
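The effect of a credit multiplier is easy to work out. In the sketch below, the 75-credit cost and the $49 price come from the paragraph above, while the plan's credit allowance is a placeholder assumption. Check the provider's current plan for the real number.

```python
def requests_per_plan(plan_credits, credits_per_request):
    """Number of whole requests a credit allowance covers."""
    return plan_credits // credits_per_request


def cost_per_thousand(plan_price_usd, plan_credits, credits_per_request):
    """Effective price per 1,000 requests at a given credit multiplier."""
    requests = requests_per_plan(plan_credits, credits_per_request)
    return round(plan_price_usd / requests * 1000, 2)


PLAN_PRICE_USD = 49       # from the article
PLAN_CREDITS = 150_000    # hypothetical allowance, not a confirmed figure
STEALTH_MULTIPLIER = 75   # credits per stealth-proxy request, from the article

basic_requests = requests_per_plan(PLAN_CREDITS, 1)
stealth_requests = requests_per_plan(PLAN_CREDITS, STEALTH_MULTIPLIER)
stealth_cpm = cost_per_thousand(PLAN_PRICE_USD, PLAN_CREDITS, STEALTH_MULTIPLIER)

print(basic_requests, stealth_requests, stealth_cpm)
```

With this assumed allowance, 150,000 nominal credits shrink to 2,000 stealth requests, or about $24.50 per 1,000 requests.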

| ✅ Pros | ❌ Cons |
| --- | --- |
| Easy to use | No dedicated real estate endpoints |
| Good documentation | Expensive credit multiplier system |

Best for: Teams already using ScrapingBee who want to add occasional real estate scraping to existing workflows.

6. Apify — Flexible Platform, Variable Quality

Apify is a marketplace of 4,000+ community-built Actors, with multiple Zillow, Redfin, Rightmove, and Idealista actors available.

Because it is community-maintained, quality varies wildly. Actors can break without warning when sites update their anti-bot rules. The billing is also complex, combining compute units, proxy GB, and storage, making it hard to predict monthly costs for real estate workloads.

| ✅ Pros | ❌ Cons |
| --- | --- |
| Huge marketplace of Actors | Quality varies by Actor |
| Highly flexible | Complex billing (compute + proxy + storage) |

Best for: Technical teams building custom pipelines who want full control and don't mind actor research and vetting.

Head-to-Head Comparison Table

This table compares the core capabilities of the top real estate scraping APIs. It highlights which platforms offer dedicated endpoints, which anti-bot systems they can bypass, and their starting prices.

Feature

ScrapeBadger

Bright Data

Oxylabs

Scrapfly

ScrapingBee

Apify

Dedicated real estate endpoint

⚠️

⚠️ Actor

Zillow support

⚠️

✅ Actor

Rightmove support

⚠️

✅ Actor

Structured JSON output

❌ (raw HTML)

Imperva bypass

⚠️

⚠️

Cloudflare bypass

⚠️

PerimeterX bypass

⚠️

⚠️

Residential proxies included

Extra cost

Transparent pricing

⚠️

⚠️

⚠️

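| Feature | ScrapeBadger | Bright Data | Oxylabs | Scrapfly | ScrapingBee | Apify |
| --- | --- | --- | --- | --- | --- | --- |
| Starting price | $0.60/1k req | $499/mo | $49/mo | $30/mo | $49/mo | $39/mo |
| Best for | Real estate first | Enterprise | Proxy-heavy teams | Developers | General scraping | Custom pipelines |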

What Real Estate Data Can You Actually Extract?

When you successfully bypass the anti-bot protections, the amount of data available is staggering. Here is a breakdown of what you can extract from the major platforms.

| Data Category | Fields | Platforms |
| --- | --- | --- |
| Listing details | Address, price, beds, baths, sqft, property type | All |
| Valuation | Zestimate, Redfin Estimate, price per sqft | Zillow, Redfin |
| Price history | All previous listing prices + dates | Zillow, Redfin |
| Days on market | Current + historical | All |
| Property features | HOA, lot size, year built, parking, heating | All |
| Photos | All listing images | All |
| Agent/seller info | Name, agency, contact | All |
| School ratings | Elementary, middle, high school scores | Zillow, Redfin |
| Neighborhood data | Walk score, transit score, demographics | Zillow, Redfin |
| Rental data | Monthly rent, lease terms, availability | Zillow, Apartments.com |
| Sold data | Sale price, date sold, price delta | Zillow, Redfin, Rightmove |

This data powers a massive ecosystem of secondary applications. Investment analysts use it to spot undervalued markets. Prop-tech startups use it to build automated valuation models. Listing aggregators use it to create comprehensive market overviews.

Real Estate Scraping Use Cases

Extracting the data is only the first step. Here is how companies are actually using scraped real estate data in production.

Property Investment Analysis

Investors track price trends, days on market, and price reductions across specific ZIP codes to identify undervalued markets before they heat up.
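As a rough sketch of what that analysis looks like once the listings are collected, the snippet below groups records by ZIP code and computes a few summary metrics. The field names (`zip`, `price`, `days_on_market`, `price_reduced`) and the sample records are hypothetical, not the output schema of any particular API.

```python
from collections import defaultdict
from statistics import median


def summarize_by_zip(listings):
    """Group listings by ZIP code and compute simple market metrics."""
    groups = defaultdict(list)
    for item in listings:
        groups[item["zip"]].append(item)

    summary = {}
    for zip_code, items in groups.items():
        reduced = sum(1 for i in items if i["price_reduced"])
        summary[zip_code] = {
            "median_price": median(i["price"] for i in items),
            "median_days_on_market": median(i["days_on_market"] for i in items),
            "price_cut_share": reduced / len(items),
        }
    return summary


# Invented sample data for illustration.
listings = [
    {"zip": "78701", "price": 500000, "days_on_market": 10, "price_reduced": False},
    {"zip": "78701", "price": 450000, "days_on_market": 40, "price_reduced": True},
    {"zip": "78701", "price": 480000, "days_on_market": 25, "price_reduced": True},
    {"zip": "78702", "price": 390000, "days_on_market": 5, "price_reduced": False},
]

summary = summarize_by_zip(listings)
print(summary["78701"])
```

A high share of price cuts combined with long days on market is one simple signal that sellers in a ZIP code are losing pricing power.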

Rental Market Monitoring

Property managers monitor rental listings across Zillow, Apartments.com, and Rightmove to track vacancy rates, price movements, and seasonal trends in their target neighborhoods.

Listing Aggregation Platform

Startups build niche portals (e.g., "homes with large garages" or "eco-friendly properties") by aggregating and filtering listings from the major platforms.

Competitive Intelligence for Real Estate Agents

Agencies monitor competitor listings, pricing strategies, and time-on-market to sharpen their own negotiation tactics and pricing models.

Automated Price Alerts

Buyers and investors set up real-time scrapers that trigger SMS or email alerts when properties in target areas drop below a specific price threshold or when new listings match exact criteria.
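The alerting logic itself is small. Here is a minimal sketch that compares two price snapshots and flags listings that crossed below a threshold, plus new listings already under it. The snapshot format (listing ID mapped to price) and the IDs are assumptions made for the example, and the "matching criteria" are simplified to a single price check.

```python
def find_alerts(previous, current, threshold):
    """Compare two {listing_id: price} snapshots and return alert tuples."""
    alerts = []
    for listing_id, price in current.items():
        old_price = previous.get(listing_id)
        if old_price is None:
            # Listing was not in the previous snapshot.
            if price < threshold:
                alerts.append((listing_id, "new_listing", price))
        elif old_price >= threshold and price < threshold:
            # Price crossed below the threshold since the last snapshot.
            alerts.append((listing_id, "price_drop", price))
    return alerts


# Invented snapshots for illustration.
previous = {"a": 510000, "b": 300000}
current = {"a": 495000, "b": 300000, "c": 480000, "d": 700000}

alerts = find_alerts(previous, current, threshold=500000)
print(alerts)
```

Alerting only on the crossing, rather than on every listing under the threshold, keeps a listing like "b" from firing on every run.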

Academic & Policy Research

Housing economists, urban planners, and NGOs use scraped real estate data to study affordability, gentrification, and housing supply dynamics at a granular level.

Is Scraping Real Estate Websites Legal?

This is the most common question developers ask before building a scraper. While this is not legal advice, the landscape is generally favorable for scraping public data.

Scraping publicly visible listing data is generally lawful in the US and EU. In hiQ Labs v. LinkedIn, the Ninth Circuit reaffirmed in 2022 that scraping publicly accessible data likely does not violate the Computer Fraud and Abuse Act (CFAA). Real estate listing data, such as addresses, prices, and photos visible to any website visitor without logging in, falls squarely into the "public data" category.

The key risks involve Terms of Service (ToS) violations. Zillow and Rightmove explicitly prohibit automated access in their ToS. However, violating a ToS is generally a civil risk (breach of contract), not a criminal one. The risk increases significantly if you engage in the commercial redistribution of raw data rather than using it for internal analysis.

In the EU, GDPR is a major consideration. Personal data about sellers or agents requires careful handling and compliance.

The best practice is to use reasonable request rates, avoid hammering servers, never resell raw scraped data directly, and consult legal counsel if you plan on commercial redistribution.
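One way to keep request rates reasonable is to pause between requests with a randomized delay. The sketch below is a minimal throttle under stated assumptions: the 2-second base and 1.5-second jitter are arbitrary defaults, and `fetch` is a stand-in for whatever function actually retrieves a page.

```python
import random
import time


def jittered_delay(base_seconds, jitter_seconds, rng):
    """Return the base delay plus a random extra of up to jitter_seconds."""
    return base_seconds + rng.uniform(0, jitter_seconds)


def polite_fetch_all(urls, fetch, base_seconds=2.0, jitter_seconds=1.5,
                     sleep=time.sleep, rng=None):
    """Fetch URLs one at a time, pausing a randomized interval between them."""
    rng = rng or random.Random()
    results = []
    for index, url in enumerate(urls):
        if index > 0:
            sleep(jittered_delay(base_seconds, jitter_seconds, rng))
        results.append(fetch(url))
    return results


# Demo with a fake fetch and a recording "sleep" so it runs instantly.
recorded = []
urls = ["https://example.com/1", "https://example.com/2", "https://example.com/3"]
pages = polite_fetch_all(urls, fetch=lambda u: f"<html>{u}</html>", sleep=recorded.append)
print(len(pages), recorded)
```

Passing `sleep` and `rng` as parameters is a design choice that keeps the function testable. In real use you would leave both at their defaults so the script actually waits.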

ScrapeBadger scrapes only publicly available data and is fully compliant with GDPR and CCPA.

Frequently Asked Questions (FAQ)

Q: What is the best API for scraping Zillow?

A: ScrapeBadger is the best API for scraping Zillow because it automatically bypasses Imperva Incapsula, rotates residential proxies, and returns clean, structured JSON data without requiring custom parsing logic.

Q: Does Zillow block web scrapers?

A: Yes, Zillow aggressively blocks web scrapers using Imperva Incapsula. It detects bots via IP reputation, TLS/browser fingerprinting, and behavioral analysis, often returning a 403 Forbidden error or a CAPTCHA challenge.

Q: Is scraping Rightmove legal?

A: Scraping publicly available property data on Rightmove is generally considered legal, as it does not violate computer fraud laws. However, it does violate Rightmove's Terms of Service, which carries civil, not criminal, risks.

Q: Can I scrape Redfin without getting blocked?

A: Yes, you can scrape Redfin without getting blocked by using a scraping API that handles session rotation and residential proxy management to stay under their strict rate-limiting thresholds.

Q: What data can you extract from real estate websites?

A: You can extract listing prices, property features (beds, baths, sqft), price history, days on market, agent contact information, high-resolution photos, and automated valuations like the Zestimate.

Q: Why do real estate scrapers get blocked so often?

A: Real estate data is highly valuable, so platforms invest heavily in enterprise anti-bot systems like Imperva, Cloudflare, and PerimeterX to prevent competitors and aggregators from extracting their proprietary datasets for free.

Q: How much does a real estate scraping API cost?

A: Costs vary widely. ScrapeBadger starts at around $0.60 per 1,000 requests, while enterprise solutions like Bright Data start at $499 per month. Most APIs charge based on successful requests or bandwidth used.

Q: Which anti-bot system does Zillow use?

A: Zillow uses Imperva Incapsula, a sophisticated Web Application Firewall (WAF) that deploys JavaScript challenges, behavioral fingerprinting, and strict IP blacklisting to block automated traffic.

Conclusion

Real estate is the hardest scraping category on the web precisely because the data is so valuable. A DIY Python script might work for 50 requests, but it will fail when you need 50,000. To scale, you need dedicated infrastructure.

Here is the final verdict on the best tools for the job:

  • ScrapeBadger — The best end-to-end solution for real estate data, from Zillow to Rightmove, offering high success rates and clean JSON.

  • Bright Data — Unmatched success rates if you have an enterprise budget and a dedicated data engineering team.

  • Oxylabs — A solid choice if you are already deeply invested in their proxy ecosystem.

  • Scrapfly — Strong developer experience and a high success rate at a lower entry cost.

  • ScrapingBee — Fine for occasional scraping, but the credit multiplier makes it too expensive for real estate at scale.

  • Apify — Powerful and flexible, but requires significant actor vetting and engineering investment.

If you are building anything on real estate data, ScrapeBadger is the fastest path from being blocked to being in production. Start your free trial today.

Written by

Thomas Shultz

Thomas Shultz is the Head of Data at ScrapeBadger, working on public web data, scraping infrastructure, and data reliability. He writes about real-world scraping, data pipelines, and turning unstructured web data into usable signals.
