Web scraping realtor.com

Name: realtor.com Web Scraping API Benchmark
Creator: Scrapeway

Last updated: 2024-04-08

Realtor is the second biggest real estate listing website in the United States based in California with over 100 million monthly users. This makes it one of the most popular real estate targets for web scraping.

Realtor.com is using Kasada anti-bot protection together with proprietary anti-scraping technology to block web scraping. This makes it difficult to scrape Realtor property data reliably and this is where web scraping APIs come in handy.

Overall, only few of web scraping APIs we've tested through our benchmarks perform well for Realtor.com at $3.29 per 1,000 scrape requests on average.

Realtor.com scraping API benchmarks

Scrapeway runs bi-weekly benchmarks for Realtor Listings against the most popular web scraping APIs. Here's the ranking for this period:

Web scraping API benchmark for realtor.com — success rate, speed, cost per 1,000 requests. Data: 2026-07-11 to 2026-07-17.
#	Service	Success	Speed	Cost/1k
1 🥇	Scrapfly	97% =	5.0s +0.1	$4.37 =	(237) ★ 4.9
2 🥈	Firecrawl	97% +25	8.1s -1.3	$6.86 -0.32	—
3 🥉	WebScrapingAPI	75% -1	19.7s +1.3	$2.71 =	—
4	Zenrows	33% -5	36.3s +1.1	$6.9 =	(103) ★ 4.8
5	Scraperapi	17% +3	8.0s +4.0	$0.49 =	(62) ★ 4.6
6	Scrapingdog	12% =	18.8s -1.1	$5.0 =	—
7	Scrapingbee	0% —	— —	— —	(137) ★ 4.9
8	Scrapingant	0% =	— -11.9	— -1.9	—

Data range Jul 11 – Jul 17

All Benchmarks →

How to scrape realtor.com?

Realtor is one of the easiest targets to scrape as it's a highly efficient javascript application that stores all of its data in JSON format which means headless browser use is not required.

That being said, Realtor.com has a lot of anti-scraping technologies in place, so it's recommended to use a reliable web scraping service that can bypass the constantly changing anti-scraping measures. See benchmarks for the most up-to-date results.

Realtor's HTML datasets contain their data in JSON variables under NextJS framework variables like __NEXT_DATA__ and can be easily extracted for full listing datasets making it an easy scraping target overall.

Code example

realtor_scraper.py

import json
from parsel import Selector

# install using `pip install scrapfly-sdk`
from scrapfly import ScrapflyClient, ScrapeConfig, ScrapeApiResponse

# create an API client instance
client = ScrapflyClient(key="YOUR API KEY")

# create scrape function that returns HTML parser for a given URL
def scrape(url: str, country: str="", render_js=False, headers: dict=None) -> Selector:
    api_result = client.scrape(ScrapeConfig(
            url=url,
            asp=True,
            render_js=False,
            cache=False,
            cache_ttl=900,
            url='https://www.realtor.com/realestateandhomes-detail/306-Baden-St_San-Francisco_CA_94131_M11858-73784',
            method='GET',

    ))
    return api_result.selector

url = "https://www.realtor.com/realestateandhomes-detail/16-Sea-Cliff-Ave_San-Francisco_CA_94121_M21813-49460"
selector = scrape(url)

# The entire dataset can be found in a javascript variable:
data = selector.css("script#__NEXT_DATA__::text").get()
data = json.loads(data)["props"]["pageProps"]["initialReduxState"]

# The resulting dataset is pretty big but here are some example fields:
from pprint import pprint
pprint(data)