Web scraping walmart.com

Name: walmart.com Web Scraping API Benchmark
Creator: Scrapeway

Last updated: 2024-04-08

Walmart is one of the largest e-commerce retailers in the United States, and its website lists product data for both its brick-and-mortar stores and its online store.

Walmart uses proprietary anti-scraping mechanisms that are constantly evolving. This makes it difficult to scrape Walmart data reliably, which is where web scraping APIs come in handy.

Overall, most web scraping APIs we've tested through our benchmarks perform well on Walmart, at an average cost of $2.88 per 1,000 scrape requests.

Walmart.com scraping API benchmarks

Scrapeway runs bi-weekly benchmarks for Walmart Products against the most popular web scraping APIs. Here's the ranking for this period:

Web scraping API benchmark for walmart.com — success rate, speed, cost per 1,000 requests. Data: 2026-07-11 to 2026-07-17.
#	Service	Success	Speed	Cost/1k	Reviews (count ★ rating)
1 🥇	Scrapfly	100% =	2.8s -2.9	$0.21 -3.38	(237) ★ 4.9
2 🥈	Scrapingant	91% =	42.6s +2.0	$1.9 =	—
3 🥉	Firecrawl	89% -3	5.2s -0.8	$7.84 -2.51	—
4	Scraperapi	89% =	7.9s -2.6	$2.45 =	(62) ★ 4.6
5	WebScrapingAPI	87% +2	14.6s +3.4	$2.71 =	—
6	Scrapingdog	83% -4	7.2s -0.3	$1.0 =	—
7	Zenrows	45% -49	12.2s -0.1	$6.9 =	(103) ★ 4.8
8	Scrapingbee	0% —	— —	— —	(137) ★ 4.9
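
Success rate and cost can be combined into a single figure for comparing services. The sketch below is a rough illustration, not part of the benchmark itself: it assumes failed requests are billed like successful ones (this varies by provider), and the function names are hypothetical. The rows are copied from the table above; Scrapingbee is left out because it has no cost figure for this period.

```python
# (service, success rate, cost per 1,000 requests in USD), copied from the table above
ROWS = [
    ("Scrapfly", 1.00, 0.21),
    ("Scrapingant", 0.91, 1.90),
    ("Firecrawl", 0.89, 7.84),
    ("Scraperapi", 0.89, 2.45),
    ("WebScrapingAPI", 0.87, 2.71),
    ("Scrapingdog", 0.83, 1.00),
    ("Zenrows", 0.45, 6.90),
]


def cost_per_1k_successes(cost_per_1k: float, success_rate: float) -> float:
    """Cost of 1,000 *successful* scrapes.

    Assumption: failed requests are billed too, so the listed cost
    is divided by the fraction of requests that succeed.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in the range (0, 1]")
    return cost_per_1k / success_rate


def rank_by_effective_cost(rows):
    """Return (service, effective cost) pairs sorted from cheapest to most expensive."""
    ranked = [(name, cost_per_1k_successes(cost, rate)) for name, rate, cost in rows]
    return sorted(ranked, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, cost in rank_by_effective_cost(ROWS):
        print(f"{name:<16} ${cost:.2f} per 1,000 successful scrapes")
```

If a provider only charges for successful requests, the listed Cost/1k figure can be used directly and this adjustment does not apply.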



How to scrape walmart.com?

Walmart is relatively easy to scrape: its pages are mostly static content with only a few dynamic elements, so a headless browser is not required.

That said, Walmart has many anti-scraping mechanisms in place, so it's recommended to use a reliable web scraping service that can bypass these constantly changing measures. See the benchmarks for the most up-to-date results.

Walmart's HTML can be difficult to parse because of the sheer number of data points. However, many of them can be accessed through the page data embedded by the Next.js framework that Walmart uses. To find it, look for the script element with the id __NEXT_DATA__ in the HTML source; it holds the page data as JSON.
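
To illustrate the idea without any scraping service, here is a minimal sketch that pulls the __NEXT_DATA__ JSON out of an HTML string using only the Python standard library. The sample HTML and the names NextDataParser and extract_next_data are made up for this illustration; a real Walmart page carries a far larger payload.

```python
import json
from html.parser import HTMLParser


class NextDataParser(HTMLParser):
    """Collects the text inside <script id="__NEXT_DATA__">."""

    def __init__(self):
        super().__init__()
        self._inside = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("id") == "__NEXT_DATA__":
            self._inside = True

    def handle_endtag(self, tag):
        if tag == "script":
            self._inside = False

    def handle_data(self, data):
        if self._inside:
            self.chunks.append(data)


def extract_next_data(html: str) -> dict:
    """Return the parsed __NEXT_DATA__ JSON, or raise if the script is missing."""
    parser = NextDataParser()
    parser.feed(html)
    parser.close()
    if not parser.chunks:
        raise ValueError("no __NEXT_DATA__ script found in the HTML")
    return json.loads("".join(parser.chunks))


# Hypothetical, heavily trimmed stand-in for a product page
SAMPLE_HTML = """
<html><body>
<script id="__NEXT_DATA__" type="application/json">
{"props": {"pageProps": {"initialData": {"data": {"product": {"name": "Example product"}}}}}}
</script>
</body></html>
"""

if __name__ == "__main__":
    next_data = extract_next_data(SAMPLE_HTML)
    print(next_data["props"]["pageProps"]["initialData"]["data"]["product"])
```

Raising an error when the script is missing is a deliberate choice in this sketch: a blocked or changed page then fails loudly instead of producing empty data.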

Code example

walmart_scraper.py

import json
from parsel import Selector

# install using `pip install scrapfly-sdk`
from scrapfly import ScrapflyClient, ScrapeConfig, ScrapeApiResponse

# create an API client instance
client = ScrapflyClient(key="YOUR API KEY")

# create a scrape function that returns an HTML parser for a given URL
def scrape(url: str, country: str = "", render_js: bool = False, headers: dict = None) -> Selector:
    api_result: ScrapeApiResponse = client.scrape(ScrapeConfig(
        url=url,
        asp=True,  # enable the anti-scraping protection bypass
        render_js=render_js,  # off by default: Walmart pages don't need a headless browser
        cache=False,  # always fetch a fresh copy of the page
        country=country or None,  # optional proxy country; None leaves the default
        headers=headers,  # optional extra request headers
        method="GET",
    ))
    return api_result.selector

url = "https://www.walmart.com/ip/Apple-MacBook-Air-13-3-inch-Laptop-Space-Gray-M1-Chip-8GB-RAM-256GB-storage/609040889"
selector = scrape(url)

# Walmart uses the Next.js framework, so the product data is stored as JSON
# in the <script id="__NEXT_DATA__"> element
data = selector.xpath('//script[@id="__NEXT_DATA__"]/text()').get()
if data is None:
    raise ValueError("__NEXT_DATA__ not found: the page may be blocked or its layout changed")
data = json.loads(data)
product = data["props"]["pageProps"]["initialData"]["data"]["product"]

# the resulting dataset is pretty big; print all of it to see which fields are available:
from pprint import pprint
pprint(product)
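
Since the full product object is large, it is often more practical to keep only a few fields. The sketch below shows one way to read nested keys without raising KeyError when a field is absent. The dig helper, the sample dictionary and the field names (name, priceInfo, currentPrice, brand) are assumptions for illustration; check the printed output above for the fields actually present.

```python
def dig(data, *path, default=None):
    """Walk nested dictionaries along `path`, returning `default` if any key is missing."""
    current = data
    for key in path:
        if isinstance(current, dict) and key in current:
            current = current[key]
        else:
            return default
    return current


# Hypothetical stand-in for the `product` dictionary from the scraper above
sample_product = {
    "name": "Example laptop",
    "priceInfo": {"currentPrice": {"price": 699.0}},
}

summary = {
    "name": dig(sample_product, "name"),
    "price": dig(sample_product, "priceInfo", "currentPrice", "price"),
    "brand": dig(sample_product, "brand", default="unknown"),
}

if __name__ == "__main__":
    print(summary)
```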