Web scraping instagram.com

Name: instagram.com Web Scraping API Benchmark
Creator: Scrapeway

Last updated: 2024-04-08

Instagram is one of the biggest social networks focused on media sharing and public announcements, which makes it a popular target for web scraping.

Instagram.com uses proprietary anti-scraping technology that is updated constantly. This makes it difficult to scrape Instagram reliably at scale, and that is where web scraping APIs come in handy.

Overall, most web scraping APIs we've tested through our benchmarks perform well for scraping Instagram.com at $1.53 per 1,000 scrape requests on average.

Instagram.com scraping API benchmarks

Scrapeway runs bi-weekly benchmarks for Instagram Pages against the most popular web scraping APIs. Here's the ranking for this period:

Web scraping API benchmark for instagram.com — success rate, speed, cost per 1,000 requests. Data: 2026-07-11 to 2026-07-17.
#	Service	Success	Speed	Cost/1k	Reviews
1 🥇	Scrapfly	99% +1	16.5s +9.0	$4.32 -0.07	(237) ★ 4.9
2 🥈	Scrapingant	93% -3	17.3s +4.3	$1.90 =	—
3 🥉	WebScrapingAPI	82% +5	14.6s -0.8	$2.71 =	—
4	Scrapingbee	24% -5	3.1s -0.2	$3.29 +0.04	(137) ★ 4.9
5	Zenrows	0% —	— —	— —	(103) ★ 4.8
6	Scrapingdog	0% —	— —	— —	—
7	Scraperapi	0% —	— —	— —	(62) ★ 4.6
8	Firecrawl	0% —	— —	— —	—
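
The cost column alone can be misleading when success rates differ this much. As a rough illustration, the sketch below divides each service's cost per 1,000 requests by its success rate to estimate the cost of 1,000 successful scrapes. This rests on an assumption not stated in the benchmark: that failed requests are billed like successful ones. Many services do not charge for failures, so treat the result as an upper bound. The helper name `cost_per_1k_successful` is hypothetical.

```python
# Figures copied from the benchmark table above:
# (success rate in %, cost in USD per 1,000 requests).
# Services with a 0% success rate are left out because no cost was reported for them.
benchmark = {
    "Scrapfly": (99, 4.32),
    "Scrapingant": (93, 1.90),
    "WebScrapingAPI": (82, 2.71),
    "Scrapingbee": (24, 3.29),
}


def cost_per_1k_successful(success_pct: float, cost_per_1k: float) -> float:
    """Hypothetical helper: estimated cost of 1,000 successful scrapes.

    Assumes failed requests are billed too, which may not hold for every service.
    """
    if success_pct <= 0:
        raise ValueError("success rate must be positive")
    return cost_per_1k / (success_pct / 100)


effective = {
    name: round(cost_per_1k_successful(success, cost), 2)
    for name, (success, cost) in benchmark.items()
}

# print the services from cheapest to most expensive per successful scrape
for name, value in sorted(effective.items(), key=lambda item: item[1]):
    print(f"{name}: ${value:.2f} per 1,000 successful scrapes")
```

Under that billing assumption, a service with a low list price but a low success rate can end up costing more per usable result than a pricier but more reliable one.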

How to scrape instagram.com?

Instagram.com can be surprisingly complex to scrape, as it's a giant web app with a GraphQL backend. For people unfamiliar with reverse engineering through the browser's network inspector, it's probably best to pay extra and use a headless browser to fully render pages.

In that case, look for web scraping API services that support full browser automation, so the scraper can click on specific posts and scroll through comments.

That being said, Instagram can be scraped without headless browsers by using its GraphQL backend, as in this Python example for user profile scraping:

Code example

instagram_scraper.py

import json
from pprint import pprint

# install using `pip install scrapfly-sdk`
from scrapfly import ScrapflyClient, ScrapeConfig, ScrapeApiResponse

# create an API client instance
client = ScrapflyClient(key="YOUR API KEY")

# create scrape function that returns the API response for a given URL
def scrape(url: str, country: str = None, render_js: bool = False, headers: dict = None) -> ScrapeApiResponse:
    return client.scrape(ScrapeConfig(
        url=url,
        asp=True,  # enable anti scraping protection bypass
        render_js=render_js,  # headless browser is not needed for the backend API
        country=country,
        headers=headers,
        method='GET',
    ))

# this example shows how Instagram can be scraped through its backend API
username = "google"
api_result = scrape(
    url=f"https://i.instagram.com/api/v1/users/web_profile_info/?username={username}",
    headers={"x-ig-app-id": "936619743392459"},  # this is needed to access IG backend API
)

# the backend responds with JSON rather than HTML, so parse the response body directly;
# this returns a giant JSON dataset with all Instagram profile details
dataset = json.loads(api_result.content)['data']['user']

# print the dataset to explore what can be found in it:
pprint(dataset)
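
Since the full dataset is large, it usually helps to reduce it to the handful of fields a project needs. The following is a minimal sketch of that step, not part of the benchmark code. The helper name `parse_profile` is hypothetical, and the field names (`full_name`, `edge_followed_by` and so on) are assumptions about how the profile JSON is laid out. Instagram can change this structure at any time, so every lookup falls back to `None`. The sample dictionary is made-up illustration data, not a real response.

```python
from typing import Any, Dict


def parse_profile(user: Dict[str, Any]) -> Dict[str, Any]:
    """Pick a few commonly used fields out of the raw profile dataset.

    Field names are assumptions about the response layout;
    any missing key results in None instead of an exception.
    """
    return {
        "username": user.get("username"),
        "full_name": user.get("full_name"),
        "biography": user.get("biography"),
        "is_verified": user.get("is_verified"),
        "followers": (user.get("edge_followed_by") or {}).get("count"),
        "following": (user.get("edge_follow") or {}).get("count"),
        "posts": (user.get("edge_owner_to_timeline_media") or {}).get("count"),
    }


# made-up sample shaped like the assumed layout, for illustration only
sample_dataset = {
    "username": "example_user",
    "full_name": "Example User",
    "biography": "Just an example.",
    "is_verified": False,
    "edge_followed_by": {"count": 1200},
    "edge_follow": {"count": 35},
    # "edge_owner_to_timeline_media" is left out to show the None fallback
}

profile = parse_profile(sample_dataset)
print(profile)
```

In the scraper above, the same helper would be called as `parse_profile(dataset)` once the profile JSON has been loaded.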