Web scraping linkedin.com

Name: linkedin.com Web Scraping API Benchmark
Creator: Scrapeway

Last updated: 2024-04-08

LinkedIn is by far the biggest career-focused social network and thus contains an incredible amount of job-related data. Job listings, company profiles and public CVs of individuals are all very popular web scraping targets.

LinkedIn.com uses its own proprietary web scraping protection technology, which is constantly updated and is one of the toughest to bypass. This makes LinkedIn pages difficult to scrape, and this is where the value of a web scraping API truly shows.

Overall, few of the web scraping APIs we've tested in our benchmarks can scrape LinkedIn reliably, and those that can cost $5.26 per 1,000 scrape requests on average.

Linkedin.com scraping API benchmarks

Scrapeway runs bi-weekly benchmarks of LinkedIn public profiles against the most popular web scraping APIs. Here's the ranking for this period:

Web scraping API benchmark for linkedin.com — success rate, speed, cost per 1,000 requests. Data: 2026-06-13 to 2026-06-19.
#	Service	Success	Speed	Cost/1k	Reviews
1 🥇	Scrapfly	98% +1	11.6s -21.9	$7.75 +1.36	(237) ★ 4.9
2 🥈	Scrapingdog	96% =	0.2s +=	$10.0 =	—
3 🥉	WebScrapingAPI	64% -24	13.8s +0.1	$2.71 =	—
4	Scraperapi	53% -37	19.4s -0.4	$14.7 =	(62) ★ 4.6
5	Zenrows	46% -21	15.5s -6.7	$6.9 =	(103) ★ 4.8
6	Scrapingbee	0% —	— —	— —	(137) ★ 4.9
7	Firecrawl	0% —	— —	— —	—
8	Scrapingant	0% —	— —	— —	—
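
One way to read the table is to fold cost and success rate into a single figure: the cost of 1,000 successful scrapes. The sketch below does that arithmetic for the services with a non-zero success rate. It assumes that failed requests are billed like successful ones, which the benchmark does not state and which may not hold for every provider. The `effective_cost_per_1k` helper is a hypothetical name introduced for illustration.

```python
# Rows copied from the table above:
# (service, success rate in %, cost per 1,000 requests in USD).
ROWS = [
    ("Scrapfly", 98, 7.75),
    ("Scrapingdog", 96, 10.0),
    ("WebScrapingAPI", 64, 2.71),
    ("Scraperapi", 53, 14.7),
    ("Zenrows", 46, 6.9),
]


def effective_cost_per_1k(success_pct: float, cost_per_1k: float) -> float:
    """Cost of 1,000 successful scrapes, ASSUMING failed requests are billed too."""
    if success_pct <= 0:
        raise ValueError("service has no successful scrapes")
    return cost_per_1k / (success_pct / 100)


# Sort from cheapest to most expensive per successful scrape.
ranked = sorted(ROWS, key=lambda row: effective_cost_per_1k(row[1], row[2]))
for service, success, cost in ranked:
    print(f"{service:<15} ${effective_cost_per_1k(success, cost):.2f}")
```

If a provider only bills successful requests, the listed Cost/1k is already the effective cost and this adjustment does not apply.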

How to scrape linkedin.com?

With the anti-bot bypass provided by web scraping APIs, LinkedIn.com is not very difficult to scrape. Most LinkedIn content is static, so a headless browser is not required to scrape it effectively. See the benchmarks for the most up-to-date results.

LinkedIn's HTML pages are well structured, so they can be easily parsed using traditional HTML parsing tools like XPath or CSS selectors. In addition, a big chunk of the dataset is also available as JSON-LD structured data embedded in the HTML page.
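
To illustrate the JSON-LD approach without any API key, here is a minimal sketch that pulls `application/ld+json` blocks out of an HTML string using only the Python standard library. The `JsonLdExtractor` class and the sample page are made up for this example; the sample is not real LinkedIn markup, and real pages may structure their data differently.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the parsed contents of every <script type="application/ld+json"> tag."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        # Script contents may arrive in several chunks, so buffer them.
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            self.documents.append(json.loads("".join(self._buffer)))


# Made-up sample page (an assumption for illustration only).
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org",
 "@graph": [{"@type": "WebPage", "url": "https://example.com/in/jane"},
            {"@type": "Person", "name": "Jane Doe", "jobTitle": "Engineer"}]}
</script>
</head><body><h1>Jane Doe</h1></body></html>
"""

extractor = JsonLdExtractor()
extractor.feed(SAMPLE_HTML)
extractor.close()

# Pick the Person node out of the JSON-LD graph.
graph = extractor.documents[0]["@graph"]
person = next(node for node in graph if node["@type"] == "Person")
print(person["name"], "-", person["jobTitle"])
```

In practice a selector library such as parsel does the same job in one XPath expression; the standard-library version is shown only to keep the sketch dependency-free.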

Code example

linkedin_scraper.py

import json
from pprint import pprint

from parsel import Selector

# install using `pip install scrapfly-sdk`
from scrapfly import ScrapflyClient, ScrapeConfig

# create an API client instance
client = ScrapflyClient(key="YOUR API KEY")


# create scrape function that returns HTML parser for a given URL
def scrape(url: str, country: str = None, render_js: bool = False, headers: dict = None) -> Selector:
    api_result = client.scrape(ScrapeConfig(
        url=url,
        asp=True,  # enable anti scraping protection bypass
        render_js=render_js,  # most LinkedIn content is static, so off by default
        country=country,  # optional proxy country
        headers=headers,  # optional custom request headers
        cache=False,
        method="GET",
    ))
    return api_result.selector


url = "https://www.linkedin.com/in/adammgrant"
selector = scrape(url)

# a big chunk of the dataset can be found in the JSON-LD markup:
data = json.loads(selector.xpath("//script[@type='application/ld+json']/text()").get())

# the resulting dataset is pretty big, so print only the Person node as an example:
person_data = next(d for d in data['@graph'] if d['@type'] == "Person")
pprint(person_data)
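
Since the Person node is large, it can help to reduce it to a handful of fields. The sketch below shows one defensive way to do that. The sample node is invented and uses common schema.org property names (`name`, `jobTitle`, `worksFor`, `address`); which of these LinkedIn actually emits is an assumption, so the helpers tolerate missing keys. Both `as_list` and `summarize_person` are hypothetical helper names.

```python
from typing import Any

# Made-up Person node using schema.org property names;
# the fields present in real LinkedIn output may differ.
SAMPLE_PERSON = {
    "@type": "Person",
    "name": "Jane Doe",
    "jobTitle": ["Engineer", "Author"],
    "worksFor": [{"@type": "Organization", "name": "Example Corp"}],
    "address": {"@type": "PostalAddress", "addressLocality": "Lisbon"},
}


def as_list(value: Any) -> list:
    """JSON-LD lets a property hold one value or a list; normalize to a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def summarize_person(person: dict) -> dict:
    """Pick a few fields, tolerating missing keys instead of raising KeyError."""
    return {
        "name": person.get("name"),
        "job_titles": as_list(person.get("jobTitle")),
        "employers": [
            org.get("name")
            for org in as_list(person.get("worksFor"))
            if isinstance(org, dict)
        ],
        "city": (person.get("address") or {}).get("addressLocality"),
    }


summary = summarize_person(SAMPLE_PERSON)
print(summary)
```

With the scraper above, the same helper could be called as `summarize_person(person_data)`.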