If you've ever run a scraper or API script and it just sat there doing nothing, there's a good chance you hit a Python Requests timeout without even noticing. A missing or poorly chosen timeout can make a simple job freeze, waste runtime, or stall an entire scraping workflow. Getting your Python requests timeout settings right isn't optional: it's what keeps your scripts fast, predictable, and sane.
In this guide we'll break down how Requests timeout behavior actually works, clean up the common traps in native Requests, and show where better retry patterns and safer defaults save you a lot of pain. And when the real issue isn't your code at all (bot protection, heavy client-side rendering, IP throttling) we'll talk about the point where a Python request timeout stops being a code tweak and starts being a job for a managed layer like a proper web scraping API.

Quick answer: Fixing Python Requests timeouts step-by-step
Here's the fast track so you don't drown in walls of text:
- Always set a timeout — never call requests.get(...) or anything else without a Python Requests timeout or timeout=(connect, read) value unless you clearly understand what you're doing. If you skip it, your script might just hang forever. Note however that connect/read aren't wall-clock; real elapsed time can be higher.
- Split connect vs read timeouts — keep the connect timeout short and give the read timeout a bit more room. It prevents instant failures on slow servers while keeping things predictable.
- Catch timeout exceptions — wrap your call in try/except so a single Requests timeout Python moment doesn't kill the entire run.
- Use a session with defaults — create a custom Session with a default timeout so you don't repeat yourself or forget timeouts in bigger projects.
- Add retries with backoff — don't hammer a dying server with instant retries. Use exponential backoff with a bit of jitter to keep things civil.
- Know when it's not your fault — if the timeouts come from IP blocks, bot protection, or heavy JS, timeout settings often won't fix that.
- Use a managed layer when needed — when the site is actually fighting you, let a proper scraping layer handle the messy stuff like rendering, proxies, and anti-bot shields.
That's the whole flow: set timeouts, catch failures, retry smart, centralize defaults, and switch tools when the website decides to play dirty. Next we'll cover all these topics in greater detail.
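But if you just want something to paste in and adapt first, here's a minimal sketch that strings those steps together (the URL, attempt count, and timeout values are placeholders, not recommendations):

import random
import time

import requests

def fetch_with_timeout(url, attempts=3, timeout=(3, 10)):
    """Quick-answer pattern: explicit (connect, read) timeout, catch, retry with backoff."""
    for n in range(attempts):
        try:
            resp = requests.get(url, timeout=timeout)
            resp.raise_for_status()
            return resp
        except requests.exceptions.Timeout:
            # exponential backoff plus jitter before the next try
            wait = (2 ** n) + random.uniform(0, 1)
            print(f"timeout on attempt {n + 1}, sleeping {wait:.1f}s")
            time.sleep(wait)
    return None  # caller decides what "gave up" means

resp = fetch_with_timeout("https://example.com")
if resp is not None:
    html = resp.text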
What is a timeout in Python Requests?
A timeout is basically your way of telling the HTTP client how long to wait before it gives up on a response. And do you know what you get without it? Hung scripts, that's what. The kind that just sit there, staring into the void, doing absolutely nothing because the server decided to take a smoke break. No one likes hung scripts, I promise you. Python won't bail out on its own unless you set the limit on how long it should stay patient.
Think of it like waiting for a buddy outside a bar. You agree to wait five minutes. If he doesn't show up, you bounce. That's a timeout. If you never set that rule, you might stay there half an hour, checking your phone and wondering what happened. (And yeah, my buddy is one of those people who is never on time; dude operates on his own timezone. His real-life timeout is like fifteen minutes minimum, otherwise I'd never see him.)
Same logic with HTTP. Networks act weird, servers get overloaded, routers sneeze, and suddenly your code is stuck. An explicit Requests timeout Python value keeps things sane. It forces your request to fail cleanly instead of freezing the whole script.
How Python Requests handles timeouts by default
It's important to keep in mind that, by default, Requests doesn't set a timeout for you. If you call requests.get(url) without a timeout=..., it can wait basically forever; and "forever" here means "until the underlying socket finally gives up". Depending on your OS, DNS resolver, proxy setup, or whatever chaos is happening in the network stack, that can take a long time. The Requests docs still say it straight: no timeout unless you set one. And yes, you got that right — no timeout, no boundaries, no mercy. Full Mad Max mode, but for sockets.
So, this is why the Python Requests default timeout is effectively "no timeout at all". The Python Requests timeout default isn't some hidden 5-second or 30-second safety net: it's literally "just keep waiting". If the server accepts the connection but never sends anything back, or a proxy decides to buffer your soul away, your script looks hung even though it's technically still waiting on I/O.
- In production, that behavior is a straight-up trap. One stuck request can pin a worker thread, block an async bridge, jam your connection pool, and slow down the whole service.
- In scraping workloads it's even worse: you're firing requests in loops, and one stalling endpoint can freeze an entire batch.
So the rule for grown-up code is simple: don't trust the default. Pick explicit timeout values, keep them in one place, and treat them as part of your standard. Predictable failure is always better than the classic "why is this job still running at 3 AM" nightmare.
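A minimal way to put that rule into practice (just a sketch; the constant name and values are arbitrary): define the default once at module level and pass it explicitly everywhere, so "forgetting the timeout" stops being a thing.

import requests

# One shared (connect, read) default for the whole project; tune the numbers to your traffic
DEFAULT_TIMEOUT = (3, 10)

resp = requests.get("https://example.com/api", timeout=DEFAULT_TIMEOUT)
resp.raise_for_status()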
Basic usage: Setting a timeout on requests.get and other methods
The easiest way to avoid hanging calls is to pass a timeout directly into the request. Whether you use requests.get timeout or any other verb, Requests treats a single number as both the connect timeout and the read timeout. With timeout=5, that means up to 5 seconds to establish the connection, and up to 5 seconds waiting for data (including the first byte, and also between chunks). For many simple API calls, that's all you really need.
Example using requests.get timeout:
import requests

# Send a GET request
resp = requests.get(
    "https://example.com",
    # Set a timeout:
    timeout=5,
)
That's you setting a timeout, nothing exotic. Let's break down what that 5 actually means:
- It's five seconds, not milliseconds, not minutes, not dog years.
- With a single number like timeout=5, Requests uses it as both:
  - connect timeout: up to 5s to establish the connection
  - read timeout: up to 5s waiting for the server to send data (including the first byte, and also any "nothing is arriving" gaps)
- If the connection can't be made in time, you'll usually get requests.exceptions.ConnectTimeout. If the connection is fine but the server goes silent too long, you'll usually get requests.exceptions.ReadTimeout. Both are under the requests.exceptions.Timeout umbrella, so either way, it bails instead of hanging forever.
So that one number is basically you telling the client: "Bro, you have five seconds. If nothing comes back, bail and let me handle it." This keeps the rest of your app moving even when the internet decides to implode creatively.
Same idea applies to POST and the rest:
payload = {"msg": "hey"}
resp = requests.post("https://api.example.com/submit", json=payload, timeout=10)
Typical timeout values:
- 2–3 seconds — fast, stable internal APIs.
- 5 seconds — balanced default for public endpoints.
- 10 seconds — slower servers or heavier processing.
The trade-off is simple: shorter timeouts keep your app quick but may fail on shaky networks, while longer timeouts are forgiving but can slow everything down if the server drags its feet.
Connect vs read timeouts in Python Requests
Timeouts aren't just one number. Requests lets you pass a tuple like timeout=(3, 10), where:
- The first value is the connect timeout. This covers the moment where your client is trying to open a socket. If the host/proxy is unreachable or the route is busted, this is where you want fast-fail.
  - Connect timeout is per IP attempt (IPv4/IPv6 can multiply perceived time).
  - Slow DNS resolution can still mess you up and isn't cleanly governed by the Requests timeout the way people assume.
- The second value is the read timeout. This kicks in after the connection is established. It's how long the client will wait between bytes coming back from the server, and in real life that usually means "how long until the first byte shows up."
This matters because a Python request timeout can happen in two totally different stages: failing to connect at all, or connecting fine but then waiting forever for the server to actually send something.
Example with a Python Requests get timeout tuple:
import requests
resp = requests.get("https://example.com/heavy", timeout=(2, 15))
Here you're saying: "Take up to two seconds to connect, and then up to fifteen seconds to actually return something."
A few more things to mention:
- Read timeout = how long Requests will wait between bytes coming from the server. In 99.9% of cases that mostly feels like "time to first byte," but it's really about gaps.
- Requests doesn't enforce a "total download took too long" timer; it times the idle gap between chunks/bytes.
- Tuple timeouts (timeout=(connect, read)) are recommended in production because they let you tune fast-fail connects separately from "be patient while it starts responding." (You'll still usually know connect vs read from the exception type.)
- Connect timeout should usually be quite short but not microscopic — you still don't want to waste your life waiting to learn the host is unreachable. A good starting point for internet-facing stuff is ~3 seconds (Requests even hints at "a bit more than a multiple of 3" because of TCP retransmit timing). If you go 1–2s, that's the aggressive setting — great for low-latency internal services, kinda spicy for the public internet.
- Also: connect timeout can effectively apply per IP address. If a hostname resolves to IPv4 + IPv6 (or multiple A/AAAA records), the client may try them one by one, so a "2s connect timeout" can feel like longer before it gives up.
- And yeah: none of this is strict wall-clock anyway as real elapsed time can be higher than your numbers.
- A bigger read timeout helps:
  - with slow proxies that connect fast but drip-feed data
  - when the server needs time before sending the first byte (heavy rendering, long SQL, slow upstream)
  - when APIs queue work and only respond once the job is done, so the read timeout needs breathing room
A Requests timeout Python tuple gives you real control: fast failure when the host is unreachable, and enough patience when the server is just busy instead of dead.
Handling Python Requests timeout exceptions cleanly
When a request timeout Python moment hits, Requests doesn't just blow up with a random stack trace. It raises specific exceptions so you know exactly where things stalled. The main ones are requests.exceptions.Timeout, plus the more targeted ConnectTimeout and ReadTimeout. These tell you whether the call died while connecting or while waiting for data.
Catching them is straightforward: wrap the call in a try/except block so your script doesn't faceplant. You can log it, retry, skip the item, or bail out gracefully when a Python Requests timeout exception shows up.
Example pattern:
import requests

url = "https://example.com/api"

try:
    resp = requests.get(url, timeout=(2, 8))
    resp.raise_for_status()
    data = resp.json()
except requests.exceptions.ConnectTimeout:
    print("connect timed out")
except requests.exceptions.ReadTimeout:
    print("server connected but took too long to send anything")
except requests.exceptions.Timeout:
    print("generic timeout hit")
except requests.exceptions.RequestException:
    print("request exception")
except Exception as e:
    print("other error:", e)
This covers the usual Python timeout exception cases: failed connects, slow reads, and the generic timeout umbrella. From here you can retry, log and skip, or move on to the next item. The idea is simple: don't let one slow endpoint drag your entire run into the abyss.
Setting default timeouts with sessions and adapters
If you're dropping timeout=... into every call, that gets old fast. A requests.Session() gives you connection reuse and cleaner code, but it won't magically apply a timeout for you.
So the grown-up move is: make one session that bakes in sane defaults.
- a default timeout (so you never "forget" it)
- Requests-native retries via HTTPAdapter + urllib3.Retry (so transient flakiness doesn't ruin your day)
That way your project's default isn't "maybe we remembered it", it's "this codebase always uses timeouts and retries, period."
One Session to rule them all: Default timeout + retries
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

class SafeSession(requests.Session):
    def __init__(self, timeout=(2, 8), retries=5):
        """
        timeout: (connect, read) seconds
        retries: total retry attempts for retryable failures
        """
        super().__init__()
        self._default_timeout = timeout

        retry = Retry(
            total=retries,
            backoff_factor=0.5,  # 0.5s, 1s, 2s, 4s... (capped by retries)
            status_forcelist=(429, 500, 502, 503, 504),
            allowed_methods=frozenset(["GET", "HEAD", "OPTIONS"]),  # keep it idempotent
            respect_retry_after_header=True,
            raise_on_status=False,  # after retries, return the last Response instead of raising
        )
        adapter = HTTPAdapter(max_retries=retry)
        self.mount("https://", adapter)
        self.mount("http://", adapter)

    def request(self, method, url, **kwargs):
        # Apply default timeout unless caller explicitly sets one
        kwargs.setdefault("timeout", self._default_timeout)
        return super().request(method, url, **kwargs)

session = SafeSession(timeout=(2, 8), retries=5)

# default timeout + retries apply automatically
resp = session.get("https://example.com/api")
resp.raise_for_status()

# override just for this call if you need extra breathing room
resp2 = session.get("https://example.com/heavy", timeout=(2, 20))
resp2.raise_for_status()
A couple notes:
- Timeouts stop hangs. Retries handle transient flakiness (dropped connections, 429s, 503s, etc.). You usually want both.
- Keep retries on idempotent methods (GET/HEAD/OPTIONS). Retrying POST/PUT can double-create / double-charge unless the API supports idempotency keys.
- status_forcelist is where you decide which HTTP codes are "probably temporary."
- Passing timeout=None disables timeouts entirely, so do that only if you really mean it.
Timeout best practices for real-world Python API calls
When you deal with real traffic, random network delays, and moody servers, picking the right Python Requests timeout values becomes part of your API design, not an afterthought. Different endpoints need different patience levels, and the trick is finding a balance between user experience, reliability, and not wasting resources.
Typical ranges that work well in production:
- Fast internal APIs: 1–3 seconds. These run inside your own infra, so if they're slow, something's broken and you want to know fast.
- Public REST APIs: 5–10 seconds. The internet does internet things, so you give it a bit more room to breathe.
- Heavy or batch-style endpoints: 10–20 seconds. Some services legitimately need more time before sending the first byte.
These aren't rules carved into stone; they're starting points. You adjust them by watching logs. If you're blowing through your requests timeout too often, maybe that endpoint is slow by design. If you never hit timeouts, maybe you're being too generous and letting workers stall longer than necessary.
Timeouts are also part of making API flows actually reliable. For a deeper look at stable patterns and retry logic, check out this guide on Python API calls. It ties in cleanly with what we're doing here: predictable failure, sane retries, and consistent defaults.
In short: set explicit values, monitor them, tune over time, and keep your Python request timeout logic solid across the whole codebase. That's how you build systems that don't wake you up at night.
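If you want those ranges to live in one place instead of being sprinkled across the codebase, one option (a sketch; the profile names and numbers are just an illustration, not official guidance) is a small mapping you pick from at call time:

import requests

# (connect, read) profiles per endpoint class; tune these from your own logs
TIMEOUT_PROFILES = {
    "internal": (1, 3),   # fast internal APIs
    "public": (3, 10),    # public REST endpoints
    "heavy": (3, 20),     # batch-style or slow-first-byte endpoints
}

def get_with_profile(url, profile="public"):
    return requests.get(url, timeout=TIMEOUT_PROFILES[profile])

resp = get_with_profile("https://api.example.com/users", profile="public")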
Handling timeouts and retries like a pro
Exponential backoff without looking like an amateur
If you retry a failing request instantly, you're not being "robust"; you're just smashing your head into the same dead server and hoping it suddenly rises from the grave. Real backoff is about giving the system room to breathe so you don't melt your own scraper or the backend you're hitting.
The idea is simple: each retry waits a bit longer than the previous one. The "pro" twist is adding jitter, because if every worker retries on the same schedule, they'll hammer the server again in perfect sync and make things worse.
The usual formula looks like this (it can be further adjusted, and there are different flavors, I'm just giving you an overall idea):
wait_n = (base * 2^n) + jitter
Where:
- base is your starting delay (like 1 second),
- n is the retry number (0, 1, 2...),
- jitter is a small random offset that breaks herd behavior.
The exponential part gives breathing room after each failure, while jitter spreads out retry timings so you're not part of a synchronized stampede. That's the difference between "I added retries" and "I actually know what I'm doing".
A typical pattern looks like this:
import random
import time

import requests

def exp_backoff_retry(url, attempts=4, base=1, connect_read_timeout=(2, 8)):
    # We're covering only timeouts here,
    # you might also want to call raise_for_status() for the response
    for n in range(attempts):
        try:
            return requests.get(url, timeout=connect_read_timeout)
        except requests.exceptions.Timeout:
            delay = base * (2 ** n)
            jitter = random.uniform(0, delay * 0.3)
            wait = delay + jitter
            print(f"timeout on attempt {n+1}, waiting {wait:.2f}s before retrying")
            time.sleep(wait)
    return None
What's happening here:
- 2 ** n creates the classic exponential curve.
- jitter prevents synchronized retries across workers.
- bounded attempts keep your scraper from drifting into multi-minute retry comas.
This is the kind of backoff that actually works instead of turning your retries into self-inflicted pain.
What to retry, and what not to retry
Not every failure deserves a retry. Some errors are the universe saying "try again," and some are the universe saying "stop, bro, this endpoint hates you."
What's generally safe to retry:
- timeouts — server is slow or the network hiccuped
- connection errors — DNS stumbles, dropped sockets, random network burps
- 429 / rate limits — the server literally asks you to chill
- 503 / temporary load issues — backend is overloaded, backoff (usually) helps
- flaky proxies — normal in scraping, proxies just spasm sometimes
These failures are usually transient. A retry with backoff makes sense.
What you should not retry:
- 401 / 403 — usually this means "you're not allowed here," but there's a nuance:
- if you're scraping and your IP is blocked, retrying with the same IP is probably pointless. Switching proxies or rotating IPs can make sense, but that's not a retry of the identical request, that's a different strategy.
- 404 — the page isn't there; retrying won't summon it from the void unless you're some kind of a wizard
- consistent 500s on static assets — the file is broken, not "momentarily shy"
- business-logic responses ("out of stock", "invalid input") — that's not an error, it's reality
- anything that clearly means "go away" — retries just waste time and annoy the host
Also, be careful retrying non-idempotent requests:
- Retrying GET/HEAD is usually fine as you're just re-fetching stuff.
- Retrying POST/PUT/PATCH can accidentally do the action twice (double charge, double create, double submit) if the first attempt actually reached the server but your client timed out.
- If you must retry writes, use an idempotency key (if the API supports it) or make the operation idempotent on the server side. Otherwise, keep retries for "safe" reads only.
The rule of thumb: retry what's unstable, don't retry what's intentional. If the server is saying "I can't," retry. If the server is saying "I won't," stop.
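If you prefer that rule of thumb in code form, here's a rough sketch (the helper name and the exact status sets are my own choices, so adjust them to your targets):

RETRYABLE_STATUSES = {429, 500, 502, 503, 504}   # "probably temporary"
NON_RETRYABLE_STATUSES = {401, 403, 404}         # "not allowed" / "not here"
SAFE_METHODS = {"GET", "HEAD", "OPTIONS"}        # idempotent, safe to repeat

def should_retry(method: str, status_code: int | None, was_timeout: bool) -> bool:
    """Retry transient failures on safe methods; never retry deliberate refusals."""
    if method.upper() not in SAFE_METHODS:
        return False  # retrying writes risks doing the action twice
    if was_timeout or status_code is None:
        return True   # timeouts and connection errors are usually transient
    if status_code in NON_RETRYABLE_STATUSES:
        return False
    return status_code in RETRYABLE_STATUSES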
Retry budget
Retries aren't free. Every extra attempt costs time, threads, bandwidth, and sometimes money if you scrape at scale. A retry budget is basically you deciding up front how much time or how many attempts you're willing to burn before you move on.
There are two budgets that matter:
- attempts budget — how many retries max. For example: "3 tries, then bail."
- time budget — how long you're willing to wait overall. For example: "this whole fetch has 12 seconds total, including retries and backoff."
Why it matters:
- a single slow endpoint can stall your entire pipeline if you let it run wild
- backoff grows quickly, so retries can balloon into long delays
- workers pile up if each of them waits too long, which can snowball into timeouts elsewhere
- scraping with proxies or paid APIs means retries literally cost money
- batch jobs drift into "still running after 20 minutes... why?" territory if you don't cap them
So the idea is simple: you don't let one stubborn request drag your script into a 2-minute timeout coma. You cap the pain. If the job blows through your retry budget, you log it, skip it, and keep the workflow moving.
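Here's one way to sketch both budgets together (the numbers are examples, not recommendations): a hard cap on attempts plus a wall-clock deadline that's checked before every sleep.

import random
import time

import requests

def fetch_with_budget(url, max_attempts=3, time_budget_s=12.0, timeout=(2, 8)):
    """Stop retrying when either the attempts budget or the time budget runs out."""
    deadline = time.monotonic() + time_budget_s
    for n in range(max_attempts):
        try:
            return requests.get(url, timeout=timeout)
        except requests.exceptions.Timeout:
            wait = (2 ** n) + random.uniform(0, 0.5)
            # If sleeping would blow the time budget, give up now instead
            if time.monotonic() + wait > deadline:
                break
            time.sleep(wait)
    return None  # log it, skip it, keep the workflow moving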
Smart retries for scraping
Plain retries are fine, but scraping gets weird fast. A "smart" retry means you don't just repeat the same request and hope for a different outcome; you change something between attempts.
Things worth adjusting on each retry:
- user-agent — some sites punish repeated requests with the same client signature
- proxy — switching the exit IP often fixes what looked like a timeout or a block
- headers or cookies — small changes can slip past basic filters or caching layers
- timeouts — a tiny bump in read timeout helps with heavy pages (but don't inflate it endlessly)
- fetch strategy — start with a normal GET, and if it keeps failing, retry with JS rendering or a full browser mode
This is the kind of retry logic that adapts to what the site is actually doing. Instead of hammering the same request over and over, you evolve the attempt. Scrapers that do this stay alive on pages where static, identical retries just keep dying.
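A minimal sketch of that idea, rotating the user-agent and proxy between attempts and giving the read timeout a small bump (the header strings and proxy URLs below are placeholders):

import random

import requests

USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",
    "Mozilla/5.0 (X11; Linux x86_64)",
]
PROXIES = [
    None,  # direct connection
    {"http": "http://proxy1.example.com:8080", "https": "http://proxy1.example.com:8080"},
]

def smart_fetch(url, attempts=3):
    connect, read = 2, 8
    for n in range(attempts):
        headers = {"User-Agent": random.choice(USER_AGENTS)}  # vary the client signature
        proxies = PROXIES[n % len(PROXIES)]                   # switch the exit IP between tries
        try:
            return requests.get(url, headers=headers, proxies=proxies,
                                timeout=(connect, read))
        except requests.exceptions.Timeout:
            read = min(read + 4, 20)  # small read-timeout bump for heavy pages, capped
    return None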
Use a retry package so you don't reinvent it badly
If you want retries to be consistent across a whole codebase, don't hand-roll loops everywhere. Use a library that already handles the weird corners. In Python, the usual choice is tenacity.
A typical pattern is: Requests handles timeouts, Tenacity handles retry policy.
import requests
from tenacity import (
    retry,
    stop_after_attempt,
    wait_exponential_jitter,
    retry_if_exception_type,
)

TIMEOUT = (2, 8)  # connect, read

@retry(
    retry=retry_if_exception_type(requests.exceptions.Timeout),
    wait=wait_exponential_jitter(initial=1, jitter=1, max=15),
    stop=stop_after_attempt(4),
    reraise=True,
)
def fetch(url: str) -> requests.Response:
    return requests.get(url, timeout=TIMEOUT)

try:
    resp = fetch("https://example.com/page")
    resp.raise_for_status()
    html = resp.text
except requests.exceptions.Timeout:
    # log + skip, or mark URL as failed
    print("timed out after retries")
In this example:
- your timeout is explicit
- retries use exponential backoff with jitter automatically
- you get a hard cap on attempts so nothing stalls forever
If you also want to retry based on the HTTP response (common in scraping), you can add a result-based retry. For example, retry on 429/503:
import email.utils
import random
from datetime import datetime, timezone

import requests
from tenacity import (
    retry,
    stop_after_attempt,
    wait_exponential_jitter,
    retry_if_exception_type,
)

TIMEOUT = (2, 8)  # connect, read

class RetryableHTTPError(Exception):
    """Raised for HTTP responses that should be retried (e.g., 429/503)."""

    def __init__(self, status_code: int, retry_after: float | None = None):
        super().__init__(f"retryable http status {status_code}")
        self.status_code = status_code
        self.retry_after = retry_after

def _parse_retry_after_seconds(resp: requests.Response) -> float | None:
    ra = resp.headers.get("Retry-After")
    if not ra:
        return None
    ra = ra.strip()

    # Retry-After: seconds
    if ra.isdigit():
        return max(0.0, float(ra))

    # Retry-After: HTTP-date
    try:
        dt = email.utils.parsedate_to_datetime(ra)
    except (TypeError, ValueError, OverflowError):
        return None
    if dt is None:
        return None

    # Some parsers can return naive datetimes; assume UTC for safety
    if dt.tzinfo is None:
        dt = dt.replace(tzinfo=timezone.utc)

    now = datetime.now(timezone.utc)
    seconds = (dt - now).total_seconds()
    return max(0.0, seconds)

_base_wait = wait_exponential_jitter(initial=1, jitter=1, max=15)

# You might want to adjust this logic further
# as theoretically the server might return a huge
# retry time (cap it!)
def _wait_with_retry_after(retry_state) -> float:
    """
    If server provides Retry-After, respect it (with a tiny jitter),
    otherwise fall back to exponential backoff with jitter.
    """
    wait_s = _base_wait(retry_state)

    exc = None
    if retry_state.outcome is not None:
        exc = retry_state.outcome.exception()

    retry_after = getattr(exc, "retry_after", None)
    if retry_after is None:
        return wait_s

    # add small jitter so a fleet doesn't retry in perfect sync
    retry_after_jittered = retry_after + random.uniform(0.0, min(1.0, retry_after * 0.1))

    # respect server instruction, but never be *less* conservative than backoff
    return max(wait_s, retry_after_jittered)

@retry(
    retry=retry_if_exception_type((
        requests.exceptions.Timeout,
        requests.exceptions.ConnectionError,
        RetryableHTTPError,
    )),
    wait=_wait_with_retry_after,
    stop=stop_after_attempt(5),
    reraise=True,
)
def fetch2(url: str) -> requests.Response:
    resp = requests.get(url, timeout=TIMEOUT)
    if resp.status_code in (429, 503):
        retry_after = _parse_retry_after_seconds(resp)
        # no sleeping here as Tenacity will wait using _wait_with_retry_after
        raise RetryableHTTPError(resp.status_code, retry_after=retry_after)
    resp.raise_for_status()
    return resp
That's the "pro" combo: timeouts keep requests from hanging, and a retry library keeps your retry logic predictable instead of vibes-based.
Timeouts, retries, proxies and resilient web scraping
Scraping isn't like calling neat little JSON APIs. Pages are heavy, proxies add lag, networks drift in and out of sanity, so a solid Requests Python timeout setup is mandatory. And just like everywhere else in this guide: don't brute-force retries, do proper backoff + jitter.
The first rule: timeouts are normal. A slow proxy, a clogged route, or a JS-heavy page can trigger a Python Requests get timeout at any moment. The right reaction isn't "panic and retry instantly," it's "retry with breathing room." A smarter retry loop looks like this:
import time
import random

import requests

url = "https://example.com/page"
timeout = 5
attempts = 3

for i in range(attempts):
    try:
        resp = requests.get(url, timeout=timeout)
        resp.raise_for_status()
        html = resp.text
        break
    except requests.exceptions.Timeout:
        # exponential backoff
        delay = 1 * (2 ** i)
        # jitter so workers don't sync-smack the server
        jitter = random.uniform(0, delay * 0.3)
        wait = delay + jitter
        print(f"timeout on attempt {i+1}, waiting {wait:.2f}s")
        time.sleep(wait)
        # tiny timeout bump only when it helps
        timeout = min(timeout + 2, 20)
else:
    print("failed after retries")
What's happening here:
- exponential backoff gives the site space to recover
- jitter makes retries land at different times
- a small timeout bump helps with slow proxies or first-byte delays
- the cap on attempts prevents the scraper from drifting into a multi-minute coma
Now, proxies. They can connect instantly but then drip data like a leaky faucet. Because of that:
- read timeouts often need more room behind proxies
- rotating proxies triggers more transient failures; expect more Requests timeout Python events
- some proxy nodes are simply garbage; retrying with the same one won't fix anything
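For the proxy case specifically, a short sketch (the proxy address and credentials are placeholders): same request, just routed through a proxy with a fast-fail connect and a more patient read timeout.

import requests

proxies = {
    "http": "http://user:pass@proxy.example.com:8080",
    "https": "http://user:pass@proxy.example.com:8080",
}

# Fail fast on connect, but give the proxy room to drip-feed the response
resp = requests.get("https://example.com/page", proxies=proxies, timeout=(3, 20))
resp.raise_for_status()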
A good primer on proxy basics is here: Python Requests proxy.
Putting it all together: explicit timeouts + backoff + jitter + proxy-aware tuning = a scraper that keeps crawling even when the web is throwing mood swings at you.
When timeouts are caused by the website, not your code
Sometimes your scraper hits a Python Requests timeout not because your code is weak, but because the site is straight-up messing with you. Plenty of websites use bot-protection tricks that make a normal scraper crawl or fail, and in those cases tweaking timeouts or retries won't save you as the server is intentionally slowing you down.
Common server-side shenanigans:
- aggressive bot filters that delay or drop responses
- IP blocking after a few hits
- CAPTCHAs that freeze the response until a human solves them
- geo-based throttling that slows clients from certain regions
- heavy JavaScript pages where HTML doesn't exist until a browser executes code
- giant content loads that choke slow connections
When this stuff kicks in, you'll often see repeated Requests timeout Python failures or a Python Requests timeout exception even with sane timeout values. That's your signal the server isn't "slow", it's pushing back.
At this point, throwing bigger timeouts or more retries at the wall usually won't help. The site wants a real browser, clean IP rotation, or full JS rendering. That's where a managed scraping layer like ScrapingBee comes in: those tools handle blocking, JavaScript execution, and IP pools so your scraper doesn't get stonewalled.
Using ScrapingBee's Web Scraping API to reduce timeout errors
A managed web scraping API helps when a site is basically fighting you. Instead of juggling proxies, CAPTCHAs, JS rendering, and retry storms, ScrapingBee handles that mess behind the curtain. Smart proxy rotation cuts down on blocked IPs, headless browser rendering handles heavy JavaScript, and built-in retries smooth out network noise. Result: far fewer timeout blowups compared to raw requests.get calls.
- For pages that need structured extraction, you can even pull data directly with CSS or XPath rules
- And if you need a visual snapshot of a page, the screenshot engine handles that too
Here's how you call it from Python in a few lines, no wrestling with Python Requests timeout or Requests get timeout logic yourself:
import requests

API_KEY = "YOUR_API_KEY"
url = "https://example.com/complex"

params = {
    "api_key": API_KEY,
    "url": url,
    # You might want to disable this initially and enable only
    # for JS-heavy websites as this param costs additional credits:
    "render_js": False,
    # Optionally, use ScrapingBee request timeout:
    # "timeout": 140000
    # If you set the ScrapingBee API timeout higher than your Requests client timeout,
    # your script may give up first even though ScrapingBee would've completed the fetch.
}

resp = requests.get("https://app.scrapingbee.com/api/v1/", params=params, timeout=10)
resp.raise_for_status()
html = resp.text
Note that when you call ScrapingBee from Python, there are two different timeouts involved, and they mean totally different things:
- ScrapingBee API timeout (milliseconds): params["timeout"] — how long ScrapingBee's servers are allowed to spend fetching/rendering the target page before they give up.
- Requests client timeout (requests.get(..., timeout=...)) — how long your script will wait for ScrapingBee to respond over the network.
You still control the timeout value, but most timeout-causing problems (IP bans, geo throttling, JS-rendered content) are handled upstream. That's the whole point: let the service absorb the chaos so your scraper stays simple, fast, and predictable.
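One way to keep the two in sync (a sketch; the numbers are placeholders): pick the ScrapingBee timeout first, then give your Requests client timeout a bit more room on top of it.

import requests

SCRAPINGBEE_TIMEOUT_MS = 30000  # how long ScrapingBee may spend on the target page
# Client read timeout: a little longer than the ScrapingBee timeout, so we don't give up first
CLIENT_TIMEOUT = (5, SCRAPINGBEE_TIMEOUT_MS / 1000 + 10)

params = {
    "api_key": "YOUR_API_KEY",
    "url": "https://example.com/complex",
    "timeout": SCRAPINGBEE_TIMEOUT_MS,
}

resp = requests.get("https://app.scrapingbee.com/api/v1/", params=params, timeout=CLIENT_TIMEOUT)
resp.raise_for_status()
html = resp.text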
Python web scraping with Requests vs ScrapingBee
Doing web scraping with Python using plain requests is perfect when the target pages are simple: no JS, no bot protection, no weird regional throttling. In those cases, a clean requests.get(url, timeout=5) and maybe an HTML parser is all you need. You tune your own Python Requests timeout, you control retries, and you avoid extra infrastructure.
But once the site gets defensive (CAPTCHAs, IP bans, slow JS, inconsistent latency) the DIY route stops being cute and starts being work. Suddenly you're juggling proxy lists, rotating user agents, retry loops, cookie jars, and constant timeout tweaks. And when the server purposely delays you, your python requests set timeout strategy turns into guesswork instead of control.
ScrapingBee flips that around. You keep using normal Python code, but all the heavy lifting moves out of your script: proxy rotation, JS rendering, anti-bot countermeasures, geo routing, and built-in retries. It behaves like an infrastructure layer that plugs into your existing scraper instead of replacing it.
When raw requests still makes sense:
- internal dashboards or intranet pages
- public sites with no JS and no blocking
- fast, predictable API-like endpoints
When a managed scraper saves real time:
- sites that block or throttle aggressively
- any page that needs JS to generate content
- large scraping jobs where reliability matters more than babysitting proxies
Bottom line: use Requests when the road is smooth, and drop in ScrapingBee when the road is full of traps. They work cleanly together, and you don't have to rewrite your scraper to benefit from it.
Low-code and workflow automation options with ScrapingBee
Not everyone on your team writes Python every day, but they still might need to run scraping workflows without hitting random Python timeout exception issues. ScrapingBee integrates smoothly with low-code automation tools, so non-devs can run scraping jobs, handle retries, and pass data downstream without touching proxies, browser sessions, or timeout tuning.
- ScrapingBee's integration with Make lets you drop scraping blocks into visual workflows — no code needed. You can trigger a scrape on a schedule, pipe results into spreadsheets or databases, send notifications, clean the data, or chain it into any other Make module. The platform handles the orchestration; ScrapingBee handles the scraping.
- On the n8n side, the ScrapingBee node works inside their drag-and-drop editor. You add a ScrapingBee step, set the URL, choose options like JS rendering or premium proxies, and then chain it to any of n8n's 1000+ nodes. You can schedule recurring jobs, transform and route the scraped data, branch workflows conditionally, and let n8n retry failed steps automatically, all without managing proxies or browser automation yourself.
These low-code options let teams automate scraping without reinventing timeout logic, retry patterns, or proxy handling. Your workflows simply call the ScrapingBee API, get reliable results, and pass them wherever they need to go. Consistent, maintainable, and friendly to everyone, not just the Python folks.
Ready to stop fighting Python timeout errors?
You already saw the essentials: set clear timeouts, split connect/read values, handle exceptions without blowing up your script, and add retries with backoff. That alone kills a ton of python requests timeout nonsense when the site is behaving like a normal adult.
But when the site isn't normal (heavy JS, anti-bot tricks, region throttling, IP bans) your perfectly tuned logic still eats dirt. That's where handing the ugly parts to a managed layer actually pays off. A solid web scraping api like ScrapingBee deals with the blockers you can't code around: rotating proxies, browser rendering, smart retries, and structured extraction.
And if you want something even simpler, the ScrapingBee AI Web Scraper can take a plain-language description ("grab the product name, price, images, and rating") and return clean JSON; no selectors, no scripts, no guessing. It handles JS, headless rendering, and extraction for you.
If timeouts keep dragging your scrapers down, offload the pain. Let the service wrestle with the chaos, and you focus on the part that actually moves your project forward.
Python Requests timeout FAQs
How do I set a timeout for requests.get in Python?
Pass a numeric value or a tuple: requests.get(url, timeout=5) or requests.get(url, timeout=(2, 10)).
What is the difference between connect timeout and read timeout in Python Requests?
- Connect timeout is how long the client waits to establish the TCP connection to the server (or proxy).
- Read timeout is how long the client will wait between bytes while receiving the response. This includes the wait for the first byte and any stalls mid-download (so it often feels like "time-to-first-byte," but it's really "gap between chunks").
Does Python Requests have a default timeout if I don't set one?
No. Without an explicit timeout, the call can wait indefinitely.
How should I handle a Python Requests timeout exception without crashing my script?
Wrap the request in try/except and catch requests.exceptions.Timeout (plus the specific ConnectTimeout and ReadTimeout). Log it, retry if needed, or skip the item.
What are good timeout values for Python web scraping and external APIs?
- Fast internal APIs: ~2–3s
- Public APIs: ~5–10s
- Heavy pages or slow servers: ~10–20s
When should I switch from plain Python Requests to a web scraping API like ScrapingBee to avoid timeouts?
When pages rely on JS, use bot protection, throttle regions, or block your IPs. A managed layer like ScrapingBee handles rendering, proxies, retries, and reduces timeout failures dramatically.

