CWE-770: Resource Allocation Without Limits - Throttling and Rate Limiting Best Practices
Overview
CWE-770 (Allocation of Resources Without Limits or Throttling) describes a weakness in which an application allocates resources without imposing limits on how much a user can consume, which can lead to various security issues, including Denial of Service (DoS) attacks. When applications fail to cap the number of requests or resources per user, malicious users can overwhelm the system. By implementing throttling and rate limiting, developers can control the flow of requests to their applications, ensuring that they remain responsive and stable under high load.
Throttling and rate limiting are essential techniques for managing traffic in any application that interacts with external clients, such as APIs, web services, and even internal microservices. These techniques help prevent abuse and ensure fair usage among all users, thereby improving the overall user experience and system reliability. Real-world use cases include API rate limiting, where service providers restrict the number of requests a client can make over a specific period, and throttling in user interfaces to prevent excessive actions like rapid clicks or form submissions.
Prerequisites
- Basic Programming Knowledge: Familiarity with programming concepts and languages such as JavaScript or Python.
- Understanding of HTTP Protocol: Knowledge of how HTTP requests and responses work.
- Web Application Architecture: Basic understanding of client-server architecture and web application frameworks.
- Familiarity with APIs: Understanding how to interact with APIs and the significance of request limits.
What is Throttling?
Throttling is a technique used to control the amount of traffic sent or received by a system. It helps manage the rate at which requests are processed, ensuring that the system does not become overwhelmed by too many simultaneous requests. By implementing throttling, developers can protect their applications from performance degradation and potential outages caused by excessive load.
Throttling can be implemented in various ways, such as limiting the number of requests a user can make in a specified time frame or dynamically adjusting the request processing rate based on the current load on the system. This adaptive approach helps maintain optimal performance while ensuring that all users receive fair access to resources.
class Throttler {
  constructor(limit, interval) {
    this.limit = limit;       // maximum requests per window
    this.interval = interval; // window length in milliseconds
    this.requests = [];       // timestamps of recent requests
  }

  isAllowed() {
    const now = Date.now();
    // Drop timestamps that have aged out of the window
    this.requests = this.requests.filter(requestTime => now - requestTime < this.interval);
    if (this.requests.length < this.limit) {
      this.requests.push(now);
      return true;
    }
    return false;
  }
}

const throttler = new Throttler(5, 10000); // 5 requests per 10 seconds

const checkRequest = () => {
  if (throttler.isAllowed()) {
    console.log('Request allowed');
  } else {
    console.log('Request denied');
  }
};

// Simulating requests
setInterval(checkRequest, 1000); // Check every second

This Throttler class demonstrates a simple throttling mechanism:
- constructor(limit, interval): Initializes the throttler with a request limit and time interval.
- isAllowed(): Checks if a new request can be processed. It filters out requests that are older than the specified interval and determines if the request limit has been reached.
- requests: An array that stores the timestamps of the requests made.
The expected output shows 'Request allowed' for the first five requests; further requests within the same 10-second window are denied, and requests become allowed again as the oldest timestamps age out of the window.
Advanced Throttling Techniques
Advanced throttling techniques include dynamic throttling, where the system adjusts the limits based on real-time load conditions. For example, if the server is under heavy load, it can temporarily reduce the request limit for all users or specific users exhibiting abusive behavior. This technique ensures that while the system remains responsive, users do not experience degraded service due to unexpected spikes in traffic.
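The dynamic throttling described above can be sketched as a throttler whose limit shrinks as load rises. This is a minimal sketch, not a prescribed design: the `DynamicThrottler` name, the injected `loadFn`, and the 0.7/0.9 thresholds are illustrative assumptions.

```javascript
// Sketch of dynamic throttling: the request limit shrinks as a load
// metric (e.g. event-loop lag or CPU utilization) rises. The loadFn
// callback and the thresholds below are illustrative assumptions.
class DynamicThrottler {
  constructor(baseLimit, interval, loadFn) {
    this.baseLimit = baseLimit; // limit under normal load
    this.interval = interval;   // window length in milliseconds
    this.loadFn = loadFn;       // returns current load in [0, 1]
    this.requests = [];         // timestamps of recent allowed requests
  }

  currentLimit() {
    const load = this.loadFn();
    if (load > 0.9) return Math.max(1, Math.floor(this.baseLimit * 0.25));
    if (load > 0.7) return Math.floor(this.baseLimit * 0.5);
    return this.baseLimit;
  }

  isAllowed() {
    const now = Date.now();
    this.requests = this.requests.filter(t => now - t < this.interval);
    if (this.requests.length < this.currentLimit()) {
      this.requests.push(now);
      return true;
    }
    return false;
  }
}
```

Injecting the load metric as a function keeps the throttler testable and lets it react immediately to load spikes without any reconfiguration step.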
What is Rate Limiting?
Rate limiting is a specific form of throttling that restricts the number of requests a user can make to a system over a defined period. Unlike general throttling, which can apply to overall system traffic, rate limiting is user-specific, allowing developers to allocate resources fairly among all users. This practice is especially important for APIs and web services that serve multiple clients.
Implementing rate limiting helps mitigate the risk of resource exhaustion, which can lead to system outages. By enforcing limits, developers can ensure that all users have equitable access to resources, thereby enhancing the user experience and system integrity.
const rateLimit = (limit, interval) => {
  const requests = {};
  return (req, res, next) => {
    const userId = req.ip; // using IP as user identifier
    const now = Date.now();
    if (!requests[userId]) {
      requests[userId] = [];
    }
    // Keep only timestamps inside the current window
    requests[userId] = requests[userId].filter(requestTime => now - requestTime < interval);
    if (requests[userId].length < limit) {
      requests[userId].push(now);
      next(); // Allow the request
    } else {
      res.status(429).send('Too Many Requests'); // Deny the request
    }
  };
};

// Express.js example usage
const express = require('express');
const app = express();

app.use(rateLimit(5, 10000)); // 5 requests per 10 seconds per user

app.get('/', (req, res) => {
  res.send('Hello, World!');
});

app.listen(3000, () => {
  console.log('Server running on port 3000');
});

This rate limiting middleware for an Express.js application works as follows:
- rateLimit(limit, interval): A factory function that returns a middleware function to limit requests.
- requests: An object that tracks the timestamps of requests for each user identified by their IP address.
- next(): Calls the next middleware if the request is allowed.
- res.status(429): Sends a 'Too Many Requests' response if the limit is exceeded.
The expected behavior is that each user can make up to five requests within a 10-second window. Exceeding this limit results in a 429 status code.
Token Bucket Algorithm
The Token Bucket algorithm is a popular rate limiting technique that allows a burst of requests while maintaining an average rate over time. This is particularly useful for applications where users may experience sporadic high traffic but should not exceed a defined average rate. Tokens are added to the bucket at a fixed rate, and each request consumes a token. If the bucket is empty, additional requests are denied until tokens are replenished.
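The token bucket described above can be sketched as follows. The class name, the injectable clock, and the parameter choices are illustrative assumptions, not a reference implementation.

```javascript
// Minimal token bucket sketch. Tokens refill continuously at `refillRate`
// per second up to `capacity`; each request consumes one token, so bursts
// up to `capacity` are allowed while the long-run average stays bounded.
class TokenBucket {
  constructor(capacity, refillRate, now = Date.now) {
    this.capacity = capacity;     // maximum burst size
    this.refillRate = refillRate; // tokens added per second
    this.tokens = capacity;       // start with a full bucket
    this.now = now;               // injectable clock, useful for testing
    this.lastRefill = this.now();
  }

  refill() {
    const current = this.now();
    const elapsedSeconds = (current - this.lastRefill) / 1000;
    this.tokens = Math.min(this.capacity, this.tokens + elapsedSeconds * this.refillRate);
    this.lastRefill = current;
  }

  tryConsume() {
    this.refill();
    if (this.tokens >= 1) {
      this.tokens -= 1;
      return true; // request allowed
    }
    return false;  // bucket empty: request denied until tokens refill
  }
}

// A burst of 5 requests drains a 5-token bucket; the 6th is denied.
let fakeTime = 0;
const bucket = new TokenBucket(5, 1, () => fakeTime); // 5-token burst, 1 token/second
const burst = Array.from({ length: 6 }, () => bucket.tryConsume());
console.log(burst); // [true, true, true, true, true, false]
fakeTime += 2000;   // two seconds later, ~2 tokens have been replenished
console.log(bucket.tryConsume()); // true
```

Refilling lazily on each call (rather than with a timer) keeps the bucket cheap: state updates only when a request actually arrives.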
Edge Cases & Gotchas
When implementing throttling and rate limiting, developers must be aware of several common pitfalls:
- Using Client-Side Throttling: Relying solely on client-side mechanisms to control request rates can be easily bypassed by malicious users. Always implement server-side protections.
- Ignoring Time Zones: Epoch timestamps such as Date.now() are time-zone-agnostic, but limits that reset at calendar boundaries (for example, daily quotas) are not. Define reset boundaries in UTC so every user sees the same window.
- Inadequate Testing: Failing to test the throttling and rate limiting under stress can lead to unexpected failures in production. Always include load testing in your deployment process.
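To make the time-zone gotcha above concrete, here is a small sketch of a daily quota keyed by the UTC calendar date. The function name and in-memory store are illustrative assumptions.

```javascript
// Illustrative daily-quota check keyed by the UTC calendar date, so the
// reset boundary is identical for every user regardless of local time zone.
const dailyCounts = {};

function allowDaily(userId, limit, date = new Date()) {
  const day = date.toISOString().slice(0, 10); // e.g. "2024-01-15", always UTC
  const key = `${userId}:${day}`;
  dailyCounts[key] = (dailyCounts[key] || 0) + 1;
  return dailyCounts[key] <= limit;
}
```

Keying on a local-time date instead would let users near a time-zone boundary double their quota by straddling two "days".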
Incorrect vs. Correct Implementation
// Incorrect implementation: Using a simple counter without time checks
let requestCount = 0;
const maxRequests = 5;
const resetTime = 10000; // intended window, but never actually used
const checkRequest = () => {
  requestCount++;
  if (requestCount > maxRequests) {
    console.log('Request denied');
  } else {
    console.log('Request allowed');
  }
};
setInterval(checkRequest, 1000); // Count requests every second

The incorrect implementation above never resets its counter, so the time window is ignored entirely: after the first five requests, every subsequent request is denied forever, no matter how much time has passed.
// Correct implementation: Using timestamps to track request timing
const maxRequests = 5;
const resetTime = 10000; // 10-second sliding window
const requests = [];
const checkRequest = () => {
  const now = Date.now();
  // Remove requests older than the reset time
  while (requests.length > 0 && (now - requests[0]) >= resetTime) {
    requests.shift();
  }
  if (requests.length < maxRequests) {
    requests.push(now); // record only allowed requests
    console.log('Request allowed');
  } else {
    console.log('Request denied');
  }
};
setInterval(checkRequest, 1000); // Check every second

The correct implementation tracks request timestamps, evicts entries older than the time window, and records a timestamp only when a request is allowed, so the limit is enforced over a true sliding window and denied requests do not extend the lockout.
Performance & Best Practices
Implementing throttling and rate limiting can introduce overhead to your application. Here are some best practices to minimize performance impacts:
- Optimize Data Structures: Use efficient data structures, such as circular buffers or heaps, to store request timestamps for quick access and removal.
- Use Caching: If applicable, cache the request counts and timestamps to avoid recalculating them for every request.
- Monitor Performance: Continuously monitor the performance of your throttling and rate limiting mechanisms, adjusting parameters based on real-world usage patterns.
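The "circular buffers" suggestion above can be sketched as a sliding-window limiter backed by a fixed-size ring buffer. The class name and API are illustrative assumptions; the point is that since at most `limit` timestamps ever matter, a preallocated buffer of that size avoids the per-request array filtering and allocation of the earlier examples.

```javascript
// Sliding-window limiter backed by a fixed-size ring buffer: no per-request
// array allocation; eviction costs O(number of expired entries).
class RingBufferLimiter {
  constructor(limit, interval) {
    this.limit = limit;
    this.interval = interval;
    this.buffer = new Float64Array(limit); // timestamps, reused in place
    this.head = 0;   // index of the oldest stored timestamp
    this.count = 0;  // number of timestamps currently in the window
  }

  isAllowed(now = Date.now()) {
    // Evict timestamps that have aged out of the window
    while (this.count > 0 && now - this.buffer[this.head] >= this.interval) {
      this.head = (this.head + 1) % this.limit;
      this.count--;
    }
    if (this.count < this.limit) {
      const tail = (this.head + this.count) % this.limit;
      this.buffer[tail] = now;
      this.count++;
      return true;
    }
    return false;
  }
}
```

Because allowed timestamps are strictly increasing, the buffer always holds them in order, so evicting from the head is sufficient.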
Measuring Performance
When evaluating the performance of your throttling and rate limiting mechanisms, it is important to measure their impact on response times and system throughput. Use tools like JMeter or Gatling to simulate load and analyze how your application behaves under stress. Monitor metrics such as:
- Average Response Time
- Error Rates (e.g., 429 Too Many Requests)
- Throughput (requests per second)
Real-World Scenario
In this section, we will create a simple Express.js API that implements both throttling and rate limiting. The API will allow users to register and submit messages, with rate limits imposed on message submissions to prevent abuse.
const express = require('express');
const app = express();
app.use(express.json());

// rateLimit is the middleware factory defined in the rate limiting section above
const messageRateLimit = rateLimit(3, 10000); // 3 messages per 10 seconds
const messages = [];

app.post('/register', (req, res) => {
  res.send('User registered');
});

app.post('/submit-message', messageRateLimit, (req, res) => {
  const message = req.body.message;
  messages.push(message);
  res.send('Message submitted');
});

app.get('/messages', (req, res) => {
  res.json(messages);
});

app.listen(3000, () => {
  console.log('API running on port 3000');
});

This API demonstrates the following functionalities:
- POST /register: A simple endpoint to register a user.
- POST /submit-message: This endpoint is rate-limited to allow only three messages every 10 seconds per user.
- GET /messages: Returns all submitted messages.
After running the API, users can register and submit messages while adhering to the imposed rate limits.
Conclusion
- Implementing throttling and rate limiting is crucial for preventing resource exhaustion and ensuring system stability.
- Understanding the differences between throttling and rate limiting helps in applying the right technique for various scenarios.
- Monitoring performance and adjusting parameters based on real-world usage patterns is essential for maintaining an optimal user experience.
- Testing and validating implementations under load can prevent unexpected failures in production.
- Consider advanced techniques like token buckets for more flexible rate limiting strategies.