Input Sanitization / Output Encoding

Injection vulnerabilities — SQL injection, cross-site scripting (XSS), command injection, LDAP injection, log injection — have been a perennial top category of the OWASP Top 10 since its first edition, ranked #1 in 2010, 2013, and 2017 and A03 in 2021. They all share the same root cause: untrusted data crosses a trust boundary and is interpreted as code, query logic, or markup because the system failed to distinguish data from instructions.

Input sanitization and output encoding address the two sides of this problem. On the input side, validation rejects data that does not conform to expected schemas. On the output side, encoding ensures that when data is rendered in a specific context — HTML page, SQL query, shell command, JSON response — it is treated as literal data, never as executable content.

How It Works

Define strict input schemas for every entry point: expected type, length, format, character set, and value range. Reject anything that does not conform (allowlist approach), rather than trying to strip known-bad patterns (denylist approach).
Use parameterized queries (prepared statements) for all database access — never concatenate user input into SQL, NoSQL, or LDAP query strings.
Apply context-specific output encoding at every rendering boundary: HTML entity encoding for web pages, URL encoding for query parameters, JavaScript string escaping for inline scripts, shell escaping for OS commands.
Validate and sanitize file uploads: check MIME type, file extension, file size, and (where possible) file content; store uploads outside the web root with randomized names.
Sanitize structured input (HTML, Markdown, SVG) with a well-maintained allowlist sanitizer library rather than custom regex.
Log sanitization events — rejected inputs, encoding failures — to support incident detection and tuning of validation rules.

Failure Modes

Denylist-based validation (stripping <script> tags, blocking DROP TABLE) is trivially bypassed by encoding variations, case changes, or novel attack patterns.
Validation at the API gateway but not at the service layer: a direct service-to-service call bypasses the gateway and delivers unsanitized input.
Output encoding applied in the wrong context — for example, HTML-encoding data that is injected into a JavaScript string literal, where JavaScript escaping is required.
Double encoding: data is encoded twice, producing garbled output for legitimate users while potentially still allowing attacks through decode-then-interpret chains.
Rich-text fields allow overly permissive HTML, so stored XSS activates when other users view the content.

Verification

Automated DAST (Dynamic Application Security Testing): run an injection scanner (OWASP ZAP, Burp Suite) against all public endpoints and verify zero high-severity findings for SQLi, XSS, command injection, and LDAP injection.
Parameterized-query audit: static analysis scan of all database access code to verify that no query is constructed by string concatenation with user input.
Context-encoding review: for each output context (HTML body, HTML attribute, JavaScript, CSS, URL, SQL, shell), verify that the correct encoding function is applied and that no raw user data reaches the renderer.
Fuzzing: send malformed, oversized, and boundary-case inputs to all entry points and verify the system returns appropriate validation errors without crashing, leaking stack traces, or accepting the payload.
Upload verification: attempt to upload executable files with spoofed MIME types and verify they are rejected or stored safely.

Content Security Policy (CSP) headers provide a browser-side defense-in-depth layer against XSS by restricting script sources, even if output encoding fails.
Web Application Firewalls (WAF) add a network-level validation layer but should not replace application-level sanitization.
Least Privilege limits the damage of a successful injection: even if an attacker executes a query, the database user has only the minimum required permissions.
Strong Authentication (MFA / OIDC) protects against credential theft that could be achieved through phishing pages injected via XSS.
Encryption at Rest + in Transit protects data confidentiality but does not prevent injection — encrypted malicious input is still malicious after decryption.

References

OWASP Top 10 (2021) — A03:2021 Injection remains a top-three risk category
OWASP Application Security Verification Standard (ASVS) — V5 (Validation, Sanitization and Encoding)
OWASP Cheat Sheet: Input Validation — practical guidance for allowlist validation
NIST SP 800-53: Security and Privacy Controls — SI-10 (Information Input Validation)

Input Sanitization / Output Encoding

Intent

Mechanism

Applicability

How It Works

Failure Modes

Verification

References

Supported Qualities

Trade-offs

Related Requirements

Intent

Mechanism

Applicability

How It Works

Failure Modes

Verification

Variants and Related Tactics

References

Supported Qualities

Trade-offs

Related Requirements