Lab 5: Security Analyzer

Work on this exercise locally

This web app is a reference guide — you can read instructions, browse starter code, and view tests here. To actually complete the exercise, you need to work in your local development environment.

1Clone the repo: git clone https://github.com/weihaoqu/program-analysis-bootcamp-student

2Edit the starter file in your editor (VS Code, Vim, etc.) — replace failwith "TODO" with your implementation.

3Run the tests: dune runtest labs/lab5-security-analyzer

Lab 5: Security Analyzer

Overview

In this lab you'll build a complete security analyzer that detects OWASP vulnerability patterns (SQL injection, XSS, command injection, path traversal) using taint analysis. Your analyzer propagates taint from untrusted sources through computations, checks for tainted data at security-sensitive sinks, and produces formatted vulnerability reports.

Learning Objectives

Define security configurations mapping sources, sinks, and sanitizers
Build a forward taint propagation engine using abstract transfer functions
Detect vulnerabilities where tainted data reaches sinks without sanitization
Format and report detected vulnerabilities with severity ratings
Analyze the precision and limitations of taint-based security analysis

Structure

Part	Points	Description
A	35	Taint Analyzer (security_config.ml, taint_analyzer.ml)
B	40	Vuln Checker + Reporter (vuln_checker.ml, vuln_reporter.ml)
C	25	Security Audit Report (analysis_report.md)

Getting Started

# Build
dune build

# Run your tests
dune runtest

Part A: Taint Analyzer (35 points)

security_config.ml (10 points)

Define the security configuration:

default_config: A web security configuration with:
- Sources: get_param, read_cookie, read_input, read_file, get_header
- Sinks: exec_query (sql-injection), send_response (xss), exec_cmd (command-injection), open_file (path-traversal), redirect (open-redirect)
- Sanitizers: escape_sql, html_encode, shell_escape, validate_path, validate_url
is_source, find_sink, find_sanitizer: Lookup helpers

taint_analyzer.ml (25 points)

Implement the taint propagation engine:

eval_expr: Evaluate expressions using taint rules:
- Literals → Untainted
- Variables → lookup in env
- BinOp → propagate taint from both operands
- Source calls → Tainted
- Sanitizer calls → Untainted
- Unknown calls → Top
transfer_stmt: Transfer statements through taint environment:
- Assign: evaluate RHS, update env
- If: transfer both branches, join
- While: fixpoint with widening
- Return/Print: env unchanged
analyze_function: Initialize params to Top, transfer body

Part B: Vuln Checker + Reporter (40 points)

vuln_checker.ml (25 points)

Detect vulnerabilities at sink calls:

check_call: Check if a call is a sink with tainted arguments
check_stmt / check_stmts: Walk statements, threading env and collecting vulnerabilities
check_function / check_program: Entry points

vuln_reporter.ml (15 points)

Format vulnerability reports:

severity_of_vuln_type: Map vulnerability types to severity levels
format_vulnerability: Format as [SEVERITY] type in location: message (var, sink)
format_summary: Summary with count or "No vulnerabilities found."
group_by_type: Count vulnerabilities by type

Part C: Security Audit Report (25 points)

Write analysis_report.md documenting:

Analyze 5 programs showing taint flow step-by-step
For each program, show whether vulnerabilities are detected or the program is safe
Discuss the precision, limitations, and trade-offs of your analyzer
Compare with real-world tools (Semgrep, CodeQL, etc.)

Dependencies

abstract_domains (ABSTRACT_DOMAIN module type and MakeEnv functor)
shared_ast (AST types)

Tips

Start with Part A -- the analyzer is the foundation for Part B
Test with simple programs first (source → sink, then add sanitizers, then branches)
The taint_domain.ml file is provided -- focus on the analyzer logic
For Part C, trace through programs by hand to verify your analyzer's output

Starter Files

starter

starter/tests

Test Files

tests

starter/security_config.ml

Read-only

Loading editor...

Work on this exercise locally

1Clone the repo: git clone https://github.com/weihaoqu/program-analysis-bootcamp-student

2Edit the starter file in your editor (VS Code, Vim, etc.) — replace failwith "TODO" with your implementation.

3Run the tests: dune runtest labs/lab5-security-analyzer

Lab 5: Security Analyzer

Overview

Learning Objectives

Define security configurations mapping sources, sinks, and sanitizers
Build a forward taint propagation engine using abstract transfer functions
Detect vulnerabilities where tainted data reaches sinks without sanitization
Format and report detected vulnerabilities with severity ratings
Analyze the precision and limitations of taint-based security analysis

Structure

Part	Points	Description
A	35	Taint Analyzer (security_config.ml, taint_analyzer.ml)
B	40	Vuln Checker + Reporter (vuln_checker.ml, vuln_reporter.ml)
C	25	Security Audit Report (analysis_report.md)

Getting Started

# Build
dune build

# Run your tests
dune runtest

Part A: Taint Analyzer (35 points)

security_config.ml (10 points)

Define the security configuration:

default_config: A web security configuration with:
- Sources: get_param, read_cookie, read_input, read_file, get_header
- Sinks: exec_query (sql-injection), send_response (xss), exec_cmd (command-injection), open_file (path-traversal), redirect (open-redirect)
- Sanitizers: escape_sql, html_encode, shell_escape, validate_path, validate_url
is_source, find_sink, find_sanitizer: Lookup helpers

taint_analyzer.ml (25 points)

Implement the taint propagation engine:

eval_expr: Evaluate expressions using taint rules:
- Literals → Untainted
- Variables → lookup in env
- BinOp → propagate taint from both operands
- Source calls → Tainted
- Sanitizer calls → Untainted
- Unknown calls → Top
transfer_stmt: Transfer statements through taint environment:
- Assign: evaluate RHS, update env
- If: transfer both branches, join
- While: fixpoint with widening
- Return/Print: env unchanged
analyze_function: Initialize params to Top, transfer body

Part B: Vuln Checker + Reporter (40 points)

vuln_checker.ml (25 points)

Detect vulnerabilities at sink calls:

check_call: Check if a call is a sink with tainted arguments
check_stmt / check_stmts: Walk statements, threading env and collecting vulnerabilities
check_function / check_program: Entry points

vuln_reporter.ml (15 points)

Format vulnerability reports:

severity_of_vuln_type: Map vulnerability types to severity levels
format_vulnerability: Format as [SEVERITY] type in location: message (var, sink)
format_summary: Summary with count or "No vulnerabilities found."
group_by_type: Count vulnerabilities by type

Part C: Security Audit Report (25 points)

Write analysis_report.md documenting:

Analyze 5 programs showing taint flow step-by-step
For each program, show whether vulnerabilities are detected or the program is safe
Discuss the precision, limitations, and trade-offs of your analyzer
Compare with real-world tools (Semgrep, CodeQL, etc.)

Dependencies

abstract_domains (ABSTRACT_DOMAIN module type and MakeEnv functor)
shared_ast (AST types)

Tips

Start with Part A -- the analyzer is the foundation for Part B
Test with simple programs first (source → sink, then add sanitizers, then branches)
The taint_domain.ml file is provided -- focus on the analyzer logic
For Part C, trace through programs by hand to verify your analyzer's output