Checklists/AI Security/LLM Application Security

LLM Application Security Checklist

28 security checks for LLM-powered applications covering prompt injection, data leakage, output validation, RAG poisoning, API security, and agent framework hardening.

๐Ÿšจ LLM Application Threat Landscape

LLM-powered applications introduce attack surfaces that traditional security testing misses entirely:

โ€ข OWASP Top 10 for LLMs prompt injection is #1 risk (OWASP 2025)

โ€ข 77% of companies have no LLM security policy (Gartner)

โ€ข 90% of jailbreaks succeed on first attempt against unprotected apps (NVIDIA)

โ€ข $4.88M average cost of an AI-related data breach (IBM 2025)

โ€ข 56% of LLM apps leak sensitive data through outputs (Lakera)

โ€ข 3.2x more attack surface in RAG applications vs traditional LLM apps

๐Ÿ“„
Want this as a PDF? DM me on Telegram โ†’
Showing 28 of 28 vulnerabilities
#1

Direct Prompt Injection Defense

Critical
โ–ผ

User inputs cannot override system prompts or modify LLM behavior directives

#2

Indirect Prompt Injection Resistance

Critical
โ–ผ

External data sources (web pages, documents, emails) cannot inject instructions into the LLM

#3

Jailbreak Resistance

Critical
โ–ผ

Application resists role-play, encoding, and multi-turn jailbreak techniques

#4

System Prompt Protection

High
โ–ผ

System prompts cannot be extracted or leaked through conversational techniques

#5

PII Detection & Redaction

Critical
โ–ผ

Personally identifiable information is detected and redacted before reaching the LLM

#6

Training Data Extraction Prevention

Critical
โ–ผ

Model cannot be prompted to reveal memorized training data or fine-tuning examples

#7

Conversation History Isolation

High
โ–ผ

User sessions are isolated โ€” no cross-contamination of conversation data between users

#8

Log Sanitization

High
โ–ผ

Application logs do not contain sensitive prompts, user data, or model responses in plaintext

#9

Hallucination Detection

High
โ–ผ

Critical outputs are validated against ground truth โ€” hallucinated facts flagged before reaching users

#10

Code Execution Output Sanitization

Critical
โ–ผ

LLM-generated code is sandboxed and validated before execution โ€” no arbitrary code runs unsupervised

#11

Structured Output Validation

High
โ–ผ

JSON, SQL, and other structured outputs are validated against schemas before downstream use

#12

Toxic Content Filtering

High
โ–ผ

Model outputs screened for harmful, biased, or inappropriate content before delivery

#13

RAG Document Poisoning Prevention

Critical
โ–ผ

Documents ingested into the knowledge base are scanned for embedded injection payloads

#14

Retrieval Access Control

High
โ–ผ

RAG retrieval respects document-level permissions โ€” users only see content they're authorized to access

#15

Embedding Inversion Protection

High
โ–ผ

Vector embeddings cannot be reverse-engineered to reconstruct original document content

#16

Source Attribution Integrity

Medium
โ–ผ

Retrieved sources are accurately cited โ€” no fabricated or manipulated source references

#17

Rate Limiting & Abuse Prevention

High
โ–ผ

API endpoints enforce per-user rate limits to prevent abuse, cost attacks, and resource exhaustion

#18

Input Length & Complexity Limits

High
โ–ผ

Maximum input token counts enforced โ€” no context window exhaustion attacks

#19

Model API Key Security

Critical
โ–ผ

LLM provider API keys are never exposed client-side โ€” all calls proxied through backend

#20

Streaming Response Security

Medium
โ–ผ

SSE/WebSocket streaming responses validated incrementally โ€” no mid-stream injection

#21

Tool Call Authorization

Critical
โ–ผ

LLM tool/function calls validated against permission boundaries before execution

#22

Tool Output Sanitization

Critical
โ–ผ

Data returned from tool calls sanitized before re-entering the LLM context

#23

Autonomous Action Limits

High
โ–ผ

Agent loop iterations capped โ€” no unbounded recursive tool calling or infinite loops

#24

Multi-Agent Trust Boundaries

High
โ–ผ

In multi-agent systems, agents cannot escalate privileges or manipulate other agents' instructions

#25

Role-Based Output Filtering

High
โ–ผ

Model responses filtered based on user role โ€” admins see full data, regular users see redacted versions

#26

Session Token Security

High
โ–ผ

Conversation session tokens are cryptographically secure and expire appropriately

#27

Prompt Injection Detection Logging

High
โ–ผ

Suspected injection attempts are logged and trigger security alerts

#28

Cost & Usage Anomaly Detection

Medium
โ–ผ

Unusual spikes in token usage, API calls, or compute costs trigger automated alerts

Need an LLM Application Security Audit?

Zealynx tests LLM applications against real-world attack patterns โ€” prompt injection, data leakage, jailbreaks, and RAG poisoning. We find what automated scanners miss.