Fred Rohrer's Blog

AI Cybersecurity blog by Fred Rohrer

AI Security Featured

Multi-Path Ensemble Detection of Prompt Injection Attacks via Embedding Similarity, Trajectory Analysis, and Fine-Tuned Classification

Abstract. Prompt injection attacks pose a critical threat to large language model (LLM) deployments, enabling adversaries to override system instructions, exfiltrate data, and bypass safety controls. We present a multi-path ensemble system that combines three complementary detection strategies: (1) centroid-based embedding similarity against curated attack pattern clusters, (2) trajectory analysis