Pentest of AI/LLM Systems

Where AI and LLM Systems Are Vulnerable

AI-based applications and large language models (LLMs) are being rapidly integrated into business-critical processes. Companies are using them to boost internal productivity, power customer-facing applications, and enable automated decision-making and agent-based workflows.

As an integral part of corporate infrastructure, AI and LLM applications are subject to the same security requirements as traditional IT systems and are increasingly subject to regulatory requirements as well. At the same time, they currently pose particularly high risks: They are often developed under high time pressure, their failure behavior is still relatively poorly understood due to their stochastic nature, and they are typically closely interlinked with sensitive data sources, tools, APIs, and the extended internal organizational infrastructure.

Due to their central role and high degree of autonomy, LLMs present a new target for attacks. Instead of exploiting only code errors or misconfigurations, attackers now target model behavior to exfiltrate sensitive data via retrieval mechanisms or abuse complex agent functions through context manipulation and prompt injections. These risks often cannot be reliably detected using traditional security analyses. For companies, this means that security can only be thoroughly analyzed using specialized testing approaches, such as our pentest of AI/LLM systems.

What Checks Are Included in the Penetration Test of Your AI/LLM System?

These checks are included in the pentest of your AI/LLM system, among others:

Taint analysis of the information flow throughout the entire system, with a focus on context poisoning, to detect direct and indirect prompt injections

Assessment of data exfiltration possibilities

Impact of attacker-induced malicious LLM outputs on downstream systems

Analysis of potential tool calls for “excessive agency”

Identification of LLM-based broken access control

Assessing the effectiveness of alignment training for deployed models with regard to:

Misinformation
Unethical outputs
Regulatory-relevant statements
Liability-relevant statements

Together for more security: The ISF DACH Chapter Spring Meeting 2026 at usd AG

Mar 30, 2026

On 12 and 13 March 2026, the Information Security Forum (ISF) DACH Chapter Spring Meeting was held at the CST Academy of usd AG in Neu-Isenburg. The ISF regularly brings together information security officers from all over the DACH region to openly discuss current...

Effectively Implementing Third-Party Risk Management under DORA

Mar 25, 2026

The Digital Operational Resilience Act (DORA) is now a reality for financial institutions and their service providers. In 2026, the focus will shift to the practical implementation of third-party risk management at financial institutions, as BaFin will conduct its...

Deep Dive Into Red Teaming: Physical Pentesting. How Resilient Is Your Organization?

Mar 19, 2026

As a security manager, you protect your systems and processes every day and invest in awareness training. However, experience shows that physical attacks are an often underestimated attack vector. Attackers combine digital and physical methods. This is where Red...

Where AI and LLM Systems Are Vulnerable

Common Vulnerabilities in AI/LLM Systems Include:

How Does usd AG Approach Penetration Testing of AI/LLM Systems?

What Checks Are Included in the Penetration Test of Your AI/LLM System?

Tip: AI Security Training

Get More Insights

Pentest: Our standardized approach

Pentests with usd AG:
Your benefits at a glance

How secure are AI chatbots?Common vulnerabilities in LLM platforms

OWASP „Vendor Evaluation Criteria for AI Red Teaming Providers & Tooling v1.0”

Contact

usd AG

News

Together for more security: The ISF DACH Chapter Spring Meeting 2026 at usd AG

Effectively Implementing Third-Party Risk Management under DORA

Deep Dive Into Red Teaming: Physical Pentesting. How Resilient Is Your Organization?

Follow Us

Pentest of AI/LLM Systems

Where AI and LLM Systems Are Vulnerable

Common Vulnerabilities in AI/LLM Systems Include:

How Does usd AG Approach Penetration Testing of AI/LLM Systems?

What Checks Are Included in the Penetration Test of Your AI/LLM System?

Tip: AI Security Training

Get More Insights

Pentest: Our standardized approach

Pentests with usd AG:Your benefits at a glance

How secure are AI chatbots?Common vulnerabilities in LLM platforms

OWASP „Vendor Evaluation Criteria for AI Red Teaming Providers & Tooling v1.0”

Contact

Together for more security: The ISF DACH Chapter Spring Meeting 2026 at usd AG

Effectively Implementing Third-Party Risk Management under DORA

Deep Dive Into Red Teaming: Physical Pentesting. How Resilient Is Your Organization?

Pentests with usd AG:
Your benefits at a glance