EU GMP Annex 11 Requirements for Computerised Systems in Pharmaceutical Manufacturing

Annex 11 governs computerised systems in EU pharma manufacturing. Its data integrity requirements and AI implications are more specific than teams assume.

EU GMP Annex 11 Requirements for Computerised Systems in Pharmaceutical Manufacturing
Written by TechnoLynx Published on 25 Apr 2026

Annex 11 is narrower than its reputation suggests

EU GMP Annex 11 is often cited as the blanket regulatory framework for software in European pharmaceutical manufacturing. In practice, its scope is both more specific and more consequential than that framing implies. Annex 11 governs computerised systems — defined as systems that create, modify, maintain, archive, retrieve, or transmit data required by GMP — used in the manufacture of medicinal products in the European Union. It does not apply to every piece of software in a pharmaceutical facility, and it does not apply identically to every computerised system within its scope.

Understanding exactly what Annex 11 requires — and where it stops — is a prerequisite for deploying AI in EU pharmaceutical operations without either under-validating (compliance gap) or over-validating (wasted effort). The regulatory text is 16 sections long. The operational implications, particularly for AI systems, are concentrated in a handful of those sections.

EudraLex Volume 4 Annex 11 Section 4.8 requires that regulated records must be protected by physical and electronic means against damage, and backup data integrity and accuracy must be checked during validation (per EudraLex Volume 4, published specification). The MHRA’s Data Integrity guidance (2018) established that organisations must maintain data throughout the complete retention period — typically 15–25 years for pharmaceutical batch records, depending on product class.

What Annex 11 actually governs

The scope clause (Section 1) defines the boundary: Annex 11 applies to all forms of computerised systems used as part of GMP-regulated activities. A system is “computerised” if it involves electronic data processing — which includes AI/ML models that process manufacturing data, generate quality predictions, or classify inspection results.

Importantly, Annex 11 does not apply to computerised systems used in non-GMP activities. A scheduling optimisation system, an energy management system, or a business intelligence tool operating in a pharmaceutical facility falls outside Annex 11 scope unless it creates or modifies GMP-regulated data. The GxP scope assessment framework applies here: the question is whether the system participates in GMP-regulated data handling, not whether it runs in a pharmaceutical building.

The sections most relevant to AI system deployment, condensed:

Section Topic AI-specific implication
3 Suppliers and Service Providers Buying a “validated” vendor model does not transfer validation obligation; verification evidence must be documented
4 Validation Risk-proportionate; aligns with GAMP 5 Second Edition; methodology must be documented, not prescribed
7 Data Storage Outputs plus traceability metadata (model version, input reference, confidence score) must be stored and verified
9 Audit Trail Model version changes must be recorded with old/new value, identity, timestamp, and rationale
10 Change and Configuration Management Retraining, pipeline changes, training-data updates all trigger change control with impact assessment
11 Periodic Evaluation Continuous monitoring becomes a regulatory requirement, not just an operational best practice

Section 3 — Suppliers and Service Providers. When a pharmaceutical company uses an AI system developed by a third party — a vendor-supplied quality inspection model, a commercial process monitoring platform, or an outsourced ML development — the company retains responsibility for ensuring the system meets Annex 11 requirements. Purchasing a vendor’s “validated” AI product does not transfer the validation obligation. The pharmaceutical company must verify that the vendor’s validation approach is adequate for the intended GMP use, and must maintain documented evidence of that verification.

Section 4 — Validation. Computerised systems must be validated proportionate to the risk and complexity of the application. This section aligns with the GAMP 5 Second Edition risk-based approach — validation intensity scales with the system’s impact on product quality and data integrity. For AI systems, Annex 11 Section 4 — particularly 4.4 (user requirements specification), 4.6 (quality and performance assessment for custom systems), and 4.7 (documented test evidence) — does not prescribe a specific validation methodology. It requires that the methodology is documented, risk-proportionate, and produces evidence that the system performs its intended function reliably.

Section 7 — Data Storage. Data must be secured against damage by both physical and electronic means (7.1). Stored data must be checked for accessibility, readability, and accuracy, with regular back-ups verified for integrity (7.2). For AI systems that generate GxP records (inspection classifications, deviation flags, process parameter predictions), the stored data includes both the system’s outputs and the metadata required for traceability — model version, input data reference, and confidence score.

Section 9 — Audit Trail. Annex 11 requires that any GMP-relevant changes to data are recorded in an audit trail capturing the old value, the new value, the identity of the person (or system) making the change, and the timestamp. For AI systems, this extends to model version changes: when a retrained model replaces the production version, the audit trail must record the change, the rationale, and the evidence that the new version meets acceptance criteria.

Section 10 — Change and Configuration Management. Any change to a validated computerised system must follow a documented change management process. For AI systems, this covers model retraining, preprocessing pipeline modifications, infrastructure changes that could affect model behaviour, and training data updates. The change management process must include impact assessment and, where the risk warrants it, revalidation.

Section 11 — Periodic Evaluation. Computerised systems must be evaluated periodically to confirm they remain in a validated state. This is where continuous monitoring becomes a regulatory requirement, not just an operational best practice. An AI model whose performance has not been assessed since initial validation cannot claim ongoing compliance with Section 11 — the periodic evaluation must demonstrate that the system continues to perform within its documented acceptance criteria.

How does Annex 11 intersect with AI-specific challenges?

Several AI-specific operational realities create friction with Annex 11 requirements that were designed for deterministic software. Understanding these friction points is necessary for designing AI systems that meet the regulation’s intent without producing validation artifacts that test the wrong properties.

As reported in the European Medicines Agency (2024) inspection summary, Annex 11 non-compliance was cited in roughly 22% of GMP inspection findings across EU manufacturing sites — a directional industry-scale figure from the published report, not an operational benchmark for any specific firm. PIC/S survey reports indicate that data integrity findings under Annex 11 increased on the order of 35% between 2019 and 2023 across member-state inspections (published-survey class, market-direction framing, not a benchmarked rate for any individual operation).

Model drift and Section 11

Deterministic software does not change behaviour between validated versions — the same code on the same infrastructure produces the same results. ML models can drift: the production data distribution changes, the model’s performance degrades, and the degradation may be invisible without monitoring. Section 11’s periodic evaluation requirement maps directly to this challenge, but the evaluation frequency must be informed by the model’s drift characteristics, not by an arbitrary schedule (quarterly, annually) designed for stable software.

For pharmaceutical AI systems, our recommendation is continuous automated monitoring with periodic human review — the monitoring infrastructure detects performance shifts as they occur, and the periodic review provides documented evidence for Annex 11 compliance. The monitoring should track not just headline accuracy but domain-specific metrics: false positive rate at the operating threshold, performance across data subsets (different product variants, different lighting conditions, different shifts), and input data distribution statistics that flag drift before it affects output quality (an observed pattern across our regulated-software engagements, not a benchmarked threshold).

Explainability and Sections 8.1–8.2

Sections 8.1 and 8.2 — requiring clear printed copies of electronic data and the ability to identify whether records have been altered since original entry — together with the broader data integrity expectations imply that GMP-regulated computerised system outputs must be understandable to the quality team responsible for them. For deterministic software, this is straightforward: the output is a calculated value from known inputs using documented logic. For ML models, the mapping from input to output is not transparent in the same way.

Annex 11 does not explicitly require model explainability, but the regulatory expectation that quality teams can investigate and understand system outputs creates an implicit requirement for AI systems operating in GMP contexts. A quality engineer investigating a deviation flagged by an AI system must be able to determine why the system flagged that particular event — not at the individual-neuron-weight level, but at a level sufficient to evaluate whether the flag is meaningful or spurious. In our experience, this is the most underestimated Annex 11 requirement for AI: the regulation does not name explainability, but the intent demands it.

Practical approaches include feature importance analysis (SHAP values, gradient-based attribution) for tabular models, saliency mapping for computer vision models (which regions of the image drove the defect classification), and documented decision boundaries for classification models. The explainability approach should be part of the validation documentation — not because Annex 11 prescribes it, but because the ability to investigate AI system outputs is a prerequisite for meeting the regulation’s intent around data traceability and quality decision transparency.

Electronic signatures and Section 14

Section 14 requires that electronic signatures have the same legal significance as handwritten signatures. For AI systems, this is relevant when the system’s output is used as the basis for a GMP quality decision — a batch release determination, a deviation classification, or an inspection pass/fail judgment. The electronic signature of the person accepting the AI system’s recommendation must be traceable, attributable, and linked to the specific system output being accepted.

In practice, this means the AI system’s user interface must present the model’s output (prediction, classification, recommendation), capture the human operator’s acceptance or rejection of that output with an electronic signature, and link the signature to the specific model version, input data, and output. This is a UX and audit trail design requirement — straightforward to implement, but frequently overlooked until audit preparation reveals the gap.

Annex 11 in the context of the broader EU regulatory landscape

Annex 11 does not operate in isolation. Pharmaceutical companies deploying AI in EU manufacturing must also consider the EU AI Act requirements that apply to GxP systems, which introduce additional classification and transparency obligations for high-risk AI systems. The intersection of Annex 11 (GMP-specific computerised system requirements) and the EU AI Act (general AI regulation with sector-specific implications) creates a compliance landscape that requires system-by-system assessment rather than blanket policy application.

For pharmaceutical manufacturers operating across both EU and US jurisdictions, the Annex 11 requirements must be reconciled with the FDA’s CSA guidance and 21 CFR Part 11. The requirements are compatible in principle — both are risk-based frameworks that emphasise proportionate validation and data integrity — but they differ in terminology, documentation expectations, and audit practices. A validation strategy designed only for one jurisdiction will have gaps when audited by the other.

The organisations that navigate this landscape most effectively are those that assess each AI system individually against the applicable regulatory frameworks rather than applying a single validation template across all systems. If your EU pharmaceutical operations are evaluating which AI systems fall within Annex 11 scope and what the proportionate validation approach should be for each, a GxP Regulatory Scope Analysis maps the regulatory requirements per system across applicable jurisdictions.

FAQ

What does EU GMP Annex 11 actually require for computerised systems in pharmaceutical manufacturing?

Annex 11 requires that any computerised system used in GMP-regulated activities — meaning any system that creates, modifies, maintains, archives, retrieves, or transmits data required by GMP — is validated proportionate to its risk, has a documented audit trail for GMP-relevant data changes, is subject to change control, and is periodically evaluated to confirm it remains in a validated state. It does not prescribe a single validation methodology; it requires that whatever methodology is used is documented and risk-proportionate.

How do Annex 11 requirements differ from 21 CFR Part 11 for the same AI system?

Annex 11 and 21 CFR Part 11 are compatible in principle — both are risk-based and emphasise proportionate validation and data integrity — but they differ in terminology, documentation expectations, audit practices, and the way computerised-system scope is defined. A validation strategy designed only for one jurisdiction will typically have gaps when audited by the other, particularly around supplier responsibility (Annex 11 Section 3) and signature semantics (Section 14 vs. Part 11 Subpart C).

Where does Annex 11 apply specifically to AI/ML-based computerised systems?

It applies wherever the AI/ML system creates or modifies GMP-regulated data — quality inspection classifiers, deviation flagging, process parameter prediction, batch-record generation, release-decision support. It does not apply to AI systems whose outputs do not enter GMP-regulated data flows, such as scheduling optimisation or energy management, unless those outputs feed back into regulated decisions or records.

What does an Annex 11–compliant validation package contain (risk assessment, electronic records, audit trail, change control)?

A compliant package contains a documented risk assessment driving the validation scope; a user requirements specification (Section 4.4); evidence that the supplier’s contribution has been assessed (Section 3); test evidence proportionate to the risk (Section 4.7); audit-trail design covering old value, new value, identity, and timestamp (Section 9); a change and configuration management process (Section 10); a data storage and backup verification approach (Section 7); and a periodic evaluation plan (Section 11). For AI systems, the package also documents how model versioning, retraining triggers, and performance monitoring are handled within those existing controls.

How is the draft 2025 Annex 11 revision changing requirements for AI-driven systems?

The draft revision currently in consultation explicitly references AI/ML systems, dynamic learning, and the need for ongoing performance verification — concepts that are only implicit in the current text. The direction of travel is toward making continuous monitoring and lifecycle management of model behaviour an explicit Annex 11 expectation rather than an interpretation of Section 11. Companies that already operate continuous monitoring with documented periodic review are well positioned; companies that validate once and re-check annually will face a wider gap to close.

Which Annex 11 controls demand special evidence when the underlying model is retrained?

Retraining touches Section 9 (audit trail of the model version change), Section 10 (change and configuration management with impact assessment), Section 4 (revalidation evidence where the change is significant), Section 7 (storage of the new model artefact plus the training data reference), and Section 11 (post-change periodic evaluation showing the new version remains within acceptance criteria). The retraining event itself should be treated as a controlled change with its own evidence trail, not a routine update.

Annex 11’s force is not in any single section but in the cumulative expectation that the people responsible for a GMP-regulated computerised system can, at any point, demonstrate it is doing what it is supposed to do and explain how they know. For AI systems, that expectation is harder to meet than it looks — and easier to meet than teams assume, once the validation scope is sized correctly.

Back See Blogs
arrow icon