EU's draft high-risk AI guidelines: A provider cannot simply say “not for high-risk use” in a terms-of-service document while simultaneously marketing the system as...

...broadly useful for HR, education, legal decision support, public administration, finance, border control, or other sensitive settings.

Pascal Hetzscholdt

May 20, 2026

Summary: The EU’s draft high-risk AI guidelines are important because they turn the AI Act into a practical classification tool for providers, deployers, regulators, and market surveillance authorities.

Hard enforcement is unlikely in the immediate short term due to delayed application dates, but AI providers should treat the guidance seriously now because product design, documentation, sales language, and intended use will shape future liability.

Overall, this is a positive but incomplete development: it closes obvious loopholes, but still relies too much on self-assessment, “intended purpose,” and future enforcement capacity.

Europe Is Drawing the Boundary Around “Serious AI”

by ChatGPT-5.5

The EU’s draft high-risk AI guidelines are important because they turn the EU AI Act’s abstract “high-risk AI” category into something operational. This is where the law starts becoming a product-classification system, a procurement checklist, a market-access gate, and eventually an enforcement tool. The documents explain when AI becomes high-risk under two routes: either because it is part of a regulated product or safety component under Annex I, or because it is used in sensitive standalone use cases under Annex III, such as biometrics, education, employment, essential services, law enforcement, migration, justice, and democratic processes. The Commission stresses that high-risk classification does not ban these systems; it means they must meet requirements around safety, accuracy, risk mitigation, human oversight, documentation, and fundamental-rights protection.

The most important move is that the Commission is trying to close the obvious loopholes. A provider cannot simply say “not for high-risk use” in a terms-of-service document while simultaneously marketing the system as broadly useful for HR, education, legal decision support, public administration, finance, border control, or other sensitive settings. The guidelines say the whole presentation of the system matters: instructions, contracts, usage policy, promotional material, examples, technical documentation, and reasonably foreseeable functionality. That is a major warning to horizontal AI providers, because product positioning can itself help determine intended purpose.

The second important move is the rejection of the “human-in-the-loop” escape route. The guidelines make clear that human involvement does not, by itself, change classification. Human oversight is a compliance requirement once a system is high-risk; it is not a magic wand that turns high-risk into low-risk. This matters because many AI vendors currently describe their systems as “assistive,” “copilot,” “decision-support,” or “recommendation-only” tools while knowing that, in real workflows, the output may materially shape a decision.

The third significant move is the treatment of complex and agentic systems. The Commission says split architectures should be assessed as a whole where their combined outputs materially influence an individual decision. That is crucial. Without that rule, providers could decompose a sensitive AI workflow into allegedly harmless modules: one model retrieves information, another ranks candidates, another drafts the recommendation, another generates a confidence score, and the provider then claims no single module is the “decision-maker.” The guidelines are rightly alert to that architecture-level circumvention risk.

Will any of this be enforced in the short term?

Not in the hard, fines-and-market-withdrawal sense for most high-risk systems. The guidelines themselves are still draft, non-binding, and subject to stakeholder consultation; the Commission also states that only the Court of Justice of the EU can give the authoritative interpretation of the AI Act.

The short-term enforcement picture is also weakened by the Digital/AI Omnibus delay. The attached general-principles document says the original dates were 2 August 2026 for Annex III high-risk systems and 2 August 2027 for Annex I systems, but that these are now postponed to 2 December 2027 and 2 August 2028 respectively. The Council’s May 2026 provisional agreement confirms those delayed dates for standalone high-risk systems and AI embedded in products, although the source also notes that formal endorsement and legal-linguistic steps still follow.

That said, “not yet enforceable” does not mean “irrelevant.” The Commission’s own standardisation page says high-risk requirements must be fulfilled before market placement and that harmonised standards are being developed for risk management, data governance, record-keeping, transparency, human oversight, accuracy, robustness, cybersecurity, quality management, and conformity assessment. It also says the latest application dates are 2 December 2027 and 2 August 2028, but the Commission may decide earlier application if support tools, including standards, are available earlier.

So the practical answer is: formal enforcement is mostly not immediate; procurement, legal diligence, product redesign, standards preparation, and regulatory expectation-setting begin now. Providers who wait until late 2027 will almost certainly be too late.

How seriously should AI providers take this?

Very seriously. For AI providers, this is not just regulatory commentary; it is the emerging map of the EU market-access perimeter.

Providers should immediately review product documentation, demos, sales decks, customer use cases, contracts, acceptable-use policies, technical documentation, and API examples. The most dangerous inconsistency will be saying “we prohibit high-risk uses” while sales teams, partner materials, app-store listings, enterprise pilots, or workflow integrations suggest otherwise. The guidelines are effectively telling providers that legal disclaimers will be judged against product reality.

Providers also need a classification file for every significant AI system. If they claim the Article 6(3) “filter mechanism” applies because the system performs only a narrow procedural task, improves a previously completed human activity, detects deviations, or performs a preparatory task, they will need to document that assessment, explain why the system is not profiling, and register exempted systems in the EU database. Market surveillance authorities may later challenge misclassification, require corrective action, and impose penalties if the low-risk classification was used to circumvent the Act.

The highest-risk providers are not only obvious “AI for hiring” or “AI for policing” vendors. General-purpose AI providers, enterprise copilots, workflow automation companies, agentic AI platforms, legal AI vendors, education-tech companies, HR platforms, financial scoring tools, health-insurance systems, migration tools, and government-service automation providers all need to treat these guidelines as a design constraint. The moment a system is promoted or configured to influence access to jobs, education, benefits, justice, credit, insurance, public services, or border decisions, the provider is no longer merely selling a clever productivity tool.

Flaws and omissions

The biggest structural flaw is the continued reliance on “intended purpose.” The Commission tries to make this harder to game by looking at the provider’s full product presentation, but the problem remains: in modern AI ecosystems, actual purpose often emerges downstream. A model provider supplies an API; a platform wraps it; a customer fine-tunes it; a systems integrator connects it to HR data; an agentic workflow starts making recommendations; and suddenly the “intended purpose” is distributed across several actors. The guidelines recognise this problem, but they do not fully solve the accountability gap.

Second, the filter mechanism is useful but risky. Concepts such as “narrow procedural task,” “preparatory task,” and “improves the result of a previously completed human activity” are necessary, but they are also exactly the phrases vendors will use to avoid high-risk classification. Almost every AI copilot can be described as preparatory. Almost every scoring system can be described as assistive. Almost every recommendation engine can be dressed up as human-reviewed. The guidelines are right to say that systems materially influencing decisions should not escape classification, but enforcement will depend on whether authorities can inspect real workflows, not just provider descriptions.

Third, the guidelines underdevelop the knowledge-integrity problem. They focus on whether a system falls into a high-risk use case, but they say far less about whether the underlying knowledge, training data, retrieval corpus, citations, source material, or update process is reliable, licensed, current, and traceable. For sectors such as healthcare, legal services, scholarly research, education, public administration, and regulated finance, the risk is not only discriminatory automation or lack of human oversight. The risk is that the system confidently relies on outdated, hallucinated, pirated, unlicensed, unverifiable, or retracted information. For publishers and trusted knowledge providers, that omission matters.

Fourth, the treatment of agentic AI is directionally good but too thin. The guidelines mention complex, interconnected and agentic systems, but future guidance needs to go deeper into tool use, MCP-style connectors, autonomous execution, logging, permissions, prompt injection, post-deployment drift, customer-side configuration, and multi-agent workflows. The next generation of high-risk AI will not always look like a single model producing a score. It will look like a chain of tools quietly gathering data, ranking options, drafting recommendations, escalating cases, and initiating actions.

Fifth, enforcement capacity is the unresolved political issue. The documents assume competent market surveillance authorities can evaluate classification, challenge misclassification, and impose penalties. That is correct on paper. But if the EU delays obligations, relies heavily on self-assessment, and does not resource authorities with technical audit capability, the regime risks becoming another compliance theatre: lots of documentation, limited inspection, and enforcement only after visible scandals.

Positive, negative, or neutral?

ChatGPT’s view: this is a positive development, but a fragile one.

It is positive because the Commission is drawing sensible lines. It rejects the most cynical loopholes: “we only assist,” “a human is involved,” “the risky part is in another module,” “our terms ban this use,” or “the customer is responsible.” It also gives providers practical examples and signals that high-risk AI is not prohibited but must be governed like serious infrastructure. That is the right philosophy.

But it is fragile because the political context is already one of delay, simplification, and industry pressure. The Omnibus postponement may be pragmatic if standards and tools are not ready, but it also weakens the credibility of the AI Act at precisely the moment when high-risk AI is moving into real workflows. The danger is that Europe spends two years refining classifications while AI providers deepen market dependency, embed themselves into public and private infrastructure, and then argue that compliance is too disruptive.

So I, ChatGPT, would call this a constructive but incomplete development. It is not a Pandora’s box; it is an attempt to put a lid on one. The risk is that by the time the lid is enforceable, many of the systems it was meant to control will already be embedded in hiring, education, public benefits, insurance, justice, migration, and democratic infrastructure. The guidelines are therefore a strong warning shot, but not yet a functioning enforcement regime.

Pascal’s Substack

Discussion about this post

Ready for more?