This website uses cookies

Read our Privacy policy and Terms of use for more information.

On June 3, the European Commission presented a tech sovereignty package centred on the proposed Cloud and AI Development Act, Chips Act 2.0, and the EU Open Source Strategy. CADA defines four cloud and AI sovereignty assurance levels for public-sector risk assessment and targets at least tripling EU data-centre capacity within five to seven years. The Commission's framing was unusually geopolitical for Brussels: cloud dependence is now being treated not only as an IT sourcing issue, but as a strategic control risk.

The package will dominate sovereignty discussions for the rest of the year, and we cover it in The Brief. But this edition's Deep Dive keeps a promise we made in April: the decision tree for choosing between RAG and the three-folder knowledge base pattern — and what to do when a RAG system has already hallucinated its way past an auditor.

The two stories are the same story at different altitudes. The Sovereignty Package asks: who controls your infrastructure, and can you prove it? The knowledge-architecture question asks: who controls your AI's answers, and can you prove it? Stanford RegLab's peer-reviewed evaluation of commercial legal AI tools found that systems built by the most sophisticated RAG engineering teams in the world — LexisNexis and Thomson Reuters — still produced incorrect information on roughly one-in-six to one-in-three expert-scored queries, often with citations that looked impeccable. Gartner added the governance frame two weeks ago: enterprises that apply uniform, binary governance to AI systems of different autonomy and risk levels will fail, and by 2027, 40% of enterprises will demote or decommission AI agents after production incidents expose the gap.

The thesis this week: when an AI system's answers must survive an auditor, unverified retrieval probability becomes a liability — and the choice between probabilistic retrieval and deterministic knowledge structures is a governance decision that most organisations are currently letting a platform vendor make for them by default.

TL;DR
  • Sovereignty is becoming procurement language. CADA's four assurance levels will appear in EU public-sector tenders first and in your private-sector vendor questionnaires within 18 months. Map your cloud estate against the levels now, before a client or regulator does it for you.

  • Binary AI governance fails in both directions. Gartner's May 26 warning: over-restrict simple systems and you breed shadow AI; under-restrict autonomous ones and you breed incidents. Proportional, autonomy-tiered governance is the fix — and it maps cleanly onto the AI Act's risk-tier logic.

  • Your knowledge architecture is a compliance surface. Article 50 disclosure obligations land August 2; Article 12 traceability follows for high-risk systems in December 2027. A RAG pipeline that cannot reproduce why it gave an answer is an audit finding waiting to be written. Choose retrieval architecture per use case, deliberately, with a decision record.

  • The Omnibus enters its final procedural mile. Formal adoption is expected in the June/July window. One under-reported detail: the watermarking compliance deadline moved to December 2, 2026 — but Article 50 user-disclosure obligations still bite on August 2. Do not let the first date relax your teams about the second.

10x the context. Half the time.

Speak your prompts into ChatGPT or Claude and get detailed, paste-ready input that actually gives you useful output. Wispr Flow captures what you'd cut when typing. Free on Mac, Windows, and iPhone.

The Brief

1. The EU Tech Sovereignty Package: Brussels Moves From Speeches to Legislative Text

On June 3, the European Commission presented the European Technological Sovereignty Package — measures spanning semiconductors, AI, cloud, and open source. The centrepiece is the proposed Cloud and AI Development Act (CADA), which aims to reduce reliance on non-EU cloud providers and at least triple EU data-centre capacity within five to seven years. CADA sits inside the broader AI Continent / tech sovereignty push, which the Commission frames around mobilising €200 billion for AI investment via InvestAI, while CADA itself specifically targets private investment in cloud and data-centre capacity. The package also includes a revised Chips Act and accelerated permitting for data centres and their energy supply.

Why it matters: This is the infrastructure answer to the regulatory lead Europe already holds. For two years the sovereignty conversation was rhetorical; CADA turns it into legislative text with definitions, obligations, and assurance levels that procurement teams can cite. The proposal now enters Parliament and Council negotiations — meaning the lobbying window is open and the final shape is not settled. Watch: Committee assignments and the first Parliament draft report — they will signal whether the assurance levels survive intact. Source: European Commission — Strengthening Europe's tech sovereignty · EC — Cloud and AI Development Act

2. CADA's Four Sovereignty Levels: A Procurement Framework Hiding Inside a Regulation

CADA defines four cloud sovereignty assurance levels, to be used by public-sector bodies based on risk assessment. Level 1: data hosted in EU infrastructure. Level 2: providers must demonstrate independence from third countries and transparency over their software supply chain. Level 3: providers must be EU-owned and controlled, with additional criteria including personnel citizenship — though the Commission may recognise third-country providers. Level 4: full transparency and control over the software supply chain with no third-country interference.

Why it matters: Assurance levels written for public procurement never stay in public procurement — see SecNumCloud, see C5, see the EUCS debate. Within a procurement cycle or two, regulated private-sector buyers (banks under DORA, operators under NIS2) will lift these levels into their own vendor due diligence. If you sell cloud-hosted AI services in Europe, your sales team will be asked which level you can attest to. If you buy them, the levels give you a vocabulary you previously had to invent per-contract. Watch: Whether Level 3's ownership-and-citizenship criteria survive the inevitable US hyperscaler counter-lobby. Source: EC — Cloud and AI Development Act · CSA — EU Tech Sovereignty: Cloud Concentration Risk and the Compliance Cascade

3. Gartner: Uniform Governance Across AI Agents Guarantees Failure

In a May 26 press release, Gartner warned that applying uniform governance across AI agents — regardless of autonomy level and scope — leads to enterprise AI agent failure. The prediction: by 2027, 40% of enterprises will demote or decommission autonomous AI agents due to governance gaps identified only after production incidents. Senior director analyst Shiva Varma identified the root cause as binary thinking — agents treated as either "locked down or fully trusted." Gartner's recommended model classifies agents into four autonomy tiers: Observe (read-only), Advise (read-only plus hallucination testing), Act with Approval (human review, audit trails), and Autonomous (guardrails, rollback, continuous monitoring, red-teaming).

Why it matters: This is the analyst-tier validation of what the May 28 edition called the agent governance stack. Note the failure mode symmetry: over-restriction drives shadow AI development; under-restriction drives incidents. Both end in the 40%. The four-tier model also maps naturally onto AI Act risk tiers — which means building it once serves both your internal control framework and your December 2027 compliance evidence. Source: Gartner — Uniform governance across AI agents will lead to failure · CIO — Many autonomous agents doomed by governance failures

4. Omnibus Adoption Expected Around the June Plenary — and the Watermarking Date Moved

The Digital Omnibus on AI, provisionally agreed May 7, is in legal-linguistic revision with formal adoption by Parliament and Council expected between June and July. Until formal adoption and Official Journal publication, this remains the provisional agreement text. Two details deserve more attention than they are getting. First, the agreement adds a new Article 5 prohibition covering AI systems that generate non-consensual intimate imagery or CSAM, including nudifier apps. Second, the compliance deadline for machine-readable watermarking of AI-generated content moved from August 2, 2026 to December 2, 2026 — while the rest of the Article 50 transparency obligations remain on the August date.

Why it matters: The split deadline is a trap for compliance calendars. Teams that hear "watermarking moved to December" may file the entire transparency file under Q4. Wrong: the obligation to inform users they are interacting with AI, to disclose deep fakes, and to label AI-generated public-interest text still applies from August 2. Brief your product owners on the two-date structure explicitly. Watch: The plenary vote date, and publication in the Official Journal — entry into force is three days later. Source: European Parliament — Legislative Train: Digital Omnibus on AI · Matheson — EU AI Digital Omnibus: where are we now?

5. Code of Practice on Marking and Labelling Published — Now Awaiting Adequacy Assessment

The Commission published the final Code of Practice on transparency of AI-generated content on June 10, after the multi-stakeholder drafting process. It covers two surfaces: provider rules for marking and detecting AI-generated or manipulated content, and deployer rules for labelling deep fakes and certain AI-generated text on matters of public interest. The Code is voluntary, but Article 50's transparency obligations remain legal obligations from August 2, 2026. The Code is now undergoing adequacy assessment by the Commission and the AI Board.

Why it matters: If the Code receives a positive adequacy assessment, signatories will be able to rely on its measures to demonstrate compliance with Article 50's marking and labelling obligations. Teams that do not follow it will need to show, case by case, that their own measures are adequate. That makes the Code less like a soft-law curiosity and more like the implementation baseline most compliance teams will use. With the watermarking deadline now December 2, the build window is real — but it starts now, not after the adequacy verdict. Source: EC — Code of Practice on marking and labelling of AI-generated content

6. Article 50 Final Guidelines: Seven Weeks to Enforcement

The targeted consultation on the Commission's draft Article 50 transparency guidelines closed June 3; the final guidelines are expected before the August 2 application date. The 40-page draft remains the best available signal of what the AI Office considers compliant: clear disclosure when users interact with AI, machine-readable marks on generated content, and disclosure duties for emotion recognition and biometric categorisation systems.

Why it matters: Seven weeks. If your chatbots, content tools, or biometric systems serve EU users, the implementation should already be in sprint. The final guidelines may adjust details; they will not adjust the deadline. Covington's analysis of the draft is the most useful practitioner read so far — particularly on the interplay between provider marking duties and deployer labelling duties. Source: EC — Article 50 transparency consultation · Covington — 10 Takeaways from the draft transparency guidelines

7. The Evidence Base: Commercial Legal RAG Still Hallucinates 17–34% of the Time

Stanford RegLab's preregistered evaluation of AI legal research tools — now peer-reviewed and published in the Journal of Empirical Legal Studies — tested Lexis+ AI, Westlaw AI-Assisted Research, and Ask Practical Law AI on 202 legal queries, hand-scored by legal experts. Lexis+ AI produced incorrect information more than 17% of the time; Westlaw AI-Assisted Research was roughly one-third wrong in the evaluated queries. Both vendors had marketed their products as hallucination-free or hallucination-avoiding. A recurring failure mode: real citations attached to claims the cited document does not support.

Why it matters: These are not hobbyist deployments — they are RAG systems built by the two most experienced legal-content engineering organisations on earth, over proprietary, curated, canonical corpora. If the best-funded legal RAG systems still fail at material rates over curated corpora, your internal compliance chatbot over a SharePoint export deserves verification before anyone treats it as reliable. The "miscited but real-looking source" failure mode is the one auditors find — months later. This study is the empirical spine of this week's Deep Dive. Source: Stanford RegLab — Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools · Stanford HAI — AI on Trial

8. Data-Centre Buildout: The Package's Physical Layer

Beyond the sovereignty framework, the June 3 package commits to tripling EU data-centre capacity within five to seven years, supported by accelerated permitting and improved access to energy, land, water, and financing — alongside the broader InvestAI ambition to mobilise €200 billion for AI investment. The Commission pairs this with open-source investment provisions and a revised Chips Act targeting design and advanced-node capacity.

Why it matters: Europe's regulatory lead has always been undermined by its infrastructure gap — the theme of our January 29 edition on the sovereignty paradox. This is the first package that addresses the gap with permitting reform rather than just funding announcements. For capacity-constrained enterprises: if the permitting acceleration is real, the EU-region availability and pricing picture in 2028–2030 looks materially different from today's planning assumptions. Worth a line in your infrastructure strategy refresh. Source: EC — Strengthening Europe's tech sovereignty · IEU Monitoring — Commission targets digital dependencies

9. European Capital: Defence and Resilience Lead the Week

European venture activity in early June continued the pattern Crunchbase identified for Q1 — $17.6 billion raised, up nearly 30% year over year, with AI claiming more than half of all European funding for the first time. The standout signal: ICEYE, the Finnish-Polish satellite intelligence company, announced one of Europe's largest defence-tech funding rounds of the year, expanding space-based intelligence for sovereign defence and resilience — alongside a cluster of smaller AI-enabled rounds in climate and health tech.

Why it matters: Capital concentration in defence, resilience, and sovereignty-adjacent infrastructure is the private-market echo of the June 3 package. The investable thesis in European tech right now is strategic autonomy. For corporate development and procurement teams, the vendor landscape in sovereign cloud, secure compute, and resilience tooling will be meaningfully deeper in 24 months — plan vendor reviews accordingly. Source: EU-Startups — Weekly funding round-up (June 1–5) · Crunchbase — AI drives Europe's second straight quarter of funding gains

Deep Dive

RAG or Records? The Decision Tree for Knowledge That Faces an Auditor

In April we built an EU AI Act knowledge base out of three folders and an index file. We promised the follow-up: when does that pattern beat RAG, when does RAG win, and what do you do with a RAG system that has already hallucinated past an auditor? This is that edition.

What Changed

Two evidence streams converged this spring. The first is empirical: Stanford RegLab's peer-reviewed evaluation showed that the best-funded RAG deployments in the legal industry — built over curated, canonical corpora by vendors with decades of content-engineering experience — still produce incorrect answers on 17% to 33% of expert-scored queries. The headline number is bad; the failure mode is worse. The tools did not mostly invent fake cases. They cited real documents for claims those documents do not support. That is the failure an internal reviewer misses and an external auditor finds.

The second stream is regulatory. Article 50's transparency obligations apply from August 2. Article 12's logging and traceability requirements for high-risk systems become enforceable in December 2027, under the provisional Omnibus agreement heading to formal adoption. Together they convert a question that used to live in engineering standups — "how should our AI retrieve knowledge?" — into a question that lives in audit scopes: "can you reproduce why this system gave this answer, and does the cited source actually say that?"

A RAG pipeline answers that question probabilistically. An embedding model chose chunks by cosine similarity; the chunks that were retrieved on Tuesday may not be retrieved for the same query on Thursday after a re-index; the citation was assembled by a model that, per Stanford, attaches real references to unsupported claims at material rates. A compiled knowledge base — the three-folder pattern we built in April, with raw sources, LLM-maintained wiki articles, and a human-readable index — answers it deterministically. The article the model read is a file with a version history. The citation points to a canonical source in raw/. The retrieval step is reproducible because it is legible: the model read the index and opened a named file.

Why It Matters

Most organisations are not making this architecture decision. They are receiving it. Every enterprise platform now ships a "chat with your documents" feature, and every one of those features is RAG, because RAG is the only pattern that generalises across customers without per-customer knowledge engineering. When your teams enable the feature, you have adopted a probabilistic retrieval architecture for whatever corpus they point it at — HR policies, contract repositories, regulatory guidance — without anyone writing down that a decision was made, what the alternatives were, or what failure rate was accepted.

For most corpora, that is fine. Nobody audits the canteen FAQ. The decision matters in the band of use cases where answers carry obligations: compliance guidance, contractual commitments, regulated advice, anything that ends up quoted in a filing or relied on by a customer. In that band, the relevant comparison is not "RAG versus nothing." It is "RAG versus a knowledge structure whose every answer is a readable file with a provenance trail."

What Enterprises Usually Miss

The first miss is that corpus dynamics matter more than corpus size. RAG's genuine advantage is handling large, heterogeneous, fast-changing corpora where nobody will ever curate per-concept articles — support tickets, call transcripts, the whole document lake. The three-folder pattern's advantage is the opposite profile: corpora that are bounded, slow-moving, and canonically structured — regulation, internal policy, standard contracts, product documentation. The EU AI Act changes a few times a year and has numbered articles with stable names. Embedding it and searching by similarity discards exactly the structure that makes it tractable. Most regulated-knowledge corpora look like the AI Act, not like the document lake — which means the default architecture is wrong for precisely the use cases with audit exposure.

The second miss is that RAG failures are silent by design. A compiled wiki article that is wrong is wrong visibly, in a file someone can read, diff, and correct once for every future query. A RAG hallucination is wrong invisibly, per-query, wrapped in fluent prose and a plausible citation. Stanford's reviewers needed legal experts hand-scoring 202 queries to surface the failure rates; your organisation will not do that for its internal tools. The practical consequence: by the time a RAG error surfaces, it has usually been repeated across months of queries, and the remediation question is not "fix the bug" but "which past answers do we now have to doubt?" — a question auditors are professionally trained to ask and engineering teams are structurally unable to answer.

The third miss is governance asymmetry. Gartner's four autonomy tiers — Observe, Advise, Act with Approval, Autonomous — implicitly assume you can test what the system will say. A deterministic knowledge layer makes that testing meaningful: the wiki is enumerable, so you can review what the system knows. A RAG layer makes it sampling: you can only test what you thought to ask. When the December 2027 obligations arrive — risk management under Article 9, logging under Article 12, human oversight under Article 14 — the deterministic layer produces its own compliance evidence as a by-product. The probabilistic layer requires you to build an evaluation harness beside it just to know what it does.

The Governance / Infrastructure Implication

The decision tree below belongs in your AI governance framework as a mandatory artefact: every AI use case touching a knowledge corpus gets a recorded retrieval-architecture decision, the same way every system gets a data classification. This is a one-page decision record, not a committee. It names the corpus, its change rate, its audit exposure, the chosen architecture, and the accepted failure mode. When a market surveillance authority or internal auditor asks — and after August 2, for transparency-scoped systems, they can — the record is the difference between "we chose this deliberately" and "the platform did it."

And for the system that has already failed — the compliance chatbot that quoted a policy clause that does not exist, discovered three months later in an internal audit — resist both reflexes. Do not rip it out; do not patch the prompt and move on. Freeze its scope: restrict it to the corpus segments where errors are tolerable. Then carve out the regulated 20% of the corpus — the part auditors actually look at — and compile it into the three-folder structure, where every answer resolves to a named, versioned file. Route queries by topic: deterministic layer for obligations, RAG for exploration. The April Build Lab edition is the implementation manual for that carve-out; a weekend of compilation work covers the anchor obligations. The migration is not a rewrite. It is a re-routing of the queries that can hurt you.

What Leaders Should Do Next

Run the decision tree against your current inventory — it takes an afternoon, and the output is a list of systems whose retrieval architecture does not match their audit exposure. Expect to find at least one "chat with your documents" deployment sitting on a regulated corpus. That system is your pilot for the carve-out pattern, and the decision records you produce become Article 12-adjacent evidence eighteen months before you are required to have it.

Artifact: The RAG-or-Records Decision Tree

For each AI use case touching a knowledge corpus, answer in order:

  1. Do answers from this system carry obligations? (Quoted in filings, relied on by customers, used for compliance decisions, subject to audit.) If no → RAG is fine. Stop here. If yes → continue.

  2. Is the corpus bounded and slow-moving? (Changes monthly or slower, fits in a knowable structure: regulation, policy, contracts, product docs.) If yes → compile it. Three-folder pattern: raw/ sources, wiki/ per-concept articles, index.md map. If no → continue.

  3. Can you split it? Identify the regulated subset (usually under 20% of the corpus, responsible for over 80% of audit exposure). Compile the subset; RAG the remainder; route queries by topic. If you cannot split → continue.

  4. Can you afford per-answer verification? (Human review before answers reach a decision, or a citation-checking layer that validates every retrieved claim against its source.) If yes → RAG with mandatory verification gate. If no → the use case is not ready for AI retrieval. Defer it — that is a governance outcome, not a failure.

  5. In every case: write the one-page decision record. Corpus, change rate, audit exposure, architecture chosen, failure mode accepted, owner named.

Forward this section to whoever owns your AI governance framework. It is designed to be stolen.

Enterprise Playbook

  1. For the AI Governance Lead: Add the retrieval-architecture decision record to your AI use-case intake form this month. One page per use case: corpus, change rate, audit exposure, architecture, accepted failure mode, owner. Backfill it for the three highest-exposure systems already in production.

  2. For the CTO: Inventory every "chat with your documents" feature currently enabled across your platform estate (Microsoft 365, Salesforce, ServiceNow, internal tools). For each, identify which corpus it points at and whether that corpus contains regulated content. The inventory is a two-day task for a platform team and it will surface at least one surprise.

  3. For the CISO / Head of Audit: Take the Stanford failure mode — real citation, unsupported claim — and add it to your AI audit test plan. Sample 25 answers from your highest-exposure internal AI tool and have a domain expert verify that each cited source actually supports the claim. The hit rate is your real hallucination number; budget remediation against it.

  4. For the CIO: Map your cloud estate against CADA's four sovereignty assurance levels now, while the framework is still a proposal. You are not complying early — you are learning the vocabulary your public-sector clients and regulated customers will use in RFPs within 18 months, and discovering which workloads could not attest above Level 1.

  5. For the DPO / Compliance Lead: Build the two-date transparency calendar and circulate it this week: August 2, 2026 for user disclosure, deep-fake labelling, and interaction transparency; December 2, 2026 for machine-readable watermarking. Name an owner for each obligation. The split deadline is where compliance calendars will quietly break.

What to Watch Next

  • June/July: formal adoption vote on the Digital Omnibus by Parliament and Council. Publication in the Official Journal follows; entry into force three days later. The December 2027 / August 2028 high-risk dates become binding law.

  • June 2026: adequacy assessment of the final Code of Practice. The Code was published June 10; the next question is whether the Commission and AI Board assess it positively, allowing signatories to rely on it as a recognised compliance route.

  • Before August 2: final Article 50 transparency guidelines. The AI Office's definitive reading of "inform users" and "machine-readable marks." Seven weeks out.

  • H2 2026: CADA committee assignments and first Parliament draft report. Early signal on whether the Level 3/4 sovereignty criteria survive hyperscaler lobbying.

  • August 2, 2026: Article 50 obligations apply. The first hard AI Act deadline for most deployers. It does not move.

Next Steps

What to read now?

Regulation

Research

Enterprise AI

Market Signals

That’s it for this week.

In April we said the quiet part: for regulated knowledge, three folders and an index beat a vector database. This week Brussels said its version of the same thing about clouds — sovereignty means being able to read, name, and control the thing your operations depend on. Whether the corpus is a regulation or a continent's compute, the principle holds: if you cannot reproduce why a system did what it did, you do not govern it. You host it.

Until next Thursday, João

OnAbout.AI delivers strategic AI analysis to enterprise technology leaders. European governance lens. Vendor-agnostic. Actionable.

If this landed in your inbox from a forward — subscribe here to get the full picture every week.

Keep Reading