AI Act, Proprietà intellettuale, originalità e responsabilità ai tempi dell’AI

All contents of this article are protected by copyright and reflect original elaboration. Reproduction, even partial, without the author’s permission is prohibited

Editoriale: Eudemonia, Gala&Syn Project

(EN – For the English version of this article, please scroll down)

Una domanda che torna, in forma nuova

Cos’è la creatività, oggi? E cos’è l’originalità, in un ecosistema informativo dove i contenuti vengono prodotti, replicati, sintetizzati e ripubblicati con una velocità che il diritto fatica a inseguire?

La domanda non è retorica. È la cornice entro cui si muovono, contemporaneamente, l’AI Act europeo (Regolamento UE 2024/1689), la prima legge italiana sull’intelligenza artificiale (Legge 23 settembre 2025, n. 132), la Direttiva Copyright (UE) 2019/790 e la giurisprudenza in lenta formazione su entrambe le sponde dell’Atlantico.
Vale la pena fermarsi a pensare, senza scorciatoie e senza prese di posizione di parte. Perché il punto, prima ancora che giuridico, è culturale.

Il quadro normativo, in sintesi

L’AI Act è in vigore dal 1° agosto 2024, con un’applicazione progressiva: alcune disposizioni sono operative dal 2 febbraio 2025, gli obblighi per i modelli di IA generativa ad uso generale dal 2 agosto 2025, e la piena operatività è fissata per il 2 agosto 2026. Il Regolamento non contiene norme specifiche sulla proprietà intellettuale: rinvia alla Direttiva Copyright, e in particolare al suo articolo 4, che disciplina il text and data mining (TDM), ossia l’estrazione massiva di testi e dati per finalità anche di addestramento. La regola, sintetizzata, è duplice: l’accesso alle opere deve essere legittimo, e i titolari dei diritti possono opporsi (cosiddetto opt-out) all’uso a fini diversi dalla ricerca scientifica.

L’Italia ha aggiunto un tassello con la Legge 132/2025, in vigore dal 10 ottobre 2025. L’articolo 25 modifica la storica Legge sul diritto d’autore (L. 633/1941) e introduce il nuovo articolo 70-septies, che ribadisce il principio di “riserva di umanità”: sono protette le opere dell’ingegno umano, anche quando create con l’ausilio di strumenti di IA, purché siano frutto del lavoro intellettuale dell’autore. I contenuti prodotti interamente da un sistema generativo, senza apporto umano qualificato, non accedono alla tutela autoriale. Sul fronte sanzionatorio, viene aggiunta all’art. 171 LDA la lettera a-ter, che punisce con multa chi riproduce o estrae testo o dati da opere disponibili online o in banche dati in violazione delle eccezioni TDM, anche mediante sistemi di intelligenza artificiale.

A questo si affianca il provvedimento del Garante Privacy del maggio 2024 sul web scraping per finalità di addestramento, che indica le possibili cautele tecniche (creazione di aree riservate, clausole nei termini di servizio, file robots.txt, sistemi anti-bot) a disposizione dei titolari dei siti.

A questo punto il sistema, a parole, è chiaro. Nei fatti, lo è meno.

Un caso, di quelli che capitano sempre più spesso

Immaginiamo, perché non è un’ipotesi di scuola, un ricercatore e/o autore indipendente. Spende mesi a leggere, intervistare, verificare, mettere a sistema. Pubblica su un editoriale personale, magari con poche migliaia di lettori affezionati, un’analisi originale: una sintesi che non si trova altrove, un’angolazione che richiede ore se non giorni di studio per essere maturata.
Un anno dopo, su testate note, compare un articolo a firma di un giornalista riconosciuto. Stessa angolazione, stessa struttura, gli stessi tre o quattro snodi argomentativi nello stesso ordine. Nessuna citazione. Nessun rimando.

Sul sito del ricercatore, i log mostrano un picco di accessi automatizzati nei giorni precedenti, riconducibili a strumenti di scraping.
Cosa è successo, esattamente? E soprattutto: chi risponde, e di che cosa?

I piani della responsabilità

Il diritto italiano distingue, non da oggi, tra plagio testuale (la copia letterale), plagio evolutivo (la riformulazione che mantiene la sostanza creativa) e plagio mascherato (lo stesso contenuto travestito da forma diversa). La Cassazione, già nel 2018, ha chiarito che il plagio non si esaurisce nella contraffazione letterale: vale anche quando l’opera derivata, pur formalmente diversa, ricalca la struttura ideativa di quella originale. L’articolo 65 della LDA consente la riproduzione di articoli di attualità con citazione della fonte; l’articolo 101 vieta la riproduzione “parassitaria” delle notizie quando contraria agli usi onesti del giornalismo, senza citazione. Sul versante delle responsabilità interne alla testata, la disciplina è chiara: il direttore responsabile risponde, a titolo di omesso controllo, anche dei contenuti pubblicati a firma altrui. Il vicedirettore può rispondere se delegato formalmente. L’editore risponde sul piano civile.

Lo scraping, il diritto e l’asimmetria

Sul fronte dell’estrazione automatizzata, la cornice europea ha un’impostazione precisa: lo scraping è lecito quando l’accesso ai contenuti è legittimo e non è stato esercitato l’opt-out. Il Garante italiano, in un avvertimento rivolto al Gruppo Editoriale Gedi nel novembre 2024, ha mostrato di non considerare scontato che un editore possa cedere i propri archivi per l’addestramento di sistemi di IA senza una valutazione caso per caso del bilanciamento con le finalità originarie della pubblicazione.

Sul piano internazionale il contenzioso è in piena espansione. Il New York Times ha citato in giudizio OpenAI; nel 2025 Anthropic ha definito una class action sul copyright per cifre molto rilevanti; Reddit ha avviato un’azione contro Perplexity. NewsGuard, in un report del 2023, aveva identificato decine di siti che ripubblicavano articoli di testate autorevoli rielaborati da chatbot, senza citare le fonti, in alcuni casi conservando per errore frammenti di testo che ne tradivano l’origine artificiale. Il fenomeno è stato definito, da chi lo studia, come una possibile forma di “aggregazione efficiente” o, meno benevolmente, di “plagio iper-efficiente”.

La domanda scomoda

E qui arriva la domanda che vorrei lasciare ai lettori, senza pretendere di chiuderla. Quando una testata di rilievo pubblica un articolo che ricalca, anche solo nella struttura argomentativa, il lavoro di un ricercatore indipendente, e quel lavoro è stato in precedenza oggetto di scraping massivo, di chi è la responsabilità? Del giornalista, che ha firmato? Del caporedattore, che avrebbe dovuto verificare? Del direttore, che risponde di omesso controllo? Dell’eventuale strumento di IA usato come supporto, e di chi lo ha addestrato? Dell’editore, che incassa il vantaggio reputazionale e pubblicitario?
E ancora: quanto pesa, nella catena delle responsabilità, il fatto che il ricercatore non avesse “una firma alle spalle”? Che la redazione non avesse il tempo, o l’incentivo economico, per verificare le fonti? Che il sistema premi la velocità di pubblicazione più della tracciabilità dell’origine di un’idea?

Non sono domande retoriche. Sono i punti in cui il diritto, oggi, non offre risposte automatiche.

La Legge 132/2025 dichiara che la creatività resta umana e che la responsabilità resta umana. Ma la traduzione operativa di quel principio, chi prova cosa, davanti a quale giudice, con quali strumenti, in quanto tempo, è ancora largamente da costruire, nei contratti e nelle aule.

Una postilla, sul piano culturale

Il rischio, in questo passaggio, non è soltanto giuridico. È che si consolidi un’asimmetria strutturale: chi produce contenuti originali a piccola scala diventa, di fatto, un giacimento gratuito da cui altri estraggono valore. Chi ha redazione, ufficio legale, ufficio comunicazione, può difendersi e, all’occorrenza, attaccare. Chi non li ha, scopre che l’originalità, risorsa rara e costosa da produrre, non si traduce automaticamente in tutela.
Il diritto d’autore italiano, costruito nel 1941 attorno alla figura dell’autore-persona, viene oggi messo alla prova da un ecosistema in cui la replica costa quasi nulla e l’originalità costa moltissimo.

La Legge 132/2025 rafforza il principio di riserva di umanità. Tocca però a chi quella creatività la pratica, ricercatori, giornalisti, editori, lettori, decidere quanto pretenderla, riconoscerla, citarla. Le norme aiutano. Ma non on bastano da sole.

Riferimenti essenziali

– Regolamento (UE) 2024/1689 (AI Act), entrato in vigore il 1° agosto 2024, piena applicabilità dal 2 agosto 2026.
– Direttiva (UE) 2019/790 (Direttiva Copyright), in particolare l’art. 4 sul text and data mining e l’opt-out dei titolari dei diritti.
– Legge 23 settembre 2025, n. 132 (“Disposizioni e deleghe al Governo in materia di intelligenza artificiale”), in vigore dal 10 ottobre 2025; in particolare art. 25 (modifica all’art. 1 e nuovo art. 70-septies della L. 633/1941; nuova fattispecie all’art. 171 lett. a-ter).
– Legge 22 aprile 1941, n. 633 sul diritto d’autore, in particolare artt. 65, 70 e 101 sul diritto di cronaca e sullo sfruttamento dei contenuti giornalistici.
– Provvedimento del Garante per la protezione dei dati personali del 20 maggio 2024 in materia di web scraping per finalità di addestramento dell’IA generativa, e avvertimento al Gruppo Editoriale Gedi del 27 novembre 2024 (doc. web 10077129).
– Cassazione 2018, sulla nozione di plagio non limitata alla contraffazione letterale.
– Relazione del Parlamento europeo A10-0019/2026 su diritto d’autore e IA generativa.
– Report NewsGuard sui siti che ripubblicano contenuti rielaborati da chatbot senza citazione delle fonti.
– Contenzioso internazionale: NYT v. OpenAI, class action Anthropic (definita nel 2025), Reddit v. Perplexity.

(EN) A question that returns, in a new form

What is creativity today? And what is originality in an information ecosystem where content is produced, replicated, synthesized, and republished at a speed that the law struggles to keep up with?

This is not a rhetorical question. It is the framework within which several developments are simultaneously unfolding: the European AI Act (Regulation EU 2024/1689), Italy’s first law on artificial intelligence (Law of 23 September 2025, no. 132), the Copyright Directive (EU) 2019/790, and the slowly evolving case law on both sides of the Atlantic.
It is worth pausing to reflect, without shortcuts and without taking partisan positions. Because the issue, even before being legal, is cultural.

The regulatory framework, in brief

The AI Act has been in force since 1 August 2024, with a phased application: some provisions have applied since 2 February 2025, obligations for general-purpose generative AI models since 2 August 2025, and full applicability is set for 2 August 2026. The Regulation does not contain specific rules on intellectual property; it refers instead to the Copyright Directive, particularly Article 4, which governs text and data mining (TDM), i.e., the large-scale extraction of text and data, including for training purposes. The rule, in essence, is twofold: access to works must be lawful, and rights holders may object (so-called opt-out) to uses other than scientific research.

Italy added another piece with Law 132/2025, in force since 10 October 2025. Article 25 amends the historic Copyright Law (Law 633/1941) and introduces new Article 70-septies, which reaffirms the principle of the “human reserve”: works of human intellect are protected, even when created with the assistance of AI tools, provided they are the result of the author’s intellectual labor. Content produced entirely by a generative system, without meaningful human contribution, does not qualify for copyright protection. On the sanctions side, a new provision (Article 171, letter a-ter) penalizes, with fines, those who reproduce or extract text or data from works available online or in databases in violation of TDM exceptions, including through AI systems.

This is complemented by the Italian Data Protection Authority’s measure of May 2024 on web scraping for training purposes, which outlines possible technical safeguards (restricted areas, terms of service clauses, robots.txt files, anti-bot systems) available to website owners.

At this point, the system is clear in theory. In practice, less so.

A case: one that is increasingly common

Let us imagine, because this is not a hypothetical, a researcher working independently. They spend months reading, interviewing, verifying, and systematizing information. They publish, on a personal editorial platform with perhaps a few thousand loyal readers, an original analysis: a synthesis not found elsewhere, a perspective that requires hours, if not days, of study to develop.
A year later, a well-known publication features an article signed by a recognized journalist. Same angle, same structure, the same three or four argumentative nodes in the same order. No citation. No reference.

On the researcher’s site, logs show a spike in automated access in the preceding days, traceable to scraping tools.
What exactly happened? And above all: who is responsible, and for what?

Levels of liability

Italian law has long distinguished between textual plagiarism (literal copying), evolutionary plagiarism (reformulation that preserves creative substance), and disguised plagiarism (the same content presented in a different form). As early as 2018, the Court of Cassation clarified that plagiarism is not limited to literal counterfeiting: it also applies when a derived work, though formally different, reproduces the conceptual structure of the original. Article 65 of the Copyright Law allows reproduction of current affairs articles with attribution; Article 101 prohibits “parasitic” reproduction of news contrary to fair journalistic practices, without citation. Regarding internal responsibilities within a newsroom, the framework is clear: the editor-in-chief is liable, for failure of oversight, even for content published under others’ bylines. A deputy editor may be liable if formally delegated. The publisher is liable on the civil level.

Scraping, law, and asymmetry

On automated extraction, the European framework is clear: scraping is lawful when access to content is lawful and no opt-out has been exercised. The Italian Data Protection Authority, in a warning to Gruppo Editoriale Gedi in November 2024, showed that it does not take for granted that a publisher may license its archives for AI training without a case-by-case assessment balancing this with the original purposes of publication.

Internationally, litigation is expanding rapidly. The New York Times has sued OpenAI; in 2025 Anthropic settled a copyright class action for significant sums; Reddit has taken action against Perplexity. In a 2023 report, NewsGuard identified dozens of sites republishing articles from major outlets reworked by chatbots, without citing sources, in some cases inadvertently preserving fragments that revealed their artificial origin. The phenomenon has been described as a form of “efficient aggregation” or, less charitably, “hyper-efficient plagiarism.”

The uncomfortable question

And here is the question to leave with readers, without claiming to resolve it. When a major publication releases an article that mirrors, even just in its argumentative structure, the work of an independent researcher, and that work has previously been subject to large-scale scraping, who is responsible? The journalist who signed it? The section editor who should have verified it? The editor-in-chief, liable for lack of oversight? The AI tool possibly used as support, and those who trained it? The publisher, who benefits reputationally and commercially?
And further: how much weight should be given, in this chain of responsibility, to the fact that the researcher lacked institutional backing? That the newsroom lacked time or economic incentive to verify sources? That the system rewards speed of publication more than traceability of an idea’s origin?

These are not rhetorical questions. They are precisely the points where the law, today, does not provide automatic answers.

Law 132/2025 states that creativity remains human and responsibility remains human. But the operational translation of that principle, who proves what, before which court, with which tools, and in what time frame, is still largely to be built, both in contracts and in courtrooms.

A final note, on the cultural level

The risk in this transition is not only legal. It is the consolidation of a structural asymmetry: those who produce original content at a small scale effectively become a free reservoir from which others extract value. Those with a newsroom, legal department, and communications office can defend themselves and, when needed, take action. Those without discover that originality, a rare and costly resource to produce, does not automatically translate into protection.

Italian copyright law, built in 1941 around the figure of the author as a person, is now being tested by an ecosystem where replication costs almost nothing and originality costs a great deal.

Law 132/2025 reinforces the principle of the human reserve. But it is up to those who practice creativity, researchers, journalists, publishers, readers, to decide how much to demand it, recognize it, and cite it. Rules help. But they are not enough on their own.

Essential references

Regulation (EU) 2024/1689 (AI Act), in force since 1 August 2024, fully applicable from 2 August 2026.
Directive (EU) 2019/790 (Copyright Directive), in particular Article 4 on text and data mining and the opt-out for rights holders.
Law 23 September 2025, no. 132 (“Provisions and delegation to the Government on artificial intelligence”), in force since 10 October 2025; in particular Article 25 (amendment to Article 1 and new Article 70-septies of Law 633/1941; new provision in Article 171 letter a-ter).
Law 22 April 1941, no. 633 on copyright, in particular Articles 65, 70, and 101 on reporting rights and exploitation of journalistic content.
Italian Data Protection Authority measure of 20 May 2024 on web scraping for training generative AI, and warning to Gruppo Editoriale Gedi of 27 November 2024 (doc. web 10077129).
Court of Cassation 2018, on the notion of plagiarism not limited to literal counterfeiting.
European Parliament report A10-0019/2026 on copyright and generative AI.
NewsGuard report on sites republishing chatbot-reworked content without attribution.
International litigation: NYT v. OpenAI, Anthropic class action (settled in 2025), Reddit v. Perplexity.