Originality and Provenance in the AGI Era
What 'original' means when models are trained on enormous corpora of prior work, and how provenance systems are emerging in response.

Executive summary
Originality has always built on prior work. AGI-class models make the inheritance explicit and contestable. New provenance systems — content credentials, watermarks, training-data attribution — are emerging to support both creators and audiences.
Key concepts
- Originality as combination
- Training-data lineage
- Content credentials
- Watermarking
- Attribution
Originality as combination
Most originality is novel combination, framing, and execution of inherited ideas. AI makes the inheritance visible at a new scale, which is uncomfortable but not novel.
Training-data lineage
Models inherit from training corpora. Active research is improving attribution and detection of memorised content.
Content credentials and watermarks
Standards like C2PA attach verifiable provenance to media. Watermarking is improving but remains imperfect.
What audiences want
Audiences mostly want to know what they are looking at. Good provenance helps; perfect verification is not yet possible.
Key takeaways
- 01Originality is novel combination, not first creation.
- 02Training-data inheritance is now visible and contestable.
- 03Provenance standards are emerging.
- 04Audiences value disclosure even where verification is imperfect.
Frequently asked questions
Is AI-generated work plagiarism?
Sometimes, when it memorises specific source work. Mostly it is novel combination of patterns, which sits in the same category as most creative work.
How can I verify what I see?
Look for content credentials (C2PA), source-verifiable distribution, and provenance metadata. Treat unverified content as such.
Further reading
Related hubs
How AGI is reshaping art, authorship, originality, and creative economies — and why human creative direction becomes more valuable, not less.
How working artists, illustrators, and designers are integrating AI into studios and what changes in the work, the craft, and the output.
What it means to be an author when a model can generate competent prose, code, or music. The legal, ethical, and practical answer.