The Cyber Solicitor

More Notes on LLMs and privacy leakage

A paper exploring the causes of privacy leakage in language models

Mahdi Assan
Nov 17, 2023

TL;DR

Carlini et al 2023, 4

These notes cover a 2023 paper by Carlini et al. on the factors that cause large language models (LLMs) to memorize their training data verbatim, which increases the risk of privacy leakage when that training data includes personal data.

I explored the risk of privacy leakage in a previous post on a 2020 paper by Carlini. Th…
