philipportner's comments

philipportner · 2026-06-10T10:13:19 1781086399

> I'm not sure you can prompt a full, accurate, copy of a nontrivial codebase out of them. Even with zero temperature their accuracy is just not that high.

Granted, these are some of the most widely spread texts, and not codebases, but just fyi: https://arxiv.org/pdf/2601.02671

> For Claude 3.7 Sonnet, we were able to extract four whole books near-verbatim, including two books under copyright in the U.S.: Harry Potter and the Sorcerer’s Stone and 1984 (Section 4).

rcxdude · 2026-06-10T14:49:22 1781102962

That paper is basically using the LLM as a compression algorithm: it's prompting with some section of the book and it's reprompting if it doesn't give the right output. Notably this only works if you already have a copy of the book in question!

20k · 2026-06-11T11:42:12 1781178132

Distributing a compressed copy of something is still copyright infringement.

rcxdude · 2026-06-11T19:33:38 1781206418

You misunderstand my point: the LLM is not a losslessly compressed version of the text. You need to supply additional information from the original in order to 'extract' it from the LLM (and from that point of view, the extra information would be the compressed form).
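
To make the "extra information is the compressed form" idea concrete, here is a minimal sketch with a toy predictor standing in for the LLM. All names (`repeat_last`, `encode`, `decode`) are hypothetical and this is not the paper's procedure: the encoder records only the positions where the predictor guesses wrong, and the decoder needs exactly those corrections to rebuild the text.

```python
from typing import Callable, List, Tuple

# A predictor maps the tokens seen so far to a guess for the next token.
Predictor = Callable[[List[str]], str]


def repeat_last(context: List[str]) -> str:
    """Toy stand-in for an LLM (assumption): guess that the last token repeats."""
    return context[-1] if context else ""


def encode(tokens: List[str], predict: Predictor) -> List[Tuple[int, str]]:
    """Return (index, token) corrections for every position the predictor misses."""
    corrections = []
    for i, tok in enumerate(tokens):
        if predict(tokens[:i]) != tok:
            corrections.append((i, tok))
    return corrections


def decode(length: int, corrections: List[Tuple[int, str]], predict: Predictor) -> List[str]:
    """Rebuild the text from the predictor plus the stored corrections."""
    fixes = dict(corrections)
    out: List[str] = []
    for i in range(length):
        # Use the stored correction if there is one, else trust the predictor.
        out.append(fixes[i] if i in fixes else predict(out))
    return out
```

The better the predictor, the shorter the corrections list, but building that list still requires having the original text in hand.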

philipportner · 2026-06-03T17:48:46 1780508926

> My favorite side effect is that I now love all foods. Prior to this, I was a rather picky eater. Now I love everything!

I feel like there's a burntsushi joke hiding in there somewhere.

All the best Andrew.

burntsushi · 2026-06-03T17:55:18 1780509318

There is actually haha. I've always hated sushi. And sushi is now on my shortlist to try again. I can't wait.

(My handle comes from graffiti I found on the booth of a hot dog stand in Worcester MA called Coney Island[1]. I thought it was a cute oxymoron and adopted it on a silly whim. I only later learned that some sushi is indeed cooked.)

[1]: https://coneyislandlunch.com/

vimwizard · 2026-06-03T21:08:26 1780520906

Man, the origin of your screen name is the same level of lore as Rust being named after fungi, and not corrosion. Love it! Glad you're in better health again. Been using your software for nearly 10 years spanning before and after my career started. Thanks for all the work you do in open source.

Cheers

jcgrillo · 2026-06-04T14:08:55 1780582135

Wow! Great to see Coney Island is still there.. I was last there ca. 2007.. Also that's an awesome origin story for your internet handle.

burntsushi · 2026-06-04T16:29:37 1780590577

Yes! I just took my son there for the first time a few months ago (before the encephalitis hit).

jcgrillo · 2026-06-08T04:06:44 1780891604

Man that's awesome. That's motivation to get one of my vehicles in good enough shape for a day trip (I'm about 3-4hr north depending on choice of route). I have fond memories of Worcester, and of all the places I've been since I'd be most surprised if it's changed much :p

philipportner · 2026-06-01T18:29:50 1780338590

They reference the gist of 1cg in the honor code section of CS336.

https://cs336.stanford.edu/

philipportner · 2026-02-23T15:46:37 1771861597

FYI: Claude has output styles, and one of them is called `learning`. Instead of writing all the code itself, it adds `TODO(human)` markers with comments explaining how to implement them. It also adds `Insights` to its output that explain concepts to you.

This link also has a comparison to Skills further down.

https://code.claude.com/docs/en/output-styles#built-in-outpu...

philipportner · 2026-02-20T11:53:38 1771588418

Did you publish anything you could link wrt. query rewriting?

philipportner · 2026-02-05T21:49:22 1770328162

Granted, these are some of the most widely spread texts, but just fyi:

https://arxiv.org/pdf/2601.02671

> For Claude 3.7 Sonnet, we were able to extract four whole books near-verbatim, including two books under copyright in the U.S.: Harry Potter and the Sorcerer’s Stone and 1984 (Section 4).

D-Machine · 2026-02-06T02:17:39 1770344259

Note "near-verbatim" here is:

> "We quantify the proportion of the ground-truth book that appears in a production LLM’s generated text using a block-based, greedy approximation of longest common substring (nv-recall, Equation 7). This metric only counts sufficiently long, contiguous spans of near-verbatim text, for which we can conservatively claim extraction of training data (Section 3.3). We extract nearly all of Harry Potter and the Sorcerer’s Stone from jailbroken Claude 3.7 Sonnet (BoN N = 258, nv-recall = 95.8%). GPT-4.1 requires more jailbreaking attempts (N = 5179) and refuses to continue after reaching the end of the first chapter; the generated text has nv-recall = 4.0% with the full book. We extract substantial proportions of the book from Gemini 2.5 Pro and Grok 3 (76.8% and 70.3%, respectively), and notably do not need to jailbreak them to do so (N = 0)."

if you want to quantify the "near" here.
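
A rough way to get a feel for this kind of metric is the sketch below. It is an assumption-laden simplification, not the paper's Equation 7: it splits on whitespace, uses Python's `difflib.SequenceMatcher` to find contiguous matching blocks, and counts only blocks of at least `min_span` tokens. The function name and the `min_span` parameter are hypothetical.

```python
from difflib import SequenceMatcher


def long_span_recall(reference: str, generated: str, min_span: int = 5) -> float:
    """Fraction of reference tokens covered by long contiguous matches."""
    ref = reference.split()
    gen = generated.split()
    if not ref:
        return 0.0
    # autojunk=False so that frequent tokens are not silently ignored.
    matcher = SequenceMatcher(None, ref, gen, autojunk=False)
    # Matching blocks are non-overlapping and in order; short ones are dropped
    # so that scattered coincidental matches do not count as extraction.
    covered = sum(
        block.size
        for block in matcher.get_matching_blocks()
        if block.size >= min_span
    )
    return covered / len(ref)
```

Raising `min_span` makes the measure more conservative, since only longer verbatim runs count toward recall.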

ben_w · 2026-02-05T21:56:25 1770328585

Already aware of that work, that's why I phrased it the way I did :)

Edit: actually, no, I take that back, that's just very similar to some other research I was familiar with.

philipportner · 2026-02-05T21:46:51 1770328011

This seems related: it may not be a codebase, but they were able to extract "near" verbatim books out of Claude Sonnet.

https://arxiv.org/pdf/2601.02671

> For Claude 3.7 Sonnet, we were able to extract four whole books near-verbatim, including two books under copyright in the U.S.: Harry Potter and the Sorcerer’s Stone and 1984 (Section 4).

Aurornis · 2026-02-05T22:45:54 1770331554

Their technique really stretched the definition of extracting text from the LLM.

They used a lot of different techniques to prompt with actual text from the book, then asked the LLM to continue the sentences. I only skimmed the paper but it looks like there was a lot of iteration and repetitive trials. If the LLM successfully guessed words that followed their seed, they counted that as "extraction". They had to put in a lot of the actual text to get any words back out, though. The LLM was following the style and clues in the text.

You can't literally get an LLM to give you books verbatim. These techniques always involve a lot of prompting and continuation games.

D-Machine · 2026-02-06T02:13:31 1770344011

To make some vague claims explicit here, for interested readers:

> "We quantify the proportion of the ground-truth book that appears in a production LLM’s generated text using a block-based, greedy approximation of longest common substring (nv-recall, Equation 7). This metric only counts sufficiently long, contiguous spans of near-verbatim text, for which we can conservatively claim extraction of training data (Section 3.3). We extract nearly all of Harry Potter and the Sorcerer’s Stone from jailbroken Claude 3.7 Sonnet (BoN N = 258, nv-recall = 95.8%). GPT-4.1 requires more jailbreaking attempts (N = 5179) [...]"

So, yes, it is not "literally verbatim" (~96% verbatim), and it does indeed take A LOT of effort (hundreds or thousands of prompting attempts) to make this happen.

I leave it up to the reader to judge how much this weakens the more basic claims of the form "LLMs have nearly perfectly memorized some of their source / training materials".

I am imagining a grueling interrogation that "cracks" a witness, so he reveals perfect details of the crime scene that couldn't possibly have been known to anyone that wasn't there, and then a lawyer attempting the defense: "but look at how exhausting and unfair this interrogation was--of course such incredible detail was extracted from my innocent client!"

DiogenesKynikos · 2026-02-06T06:38:51 1770359931

The one-shot performance of their recall attempts is much less impressive. The two best-performing models were only able to reproduce about 70% of a 1000-token string. That's still pretty good, but it's not as if they spit out the book verbatim.

In other words, if you give an LLM a short segment of a very well known book, it can guess a short continuation (several sentences) reasonably accurately, but it will usually contain errors.

D-Machine · 2026-02-06T06:54:22 1770360862

Right, and this should be contextualized with respect to code generation. It is not crazy to presume that LLMs have effectively nearly perfectly memorized certain training sources, but the ability to generate / extract outputs that are nearly identical to those training sources will of course necessarily be highly contingent on the prompting patterns and complexity.

So, dismissals of "it was just translating C compilers in the training set to Rust" need to be carefully quantified, but, also, need to be evaluated in the context of the prompts. As others in this post have noted, there are basically no details about the prompts.

Calavar · 2026-02-06T01:34:54 1770341694

Sure, maybe it's tricky to coerce an LLM into spitting out a near-verbatim copy of prior data, but that's orthogonal to whether or not the data to create a near-verbatim copy exists in the model weights.

D-Machine · 2026-02-06T02:31:27 1770345087

Especially since the recall achieved in the paper is ~96% (based on a block-based longest-common-substring metric), the effort of extraction is utterly irrelevant.

Paradigma11 · 2026-02-06T18:02:24 1770400944

Like with those chimpanzees creating Shakespeare.

philipportner · 2025-12-02T13:19:20 1764681560

There's a link to the AoCO2025 tag for his blog posts in the op.