by gwern 3 days ago

It is definitely not the first codebase an extensively RL-trained Claude has ever analyzed. How do you think it got so good?

2 days ago
[deleted]
spullara 2 days ago

Meaning it has no episodic memory of any of those analyses that it has done.

gwern a day ago

You didn't say anything about 'episodic', and that would be irrelevant to the point even if its long-term memory from training didn't count.