Hacker News

Is this paper wrong? - https://arxiv.org/abs/2311.09807


It shows that if you deliberately train LLMs on their own output in a loop, you get problems. That's not what synthetic data training does.


I understand and appreciate the clarification. However, wouldn't some synthetic data strategies, if misapplied, end up resembling that feedback loop and thus risk model collapse?
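The failure mode under discussion can be illustrated with a toy simulation (my own sketch, not the paper's experimental setup): repeatedly fit a Gaussian to samples drawn from the previous generation's fit. Estimation error compounds each generation and the fitted distribution's tails collapse, which is a minimal analogue of the recursive-training degradation being described.

```python
import numpy as np

# Toy model-collapse loop: each "generation" trains (fits a Gaussian)
# only on samples produced by the previous generation's model.
rng = np.random.default_rng(0)

n_samples = 100    # small sample per generation -> noisy estimates
generations = 300

mu, sigma = 0.0, 1.0  # generation 0: the "real data" distribution
for _ in range(generations):
    data = rng.normal(mu, sigma, n_samples)  # sample from previous model
    mu, sigma = data.mean(), data.std()      # refit on its own output

# The fitted std drifts toward zero: variance (the "tails") is lost.
print(f"std after {generations} generations: {sigma:.4f}")
```

By contrast, typical synthetic-data pipelines mix generated data with real data and filter it, rather than feeding a model exclusively its own unfiltered output, which is why they need not trigger this collapse.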



