Study: Meta AI model can reproduce almost half of Harry Potter book
Niby wszyscy wiedzieli o tym że fejs używa spiraconych ebooków w treningu Llamy ale tutaj zostali złapani z majtami na kostkach.
"Specifically, the paper estimates that Llama 3.1 70B has memorized 42 percent of the first Harry Potter book well enough to reproduce 50-token excerpts at least half the time. (I’ll unpack how this was measured in the next section.)
Interestingly, Llama 1 65B, a similar-sized model released in February 2023, had memorized only 4.4 percent of Harry Potter and the Sorcerer's Stone. This suggests that despite the potential legal liability, Meta did not do much to prevent memorization as it trained Llama 3. At least for this book, the problem got much worse between Llama 1 and Llama 3."