Otter@lemmy.ca to Programming@programming.devEnglish · 26 days agoAnthropic can now track the bizarre inner workings of a large language modelwww.technologyreview.comexternal-linkmessage-square7fedilinkarrow-up11arrow-down10cross-posted to: [email protected][email protected]
arrow-up11arrow-down1external-linkAnthropic can now track the bizarre inner workings of a large language modelwww.technologyreview.comOtter@lemmy.ca to Programming@programming.devEnglish · 26 days agomessage-square7fedilinkcross-posted to: [email protected][email protected]
minus-squarewedge@lemmy.onelinkfedilinkarrow-up0·26 days ago“Why does it keep looking at Furry porn…?”
“Why does it keep looking at Furry porn…?”