AI chatbots tend to choose violence and nuclear strikes in wargames

BlushedPotatoPlayers@sopuli.xyz · 9 months ago

AI chatbots tend to choose violence and nuclear strikes in wargames

MNByChoice@midwest.social · edit-2 9 months ago

https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html

However, this only worked for a model trained on a synthetic dataset of games uniformly sampled from the Othello game tree. They tried the same techniques on a model trained using games played by humans and had poor results. To me, this seemed like a major caveat to the findings of the paper which may limit its real world applicability. We cannot, for example, generate code by uniformly sampling from a code tree.

Author later discusses training on you data versus general datasets.

I am out of my depth, but does not seem to provide strong evidence for the modem not just repeating information that shows up a lot for the given inputs.

Warning: Some posts on this platform may contain adult material intended for mature audiences only. Viewer discretion is advised. By clicking ‘Continue’, you confirm that you are 18 years or older and consent to viewing explicit content.

AI chatbots tend to choose violence and nuclear strikes in wargames

AI chatbots tend to choose violence and nuclear strikes in wargames