• simple@lemmy.world
    1 year ago

    Feels like AI creators can only get away with using pre-2022 data for so long. At some point the information will be outdated and they’ll have to train on newer data, and it’ll be interesting to see if this is a problem that can be solved without harming the dataset’s quality.

    My guess is they’d need an AI that tries to find blatantly AI-generated data and remove it from the dataset. It won’t be 100% accurate, but it’ll be better than nothing.
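To make the idea concrete, here's a rough sketch of what that filtering step might look like. Everything in it is an assumption for illustration: the phrase list, `blatant_ai_score`, and `filter_dataset` are made-up names, and a real detector would presumably be a trained classifier rather than a phrase match. It only catches the most blatant cases, which is the point.

```python
from typing import Callable, Iterable, List

# Hypothetical giveaway phrases that chatbot output sometimes leaves behind.
# A real detector would likely be a trained model, not a fixed list.
TELLTALE_PHRASES = (
    "as an ai language model",
    "i cannot fulfill that request",
    "regenerate response",
)


def blatant_ai_score(text: str) -> float:
    """Toy stand-in for a detector: 1.0 if a giveaway phrase appears, else 0.0."""
    lowered = text.lower()
    return 1.0 if any(phrase in lowered for phrase in TELLTALE_PHRASES) else 0.0


def filter_dataset(
    docs: Iterable[str],
    score: Callable[[str], float] = blatant_ai_score,
    threshold: float = 0.5,
) -> List[str]:
    """Keep only documents whose detector score is below the threshold."""
    return [doc for doc in docs if score(doc) < threshold]


if __name__ == "__main__":
    sample = [
        "As an AI language model, I don't have personal opinions.",
        "Went hiking yesterday and the trail was completely washed out.",
    ]
    for doc in filter_dataset(sample):
        print(doc)
```

Swapping in a better `score` function is the only change needed to try a different detector. The threshold is where the "not 100% accurate" trade-off shows up: lower it and you throw away more human-written text, raise it and more AI-generated text slips through.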