• wise_pancake@lemmy.ca
    18 days ago

    Addition is fairly trivial for a neural network to learn.

    A weighted sum of the inputs is literally the baseline model structure: set both weights to 1 and the output is input 1 plus input 2.
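To make that concrete, here is a minimal sketch, assuming a single linear neuron with no bias and no activation, fed raw numeric inputs and trained with plain stochastic gradient descent. The function names, learning rate, and step count are illustrative choices, not anything from a particular library or model.

```python
import random


def train_adder(steps=2000, lr=0.1, seed=0):
    """Train out = w1*a + w2*b to match a + b; returns the learned weights."""
    rng = random.Random(seed)
    # Start from arbitrary weights (assumption: uniform in [-1, 1]).
    w1, w2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
    for _ in range(steps):
        a, b = rng.uniform(-1, 1), rng.uniform(-1, 1)
        err = (w1 * a + w2 * b) - (a + b)  # prediction minus target
        # Gradient step on the squared error (constant factor folded into lr).
        w1 -= lr * err * a
        w2 -= lr * err * b
    return w1, w2


def predict(weights, a, b):
    """Apply the trained neuron to two numbers."""
    w1, w2 = weights
    return w1 * a + w2 * b


if __name__ == "__main__":
    weights = train_adder()
    print(weights)                 # both weights should end up very close to 1
    print(predict(weights, 3, 4))  # should be very close to 7
```

Note that this only shows the easy case where the network sees the numbers themselves as inputs.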

    • Zos_Kia@lemmynsfw.com
      17 days ago

      It’s actually a fairly involved process, because the tokens representing 1 and 4 don’t have any mathematical relationship to the numbers 1 and 4, so you can’t just do arithmetic on them directly to get 5.

      Apparently the way they do it is through a series of approximations, from big numbers to small numbers, not too dissimilar from the way a human would do it. The Anthropic team published a paper about it recently; I can dig it up if you’re interested.
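The point about tokens can be illustrated with a small sketch. The vocabulary below is entirely hypothetical: the IDs are made up for illustration and are not taken from any real tokenizer. It only shows that adding token IDs is not the same operation as adding the numbers the tokens spell out.

```python
# Hypothetical toy vocabulary: these IDs are invented for this example
# and do not come from any real tokenizer.
TOY_VOCAB = {"1": 352, "4": 87, "5": 1210, "+": 19, "=": 44}
ID_TO_TOKEN = {token_id: token for token, token_id in TOY_VOCAB.items()}


def add_token_ids(left, right):
    """Naively add the token IDs, as if they were the numbers themselves."""
    return TOY_VOCAB[left] + TOY_VOCAB[right]


def add_as_numbers(left, right):
    """Parse the token text as integers, add, and return the result as text."""
    return str(int(left) + int(right))


if __name__ == "__main__":
    naive = add_token_ids("1", "4")
    # The summed ID does not map back to the token "5" in this toy vocabulary.
    print(naive, ID_TO_TOKEN.get(naive))
    print(add_as_numbers("1", "4"), TOY_VOCAB["5"])
```

A model working on tokens has to learn some internal mapping from IDs like these to quantities before anything resembling addition can happen, which is why the simple weighted-sum picture does not carry over directly.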