• milicent_bystandr@lemm.ee
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 days ago

    Can someone tell me what’s meant by,

    The repository describes bitnet.cpp as offering “a suite of optimized kernels that support fast and lossless inference of 1.58-bit models on CPU

    Does it mean you need to run your OS with a specific kernel from bitnet.cpp? Or is it a different kind of ‘kernel’?

    • mindbleach@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      4 days ago

      I think they mean whatever’s handling the model. A program into which you feed this inherently restricted format, so it takes advantage of those limitations, in order to run more efficiently.

      Like if every number’s magnitude is 1 or 0, you don’t need to do floating-point multiplication.