Microsoft researchers build 1-bit AI LLM with 2B parameters — model small enough to run on some CPUs

ThorrJo@lemmy.sdf.org · 7 days ago

milicent_bystandr@lemm.ee · 5 days ago

Can someone tell me what’s meant by,

The repository describes bitnet.cpp as offering “a suite of optimized kernels that support fast and lossless inference of 1.58-bit models on CPU”

Does it mean you need to run your OS with a specific kernel from bitnet.cpp? Or is it a different kind of ‘kernel’?

mindbleach@sh.itjust.works · edit-2 4 days ago

I think they mean whatever’s handling the model, not the OS kernel. Here a “kernel” is a low-level compute routine inside the program you feed this inherently restricted format into. It takes advantage of those limitations in order to run more efficiently.

Like if every weight’s magnitude is 1 or 0 (so the weight is -1, 0, or 1), you don’t need to do floating-point multiplication. You just add the input, subtract it, or skip it.
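
To make that concrete, here’s a rough Python sketch of the idea. It’s my own illustration, not code from bitnet.cpp: the names `ternary_dot` and `ternary_matvec` are made up, and I’m assuming the weights are already stored as plain -1/0/1 integers. The real kernels are presumably far more optimized than a Python loop.

```python
# Illustrative sketch only: hypothetical helper names, not the bitnet.cpp API.
# Assumes every weight is exactly -1, 0, or +1.

def ternary_dot(weights, activations):
    """Dot product with ternary weights, using only add/subtract."""
    total = 0.0
    for w, x in zip(weights, activations):
        if w == 1:
            total += x      # weight +1: add the activation
        elif w == -1:
            total -= x      # weight -1: subtract the activation
        # weight 0: contributes nothing, so skip it
    return total


def ternary_matvec(matrix, activations):
    """Matrix-vector product: one ternary dot product per row."""
    return [ternary_dot(row, activations) for row in matrix]


W = [[1, 0, -1],
     [-1, 1, 1]]
x = [0.5, 2.0, -1.5]

y = ternary_matvec(W, x)

# Same computation done the ordinary way, with multiplications, for comparison.
reference = [sum(w * xi for w, xi in zip(row, x)) for row in W]

print(y)          # [2.0, 0.0]
print(reference)  # [2.0, 0.0]
```

Both versions give the same answer, but the first one never multiplies anything, which is the kind of shortcut a kernel written for this format can exploit.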