The MAMBA Model transformer that has a language modeling head on leading (linear layer with weights tied towards the enter
which describes how all the internal states are connected since they characterize the https://k2spiceshop.com/product/liquid-k2-on-paper-online/