Repository Details
This linear layer follows the Tokenformer approach: its parameters are stored as tokens that the input attends to, so parameters can be dynamically reallocated after training. It also adds a low-rank transformation, which effectively doubles the number of parameter tokens the layer can attend to.
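
The idea of attending to parameter tokens can be illustrated with a minimal sketch. It is not this repository's implementation: the class name `ParamTokenLinear`, the `grow` method, the initialization scale, and the use of a plain softmax over the scores are all assumptions made for illustration. The low-rank transformation is left out because the description does not specify its form.

```python
import math
import random


class ParamTokenLinear:
    """Hypothetical token-parameter attention layer (illustrative only)."""

    def __init__(self, d_in, d_out, n_tokens, seed=0):
        self._rng = random.Random(seed)
        self.d_in, self.d_out = d_in, d_out
        self.keys, self.values = [], []
        self.grow(n_tokens)

    def grow(self, n_new):
        """Append n_new parameter tokens; d_in and d_out stay unchanged."""
        for _ in range(n_new):
            # Each parameter token is a (key, value) pair of vectors.
            self.keys.append([self._rng.gauss(0.0, 0.02) for _ in range(self.d_in)])
            self.values.append([self._rng.gauss(0.0, 0.02) for _ in range(self.d_out)])

    def __call__(self, x):
        """Map one input vector of length d_in to an output of length d_out."""
        # Score the input against every key token.
        scores = [sum(xi * ki for xi, ki in zip(x, key)) for key in self.keys]
        # Plain softmax over parameter tokens (an assumption of this sketch).
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # The output is the attention-weighted sum of the value tokens.
        return [
            sum(w * value[j] for w, value in zip(weights, self.values))
            for j in range(self.d_out)
        ]


layer = ParamTokenLinear(d_in=4, d_out=3, n_tokens=8)
x = [0.1, -0.2, 0.3, 0.4]
y_before = layer(x)

layer.grow(8)  # add capacity without changing the layer's interface
y_after = layer(x)
print(len(layer.keys), len(y_before), len(y_after))
```

Because the input and output widths are fixed by the key and value vector lengths rather than by the number of tokens, the token count can change while the surrounding network stays the same. With the plain softmax used here, newly added tokens do shift the output, so preserving the pre-growth output would need a different normalization or initialization.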