🔎 View Tweet

exns@euxenus• about 1 month ago
I wrote a post analyzing the router component of H-Net Theirs is basically a weird L2 loss on F via G. I came up with a simplification and generalization. Top of attached image is the loss in the paper, bottom is the loss I propose https://t.co/ZAZFN4MBjn
