Bilinear Transformers (TinyStories) A small collection of Transformers with bilinear MLPs, trained on the TinyStories dataset. tdooms/ts-medium 29.4M • Updated Nov 19, 2024 • 17 tdooms/ts-large 81.8M • Updated Nov 19, 2024 • 6 tdooms/ts-medium-scope Updated Oct 15, 2024
Bilinear Transformers (FineWeb) A small collection of Transformers with bilinear MLPs, trained on the FineWeb-Edu dataset. tdooms/fw-tiny-old 0.1B • Updated Oct 18, 2024 • 23 tdooms/fw-small 0.2B • Updated Nov 21, 2024 • 7 tdooms/fw-medium 0.3B • Updated Nov 19, 2024 • 96 tdooms/fw-medium-scope Updated Nov 20, 2024
Bilinear Transformers (TinyStories) A small collection of Transformers with bilinear MLPs, trained on the TinyStories dataset. tdooms/ts-medium 29.4M • Updated Nov 19, 2024 • 17 tdooms/ts-large 81.8M • Updated Nov 19, 2024 • 6 tdooms/ts-medium-scope Updated Oct 15, 2024
Bilinear Transformers (FineWeb) A small collection of Transformers with bilinear MLPs, trained on the FineWeb-Edu dataset. tdooms/fw-tiny-old 0.1B • Updated Oct 18, 2024 • 23 tdooms/fw-small 0.2B • Updated Nov 21, 2024 • 7 tdooms/fw-medium 0.3B • Updated Nov 19, 2024 • 96 tdooms/fw-medium-scope Updated Nov 20, 2024