| | readme: fix reference to used dataset | stefan-it | Oct 23, 2024 |
| | figure: add re-trained loss curve for training | stefan-it | Oct 27, 2024 |
| | model: add re-trained xLSTM model with grouped corpus for pretraining | stefan-it | Nov 12, 2024 |
| | readme: fix markdown | stefan-it | Oct 27, 2024 |
| | readme: mention potential bug in pretraining (truncated Wikipedia articles are used) | stefan-it | Oct 27, 2024 |
| | xlstm-config: temporarily introduce new hidden_size parameter | stefan-it | Oct 24, 2024 |
| | readme: include some new logo :-) | stefan-it | Oct 27, 2024 |
| | readme: update information about final xLSTM model (one epoch over corpus) | stefan-it | Oct 27, 2024 |
| | model: add newly trained xLSTM model (with grad clipping) | stefan-it | Nov 12, 2024 |
| | readme: cleanup configuration example | stefan-it | Oct 27, 2024 |
| | readme: mention Tristan | stefan-it | Nov 12, 2024 |
| | readme: add more training details | stefan-it | Nov 12, 2024 |
| | readme: add example usage | stefan-it | Oct 27, 2024 |
| | readme: mention uploaded checkpoint | stefan-it | Nov 12, 2024 |
| | model: add generation config | stefan-it | Nov 9, 2024 |
| | figure: add training loss overview | stefan-it | Nov 12, 2024 |
| | readme: update | stefan-it | Oct 21, 2024 |
| | readme: finalize training section | stefan-it | Nov 12, 2024 |
| | model: add best model | stefan-it | Oct 27, 2024 |
| | readme: minor tweaks | stefan-it | Oct 31, 2024 |