Small casual language models trained for the evaluation of sample efficiency.
Daniel Christoph
J4bb4wukis
AI & ML interests
None yet
Organizations
None yet
models
9
J4bb4wukis/llama_208m_wikipedia_en_shuffeld
0.2B
•
Updated
J4bb4wukis/llama_360m_wikipedia_en_shuffeld
0.4B
•
Updated
•
1
J4bb4wukis/xlstm_406m_wikipedia_en_shuffeld
0.4B
•
Updated
•
1
J4bb4wukis/mamba2_432m_wikipedia_en_shuffeld
0.4B
•
Updated
•
4
J4bb4wukis/gpt2_355m_wikipedia_en_shuffeld
0.4B
•
Updated
J4bb4wukis/gpt2_209m_wikipedia_en_shuffeld
0.2B
•
Updated
•
1
J4bb4wukis/gpt2_124m_wikipedia_en_shuffeld
0.1B
•
Updated
•
1
J4bb4wukis/xlstm_247m_wikipedia_en_shuffeld
0.2B
•
Updated
•
1
J4bb4wukis/mamba2_172m_wikipedia_en_shuffeld
0.2B
•
Updated
•
4
datasets
0
None public yet