LongMamba https://github.com/jzhang38/LongMamba
PY007/LongMamba_16384_bs128_step400 Updated Apr 3, 2024 • 10 • 5
PY007/tokenized_slim6B_train_neox_4096 Viewer • Updated Jan 30, 2024 • 1.37M • 41
PY007/tokenized_slim6B_train_neox_16384 Viewer • Updated Feb 1, 2024 • 341k • 46 • 3
EasyContext https://github.com/jzhang38/EasyContext
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_1M Viewer • Updated Apr 19, 2024 • 5.04k • 80 • 2
PY007/slimpajama_llama_tokenized_upsample_4096_chunk_256K Viewer • Updated Apr 19, 2024 • 3.94k • 93 • 1
PY007/EasyContext-1M-Llama-2-7B Text Generation • 7B • Updated Apr 7, 2024 • 14 • 4
PY007/slimpajama_mistral_tokenized_upsample_4096_chunk_128K Viewer • Updated Apr 7, 2024 • 37.9k • 95