TransNormerLLM TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints Text Generation • Updated Apr 7, 2024 • 13 • 15 OpenNLPLab/TransNormerLLM2-1B-300B Text Generation • Updated Feb 26, 2024 • 7 • 3 OpenNLPLab/TransNormerLLM2-3B-300B Text Generation • Updated Feb 26, 2024 • 5 • 3 OpenNLPLab/TransNormerLLM2-7B-300B Text Generation • Updated Feb 26, 2024 • 6 • 4
OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints Text Generation • Updated Apr 7, 2024 • 13 • 15
HGRN Hierarchically Gated Recurrent Neural Network for Sequence Modeling OpenNLPLab/HGRN-150M Text Generation • Updated Nov 10, 2023 • 5 • 2 OpenNLPLab/HGRN-355M Text Generation • Updated Nov 10, 2023 • 5 • 2 OpenNLPLab/HGRN-1B Text Generation • Updated Nov 10, 2023 • 8 • 8 Hierarchically Gated Recurrent Neural Network for Sequence Modeling Paper • 2311.04823 • Published Nov 8, 2023 • 2
Hierarchically Gated Recurrent Neural Network for Sequence Modeling Paper • 2311.04823 • Published Nov 8, 2023 • 2
TransNormerLLM TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints Text Generation • Updated Apr 7, 2024 • 13 • 15 OpenNLPLab/TransNormerLLM2-1B-300B Text Generation • Updated Feb 26, 2024 • 7 • 3 OpenNLPLab/TransNormerLLM2-3B-300B Text Generation • Updated Feb 26, 2024 • 5 • 3 OpenNLPLab/TransNormerLLM2-7B-300B Text Generation • Updated Feb 26, 2024 • 6 • 4
OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints Text Generation • Updated Apr 7, 2024 • 13 • 15
HGRN Hierarchically Gated Recurrent Neural Network for Sequence Modeling OpenNLPLab/HGRN-150M Text Generation • Updated Nov 10, 2023 • 5 • 2 OpenNLPLab/HGRN-355M Text Generation • Updated Nov 10, 2023 • 5 • 2 OpenNLPLab/HGRN-1B Text Generation • Updated Nov 10, 2023 • 8 • 8 Hierarchically Gated Recurrent Neural Network for Sequence Modeling Paper • 2311.04823 • Published Nov 8, 2023 • 2
Hierarchically Gated Recurrent Neural Network for Sequence Modeling Paper • 2311.04823 • Published Nov 8, 2023 • 2