llm-stacking/StackingLaw_Factor_1.1B12L • Updated • 1
llm-stacking/StackingLaw_Factor_1.1B3L • Updated • 1
llm-stacking/StackingLaw_Factor_1.1B1L • Updated • 1
llm-stacking/StackingLaw_Timing_1.1B50BT • Updated • 1
llm-stacking/StackingLaw_Timing_1.1B20BT • Updated • 1
llm-stacking/StackingLaw_Timing_1.1B5BT • Updated • 1
llm-stacking/StackingLaw_Timing_1.1B1BT • Updated • 1
llm-stacking/StackingLaw_Timing_410M50BT • Updated • 1
llm-stacking/StackingLaw_Timing_410M20BT • Updated • 1
llm-stacking/StackingLaw_Timing_410M10BT • Updated • 1
llm-stacking/StackingLaw_Timing_410M5BT • Updated • 1
llm-stacking/StackingLaw_Timing_410M1BT • Updated • 1
llm-stacking/Ablation_PartialStack_123x7-456 • Updated • 1
llm-stacking/Ablation_PartialStack_1-234x7-56 • Updated • 1
llm-stacking/Ablation_PartialStack_12-34x10-56 • Updated • 1
llm-stacking/Ablation_PartialStack_1234-56x10 • Updated • 1
llm-stacking/Ablation_PartialStack_123-456x7 • Updated • 1
llm-stacking/Ablation_PartialStack_12-345x7-6 • Updated • 1
llm-stacking/Ablation_PartialStack_12-3456x5-56 • Updated • 1
llm-stacking/StackLLM_410M_750BToken • Text Generation • Updated • 5 • 2
llm-stacking/PartialStackLLM_7B_130BToken • Text Generation • Updated • 6 • 1
llm-stacking/StackLLM_7B_300BToken • Text Generation • Updated • 5 • 1
llm-stacking/LLM_7B_300BToken • Text Generation • Updated • 5 • 1
llm-stacking/StackLLM_3B_300BToken • Text Generation • Updated • 5 • 1
llm-stacking/LLM_410M_750BToken • Text Generation • Updated • 5 • 1
llm-stacking/G_zero_depth • Text Generation • Updated • 3 • 1
llm-stacking/G_zero_width • Text Generation • Updated • 3 • 1
llm-stacking/G_random_width • Text Generation • Updated • 3 • 1
llm-stacking/G_learn_depth • Text Generation • Updated • 5 • 1
llm-stacking/G_learn_width • Text Generation • Updated • 6 • 1