vishaljoshi24's picture
Initial Commit
a080fe0

LayerSkip Training Recipe

Implements the training recipe as described in the LayerSkip paper.

Run training

cd scripts
python layer_skip_sft.py

Run benchmark

cd scripts
python benchmark_layer_skip.py