Base model:
Qwen/Qwen2.5-0.5B-Instruct
Description
Test repo to experiment with calling generate
from the hub. It is a simplified implementation of greedy decoding.
Additional Arguments
left_padding
(int
, optional): number of padding tokens to add before the provided input
Output Type changes
(none)
- Downloads last month
- 59
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support