Base model:

Qwen/Qwen2.5-0.5B-Instruct

Description

Test repo to experiment with calling generate from the hub. It is a simplified implementation of greedy decoding.

Additional Arguments

left_padding (int, optional): number of padding tokens to add before the provided input

Output Type changes

(none)

Downloads last month
59
Safetensors
Model size
494M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support