qwen30b_8layer

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Passthrough merge method.

Models Merged

The following models were included in the merge:

  • qwen30b

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
    - model: qwen30b
      layer_range: [0, 8] # or a range that only includes the first and last transformer block index
    
merge_method: passthrough # Use passthrough since you are taking layers from a single model

Downloads last month
6
Safetensors
Model size
5B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support