The response from SGLang 0.5.10.post1 is garbled.
๐ฅ 1
#9 opened 2 days ago
by
zhazhahui080507
SGLang --speculative-algorithm NEXTN or EAGLE?
๐ 1
#6 opened 10 days ago
by
pathosethoslogos
VLLM warning about Using uncalibrated q_scale 1.0 and/or prob_scale 1.0 with fp8 attention
1
#5 opened 10 days ago
by
androiddrew
NVFP4
โ 8
#4 opened 11 days ago
by
celikburak
Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
5
#3 opened 11 days ago
by
CHNtentes
[Question] Can the model be run on Nvidia DGX Spark?
2
#2 opened 11 days ago
by
hubertshelley