stemwats
/

anemll-Qwen3-0.6B-ctx4096-lut8_v0.3.4

Apple Neural Engine

Model card Files Files and versions

anemll-Qwen3-0.6B-ctx4096-lut8_v0.3.4 / meta.yaml

stemwats's picture

Upload folder using huggingface_hub

1eb3861 verified 16 days ago

history blame contribute delete

617 Bytes

	model_info:
	name: anemll-qwen3_0.6b_model_original-ctx4096
	version: 0.3.4
	description: \|
	Demonstarates running qwen3_0.6b_model_original on Apple Neural Engine
	Context length: 4096
	Batch size: 64
	Chunks: 1
	license: MIT
	author: Anemll
	framework: Core ML
	language: Python
	architecture: qwen3
	parameters:
	context_length: 4096
	batch_size: 64
	lut_embeddings: none
	lut_ffn: 8
	lut_lmhead: 8
	num_chunks: 1
	model_prefix: qwen
	embeddings: qwen_embeddings.mlmodelc
	lm_head: qwen_lm_head_lut8.mlmodelc
	ffn: qwen_FFN_PF_lut8.mlmodelc
	split_lm_head: 16