Qwen 32B

I am pleased with the performance and depth of the 32B Qwen MLX, running
locally on my Mac Studio M1 with 64GB of RAM.

Generation at 9 tokens per second is not fast, but it is acceptable.
A 15-second wait for the first output token is very good.
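
To put those two numbers together, here is a rough back-of-the-envelope sketch of how long a full reply might take. It assumes the generation rate stays constant at 9 tokens per second after the first token, which is a simplification; the function name and the reply lengths are made up for illustration and are not measurements.

```python
def estimated_response_seconds(num_tokens: int,
                               tokens_per_second: float = 9.0,
                               first_token_wait: float = 15.0) -> float:
    """Rough wall-clock time for a reply of num_tokens tokens.

    Assumes a constant generation rate after the initial wait,
    which is only an approximation of real behavior.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    return first_token_wait + num_tokens / tokens_per_second


# Hypothetical reply lengths, chosen only for illustration.
for n in (100, 300, 900):
    print(f"{n} tokens: about {estimated_response_seconds(n):.0f} s")
```

Under these assumptions, the initial wait dominates short replies, while the tokens-per-second rate dominates long ones.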







