Qwen 32B

I am pleased with the performance and depth of the 32B Qwen MLX, running
locally on my Mac Studio M1 with 64GB of RAM.

9 tokens per second is not fast, but acceptable.
A 15-second wait for the first token output is very good.








As an Amazon Associate I earn from qualifying purchases.

No comments:

Post a Comment

Post Scriptum

The views in this article are mine and do not reflect those of my employer.
I am preparing to cancel the subscription to the e-mail newsletter that sends my articles.
Follow me on:
X.com (Twitter)
LinkedIn
Google Scholar

Popular Recent Posts

Most Popular Articles

apt quotation..