

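Unified memory: a notable difference between MLX and other frameworks is the unified memory model. Arrays in MLX live in shared memory. Operations on MLX arrays can be performed on any of the supported device types without transferring data.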

MLX Documentation
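
As a minimal sketch of what the unified memory model means in practice (assuming `mlx` is installed on an Apple silicon Mac; the function name `add_on_both_devices` is made up for illustration), the same two arrays can be passed to an operation on the CPU and on the GPU, with the device chosen per operation rather than per array:

```python
def add_on_both_devices(n=100):
    """Add the same two arrays once on the CPU and once on the GPU."""
    # Imported lazily so this file can still be loaded where MLX is missing.
    import mlx.core as mx

    a = mx.random.normal((n,))
    b = mx.random.normal((n,))

    # The device is selected per operation through `stream`;
    # a and b are not moved or copied between the two calls.
    c_cpu = mx.add(a, b, stream=mx.cpu)
    c_gpu = mx.add(a, b, stream=mx.gpu)

    # MLX evaluates lazily, so force both results before returning.
    mx.eval(c_cpu, c_gpu)
    return c_cpu, c_gpu
```

Calling `add_on_both_devices()` on a machine with MLX should return two arrays holding the same sums; nothing in the sketch copies `a` or `b` to a device first.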


mkdir ml-explore && cd ml-explore
git clone https://github.com/ml-explore/mlx
git clone https://github.com/ml-explore/mlx-examples

python -m venv env
source env/bin/activate
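
Before installing anything, it can help to confirm that the virtual environment is active and that Python is running natively on Apple silicon. The check below is only a sketch; the function names are made up for illustration, and it assumes the goal is to run the examples on an Apple silicon Mac.

```python
import platform
import sys


def in_virtualenv():
    """True when the interpreter runs inside a venv-style environment."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def is_apple_silicon():
    """True for a native arm64 Python on macOS (False under Rosetta)."""
    return platform.system() == "Darwin" and platform.machine() == "arm64"


def environment_report():
    """Collect the three facts worth checking before installing packages."""
    return {
        "python": platform.python_version(),
        "virtualenv": in_virtualenv(),
        "apple_silicon": is_apple_silicon(),
    }


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key}: {value}")
```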


Model: Phi-2

  • Install dependencies
cd llms/phi2
pip install -r requirements.txt
  • Download and convert the model


mkdir microsoft
ln -s /Users/junjian/HuggingFace/microsoft/phi-2 microsoft/phi-2


python convert.py

This generates a weights.npz file that MLX can read.

-rw-r--r--  1 junjian  staff   5.2G 12 20 20:36 weights.npz
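
A .npz file is a zip archive containing one .npy file per array, so its contents can be listed with the standard library alone, without loading 5.2G of weights into memory. This is a sketch, not part of the example repository; `list_npz_entries` is a hypothetical helper name.

```python
import zipfile


def list_npz_entries(path):
    """Return (array_name, uncompressed_bytes) pairs for a .npz archive."""
    entries = []
    with zipfile.ZipFile(path) as archive:
        for info in archive.infolist():
            name = info.filename
            # Each array is stored as "<name>.npy" inside the archive.
            if name.endswith(".npy"):
                name = name[: -len(".npy")]
            entries.append((name, info.file_size))
    return entries


# Usage, once convert.py has produced the file:
#     for name, size in list_npz_entries("weights.npz"):
#         print(f"{size / 2**20:10.1f} MiB  {name}")
```

The reported sizes include the small .npy header of each entry, so they are slightly larger than the raw array data.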
  • Run
python phi2.py
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
[INFO] Generating with Phi-2...
Write a detailed analogy between mathematics and a lighthouse.

Answer: Mathematics is like a lighthouse that guides us through the darkness of uncertainty. Just as a lighthouse emits a steady beam of light, mathematics provides us with a clear path to navigate through complex problems. It illuminates our understanding and helps us make sense of the world around us.

Exercise 2:
Compare and contrast the role of logic in mathematics and the role of a compass in navigation.

Answer: Logic in mathematics is like a compass in navigation. It helps


# python phi2.py --prompt <your prompt here> --max_tokens <max_tokens_to_generate>
python phi2.py --prompt "Why is the sky blue?"
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
[INFO] Generating with Phi-2...
Why is the sky blue?

Answer: The sky appears blue because of the way that light interacts with the Earth's atmosphere. When sunlight enters the atmosphere, it is scattered in all directions by the air molecules. Blue light is scattered more than other colors because it travels in shorter, smaller waves. This is why we see the sky as blue.

Exercise 2:
What is the difference between a hypothesis and a theory?

Answer: A hypothesis is an educated guess or prediction about how something works.
  • Help
python phi2.py --help
usage: phi2.py [-h] [--prompt PROMPT] [--max_tokens MAX_TOKENS] [--temp TEMP] [--seed SEED]

Phi-2 inference script

  -h, --help            show this help message and exit
  --prompt PROMPT       The message to be processed by the model
  --max_tokens MAX_TOKENS, -m MAX_TOKENS
                        Maximum number of tokens to generate
  --temp TEMP           The sampling temperature.
  --seed SEED           The PRNG seed
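
To illustrate what `--temp` and `--seed` typically control, here is a pure-Python sketch of temperature sampling. It is not the code in phi2.py: the function name, the toy logits, and the convention that a temperature of 0 means greedy decoding are all assumptions made for the example.

```python
import math
import random


def sample_token(logits, temp, rng):
    """Pick an index from `logits`; temp == 0 is treated as greedy (argmax)."""
    if temp == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    # Dividing by the temperature sharpens (temp < 1) or flattens
    # (temp > 1) the distribution before the softmax.
    scaled = [x / temp for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


rng = random.Random(0)    # plays the role of --seed
logits = [2.0, 1.0, 0.1]  # made-up scores for a 3-token vocabulary
greedy = sample_token(logits, temp=0, rng=rng)
sampled = [sample_token(logits, temp=0.8, rng=rng) for _ in range(5)]
```

With the same seed the sampled sequence is reproducible; with a temperature of 0 the highest-scoring token is always chosen.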


Model: Qwen-1.8B

  • Convert the model
cd llms/qwen
mkdir Qwen
ln -s /Users/junjian/HuggingFace/Qwen/Qwen-1_8B Qwen/Qwen-1_8B
python convert.py
  • Run
python qwen.py
埃塞俄比亚的首都是亚的斯亚贝巴(Addis Ababa)
中国香港的首都是香港(Hong Kong)
python qwen.py --prompt "天空为什么是蓝色的?" --max_tokens 2048          
天空为什么是蓝色的? 天空为什么是蓝色的?这是一个常见的问题,但其实天空并不是蓝色的,而是由多种颜色组成的。以下是详细的解释:
1. 大气层:天空之所以呈现蓝色,是因为大气层中的气体分子会散射太阳光中的蓝色光。当太阳光穿过大气层时,蓝色光波长的光波更容易被散射,因此天空呈现出蓝色。而其他颜色的光波则更容易被散射,因此天空呈现出其他颜色。
2. 大气折射:天空之所以呈现出蓝色,还与大气折射有关。当太阳光穿过大气层时,它会受到大气层的折射。蓝色光波长的光波更容易被折射,因此天空呈现出蓝色。而其他颜色的光波则更容易被折射,因此天空呈现出其他颜色。
3. 大气散射:天空之所以呈现出蓝色,还与大气散射有关。当太阳光穿过大气层时,它会受到大气层的散射。蓝色光波长的光波更容易被散射,因此天空呈现出蓝色。而其他颜色的光波则更容易被散射,因此天空呈现出其他颜色。
4. 大气折射:天空之所以呈现出蓝色,还与大气折射有关。当太阳光穿过大气层时,它会受到大气层的折射。蓝色光波长的光波更容易被折射,因此天空呈现出蓝色。而其他颜色的光波则更容易被折射,因此天空呈现出其他颜色。

To use another Tongyi Qianwen (Qwen) model, pass it with --tokenizer; mind the capitalization: QWen/QWen

python qwen.py --tokenizer  QWen/QWen-7B
python qwen.py --tokenizer  QWen/QWen-7B-Chat

Model: Qwen-14B-Chat

  • Download the model
huggingface-cli download Qwen/Qwen-14B-Chat
# The command below lets the cached model be used for conversion
huggingface-cli download Qwen/Qwen-14B-Chat --local-dir Qwen/Qwen-14B-Chat --local-dir-use-symlinks False
  • Convert the model
ln -s /Users/junjian/HuggingFace/Qwen/Qwen-14B-Chat Qwen/Qwen-14B-Chat
python convert.py --model Qwen/Qwen-14B-Chat
python qwen.py --tokenizer Qwen/Qwen-14B-Chat --prompt "天空为什么是蓝色的?" --max_tokens 2048 
天空为什么是蓝色的? 天空之所以呈现蓝色,是因为大气中的气体和微粒会散射太阳光中的短波长颜色,如蓝色和紫色。这种散射现象被称为瑞利散射。由于短波长颜色的散射比长波长颜色(如红色和橙色)更强,所以当太阳光穿过大气层时,蓝色和紫色的光线会被散射到各个方向,使得我们看到的天空呈现出蓝色。在日落或日出时,太阳光需要穿过更多的大气层,因此更多的短波长颜色被散射掉,只剩下长波长颜色,所以天空呈现出橙色或红色。

The inference run below reached a peak memory usage of 46 GB.

python qwen.py --tokenizer Qwen/Qwen-14B-Chat --prompt 'Traceback (most recent call last): File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 441, in model, tokenizer = load_model(args.model, args.dtype) File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 393, in load_model return model, Tokenizer(args.model, config) File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 337, in init self._tokenizer = T5Tokenizer.from_pretrained( File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 2028, in from_pretrained return cls._from_pretrained( File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 2260, in _from_pretrained tokenizer = cls(*init_inputs, **init_kwargs) File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/models/t5/tokenization_t5.py", line 200, in init self.sp_model = self.get_spm_processor(kwargs.pop("from_slow", False)) File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/models/t5/tokenization_t5.py", line 224, in get_spm_processor model_pb2 = import_protobuf(f"The new behaviour of {self.class.name} (with self.legacy = False)") File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/convert_slow_tokenizer.py", line 43, in import_protobuf raise ImportError(PROTOBUF_IMPORT_ERROR.format(error_message)) ImportError: The new behaviour of T5Tokenizer (with self.legacy = False) requires the protobuf library but it was not found in your environment. Checkout the instructions on the installation page of its repo: https://github.com/protocolbuffers/protobuf/tree/master/python#installation and follow the ones that match your environment. Please note that you may need to restart your runtime after installation. 这个错误怎么解决?' --max_tokens 8000
Traceback (most recent call last): File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 441, in model, tokenizer = load_model(args.model, args.dtype) File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 393, in load_model return model, Tokenizer(args.model, config) File "/Users/junjian/GitHub/ml-explore/mlx-examples/t5/t5.py", line 337, in init self._tokenizer = T5Tokenizer.from_pretrained( File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 2028, in from_pretrained return cls._from_pretrained( File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 2260, in _from_pretrained tokenizer = cls(*init_inputs, **init_kwargs) File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/models/t5/tokenization_t5.py", line 200, in init self.sp_model = self.get_spm_processor(kwargs.pop("from_slow", False)) File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/models/t5/tokenization_t5.py", line 224, in get_spm_processor model_pb2 = import_protobuf(f"The new behaviour of {self.class.name} (with self.legacy = False)") File "/Users/junjian/GitHub/ml-explore/env/lib/python3.10/site-packages/transformers/convert_slow_tokenizer.py", line 43, in import_protobuf raise ImportError(PROTOBUF_IMPORT_ERROR.format(error_message)) ImportError: The new behaviour of T5Tokenizer (with self.legacy = False) requires the protobuf library but it was not found in your environment. Checkout the instructions on the installation page of its repo: https://github.com/protocolbuffers/protobuf/tree/master/python#installation and follow the ones that match your environment. Please note that you may need to restart your runtime after installation. 这个错误怎么解决??



1. 安装protobuf库。你可以使用pip来安装:

   pip install protobuf

2. 重启你的运行环境。有时候,即使你已经安装了protobuf,也需要重启你的运行环境才能使新的库生效。

如果你在使用Google Colab,你可能需要在终端中运行上述命令,而不是在代码单元格中运行。
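
The 46 GB peak reported for this run can be put in context with rough arithmetic. The sketch below assumes a nominal 14 billion parameters (taken from the model name) and compares two possible storage precisions; the precision actually used by convert.py is not stated here, so both figures are estimates, not measurements.

```python
def weight_memory_gb(n_params, bytes_per_param):
    """Memory for the weights alone, in decimal gigabytes (10**9 bytes)."""
    return n_params * bytes_per_param / 1e9


N_PARAMS = 14e9  # nominal size implied by the name "Qwen-14B" (assumption)
PEAK_GB = 46     # peak memory reported for the run

for label, nbytes in [("float16", 2), ("float32", 4)]:
    weights = weight_memory_gb(N_PARAMS, nbytes)
    verdict = "fits under" if weights <= PEAK_GB else "exceeds"
    print(f"{label}: weights ~{weights:.0f} GB ({verdict} the {PEAK_GB} GB peak)")
```

Under these assumptions 16-bit weights take about 28 GB, which would leave roughly 18 GB for everything else, presumably including the cache and activations for the long prompt. 32-bit weights alone would already exceed the observed peak.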

Stable Diffusion

Model: stabilityai/stable-diffusion-2-1, published by Stability AI on the Hugging Face Hub

  • Install dependencies
pip install -r requirements.txt
  • Run
python txt2image.py "A photo of an astronaut riding a horse on Mars." --n_images 4 --n_rows 2
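
The `--n_images 4 --n_rows 2` options in the command above ask for four images arranged in two rows. How an image index maps to a grid cell can be sketched as follows; the row-by-row fill order and the requirement that the image count be divisible by the row count are assumptions for illustration, not a description of txt2image.py itself.

```python
def grid_positions(n_images, n_rows):
    """Map each image index to a (row, column) cell, filling row by row."""
    if n_images % n_rows != 0:
        raise ValueError("n_images must be divisible by n_rows")
    n_cols = n_images // n_rows
    return [(i // n_cols, i % n_cols) for i in range(n_images)]


print(grid_positions(4, 2))  # [(0, 0), (0, 1), (1, 0), (1, 1)]
```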


T5

  • Install dependencies
pip install -r requirements.txt
pip install protobuf
  • Download and convert the model
python convert.py --model <model>
Model Name    Model Size (parameters)
t5-small      60 million
t5-base       220 million
t5-large      770 million
t5-3b         3 billion
t5-11b        11 billion
  • Run
python t5.py --model t5-base --prompt "translate English to German: A tasty apple"
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
[INFO] Generating with T5...
Input:  translate English to German: A tasty apple
Ein leckerer Apfel
Time: 0.37 seconds, tokens/s: 18.79
python t5.py --model google/mt5-small --prompt "translate English to Franch: A tasty apple."
You are using a model of type mt5 to instantiate a model of type t5. This is not supported for all configurations of models and can yield errors.
[INFO] Generating with T5...
Input:  translate English to Franch: A tasty apple.
Time: 0.05 seconds, tokens/s: 39.83
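
The timing lines above also give a rough idea of how many tokens each run produced. The sketch below multiplies the reported time by the reported rate; both numbers are rounded in the logs, so the result is only an estimate.

```python
def estimated_tokens(seconds, tokens_per_second):
    """Approximate token count from the rounded time and rate in the log."""
    return round(seconds * tokens_per_second)


# (time in seconds, tokens per second) as printed in the two runs above
runs = {
    "t5-base": (0.37, 18.79),
    "google/mt5-small": (0.05, 39.83),
}

for model, (seconds, rate) in runs.items():
    print(f"{model}: ~{estimated_tokens(seconds, rate)} tokens")
```

This gives about 7 tokens for t5-base and about 2 for google/mt5-small, which fits the empty output of the second run: generation appears to have stopped almost immediately, in line with the warning that mt5 checkpoints are not supported for all configurations.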


Whisper

  • Install dependencies
pip install -r requirements.txt
  • Install ffmpeg
# on macOS using Homebrew (https://brew.sh/)
brew install ffmpeg
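  • Run
import whisper

# speech_file: path to the audio file to transcribe; set it before this call.
# transcribe() returns a dict; the recognized text is under the "text" key.
text = whisper.transcribe(speech_file)["text"]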
Then the good soul openly sorted the boat and she had buoyed so long in secret and bravely stretched on alone.
