LLaMA-Factory Memo

DEPLOY LLaMA-Factory

If you use WSL, keep the LLaMA-Factory checkout outside the Ubuntu filesystem (for example on a Windows drive mounted under /mnt). That makes the files easier to work with from Windows.

DOWNLOAD THE MODEL

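Don't forget Git LFS. Without it, cloning a model repository fetches only small pointer files instead of the actual weights.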
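A quick way to notice a missing Git LFS step is to look for leftover pointer files. The sketch below is only an illustration: the function names and the idea of scanning the model directory are my own, not part of LLaMA-Factory. It relies on the fact that a Git LFS pointer file starts with a fixed `version https://git-lfs.github.com/spec/v1` line.

```python
from pathlib import Path

# First line of every Git LFS pointer file, as defined by the Git LFS pointer spec.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path: Path) -> bool:
    """Return True if `path` looks like an un-fetched Git LFS pointer file."""
    with open(path, "rb") as f:
        head = f.read(len(LFS_POINTER_PREFIX))
    return head == LFS_POINTER_PREFIX


def find_lfs_pointers(model_dir: str) -> list[Path]:
    """List files under `model_dir` (hypothetical path) that are still LFS pointers."""
    root = Path(model_dir)
    return sorted(
        p
        for p in root.rglob("*")
        # Skip Git's own bookkeeping directory; only check regular files.
        if p.is_file() and ".git" not in p.parts and is_lfs_pointer(p)
    )
```

If `find_lfs_pointers` returns anything for the downloaded model directory, running `git lfs install` and then `git lfs pull` inside that directory should fetch the real weight files.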

DATASET

Only sharegpt- and alpaca-style datasets are supported, but ChatGPT can generate code to help convert other formats. Also, remember to register the dataset in dataset_info.json.
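As a rough idea of what such a conversion script can look like, here is a minimal sketch that turns question/answer pairs into alpaca-style records and adds an entry to dataset_info.json (the registry file in LLaMA-Factory's data directory). The helper names, the dataset name, and the input shape are all hypothetical. As far as I know, an alpaca-style dataset only needs a `file_name` entry because alpaca is the default format, while a sharegpt-style one also needs `"formatting": "sharegpt"`; check data/README.md in the repository for the exact fields.

```python
import json
from pathlib import Path


def to_alpaca(pairs):
    """Turn (question, answer) pairs into alpaca-style records."""
    return [
        {"instruction": question, "input": "", "output": answer}
        for question, answer in pairs
    ]


def register_dataset(info_path, name, file_name):
    """Add or update the entry for `name` in a dataset_info.json file.

    Assumption: a plain alpaca-style dataset only needs `file_name`.
    """
    path = Path(info_path)
    info = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    info[name] = {"file_name": file_name}
    path.write_text(
        json.dumps(info, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return info
```

In practice you would dump the result of `to_alpaca(...)` to something like data/my_dataset.json with `json.dump`, then call `register_dataset("data/dataset_info.json", "my_dataset", "my_dataset.json")` so the dataset shows up in the training UI.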

Parameters

After training, the parameters and configs are stored in the config and saves directories.

EXPORT

When exporting the model, remember to select the Checkpoint path; Export quantization bit then switches to None automatically.

Convert it to a GGUF model

https://blog.csdn.net/Andy_shenzl/article/details/141709811
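For reference, the conversion usually comes down to running llama.cpp's `convert_hf_to_gguf.py` on the exported model directory. The sketch below wraps that call and then checks that the output starts with the `GGUF` magic bytes. The directory names and helper functions are hypothetical, and older llama.cpp checkouts used a hyphenated script name, so adjust to whatever your checkout contains.

```python
import subprocess
import sys
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # the first four bytes of a GGUF file


def build_convert_cmd(llama_cpp_dir, model_dir, outfile, outtype="f16"):
    """Build the command line for llama.cpp's HF-to-GGUF conversion script."""
    script = Path(llama_cpp_dir) / "convert_hf_to_gguf.py"
    return [
        sys.executable, str(script), str(model_dir),
        "--outfile", str(outfile),
        "--outtype", outtype,
    ]


def is_gguf(path):
    """Return True if the file at `path` starts with the GGUF magic bytes."""
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC


def convert(llama_cpp_dir, model_dir, outfile, outtype="f16"):
    """Run the conversion and fail loudly if the result is not a GGUF file."""
    subprocess.run(
        build_convert_cmd(llama_cpp_dir, model_dir, outfile, outtype),
        check=True,
    )
    if not is_gguf(outfile):
        raise RuntimeError(f"{outfile} does not look like a GGUF file")
```

Something like `convert("llama.cpp", "export/my_model", "my_model.gguf")` would then produce an f16 GGUF file, which can be quantized further with llama.cpp's own tools.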

Llama.cpp

https://zhuanlan.zhihu.com/p/695287607

pip list

Package                  Version      Editable project location
------------------------ ------------ -------------------------
accelerate               1.5.2
aiofiles                 23.2.1
aiohappyeyeballs         2.6.1
aiohttp                  3.11.16
aiosignal                1.3.2
annotated-types          0.7.0
anyio                    4.9.0
async-timeout            5.0.1
attrs                    25.3.0
audioread                3.0.1
auto_gptq                0.7.1
av                       14.2.0
certifi                  2025.1.31
cffi                     1.17.1
charset-normalizer       3.4.1
click                    8.1.8
contourpy                1.3.1
cycler                   0.12.1
datasets                 3.4.1
decorator                5.2.1
dill                     0.3.8
docstring_parser         0.16
einops                   0.8.1
exceptiongroup           1.2.2
fastapi                  0.115.12
ffmpy                    0.5.0
filelock                 3.13.1
fire                     0.7.0
fonttools                4.57.0
frozenlist               1.5.0
fsspec                   2024.6.1
gekko                    1.3.0
gradio                   5.21.0
gradio_client            1.7.2
groovy                   0.1.2
h11                      0.14.0
httpcore                 1.0.7
httpx                    0.28.1
huggingface-hub          0.30.1
idna                     3.10
jieba                    0.42.1
Jinja2                   3.1.4
joblib                   1.4.2
kiwisolver               1.4.8
lazy_loader              0.4
librosa                  0.11.0
llamafactory             0.9.3.dev0   /mnt/e/LLaMA-Factory
llvmlite                 0.44.0
markdown-it-py           3.0.0
MarkupSafe               2.1.5
matplotlib               3.10.1
mdurl                    0.1.2
mpmath                   1.3.0
msgpack                  1.1.0
multidict                6.3.2
multiprocess             0.70.16
networkx                 3.3
nltk                     3.9.1
numba                    0.61.0
numpy                    1.26.4
nvidia-cublas-cu12       12.1.3.1
nvidia-cuda-cupti-cu12   12.1.105
nvidia-cuda-nvrtc-cu12   12.1.105
nvidia-cuda-runtime-cu12 12.1.105
nvidia-cudnn-cu12        9.1.0.70
nvidia-cufft-cu12        11.0.2.54
nvidia-curand-cu12       10.3.2.106
nvidia-cusolver-cu12     11.4.5.107
nvidia-cusparse-cu12     12.1.0.106
nvidia-nccl-cu12         2.21.5
nvidia-nvjitlink-cu12    12.1.105
nvidia-nvtx-cu12         12.1.105
optimum                  1.24.0
orjson                   3.10.16
packaging                24.2
pandas                   2.2.3
peft                     0.15.0
pillow                   11.0.0
pip                      25.0
platformdirs             4.3.7
pooch                    1.8.2
propcache                0.3.1
protobuf                 6.30.2
psutil                   7.0.0
pyarrow                  19.0.1
pycparser                2.22
pydantic                 2.10.6
pydantic_core            2.27.2
pydub                    0.25.1
Pygments                 2.19.1
pyparsing                3.2.3
python-dateutil          2.9.0.post0
python-multipart         0.0.20
pytz                     2025.2
PyYAML                   6.0.2
regex                    2024.11.6
requests                 2.32.3
rich                     14.0.0
rouge                    1.0.1
rouge-chinese            1.0.3
ruff                     0.11.4
safehttpx                0.1.6
safetensors              0.5.3
scikit-learn             1.6.1
scipy                    1.15.2
semantic-version         2.10.0
sentencepiece            0.2.0
setuptools               75.8.0
shellingham              1.5.4
shtab                    1.7.1
six                      1.17.0
sniffio                  1.3.1
soundfile                0.13.1
soxr                     0.5.0.post1
sse-starlette            2.2.1
starlette                0.46.1
sympy                    1.13.1
termcolor                3.0.1
threadpoolctl            3.6.0
tiktoken                 0.9.0
tokenizers               0.21.0
tomlkit                  0.13.2
torch                    2.5.1+cu121
torchaudio               2.5.1+cu121
torchvision              0.20.1+cu121
tqdm                     4.67.1
transformers             4.50.0
triton                   3.1.0
trl                      0.9.6
typer                    0.15.2
typing_extensions        4.12.2
tyro                     0.8.14
tzdata                   2025.2
urllib3                  2.3.0
uvicorn                  0.34.0
websockets               15.0.1
wheel                    0.45.1
xxhash                   3.5.0
yarl                     1.18.3
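If you want to compare another environment against the list above, a small script can do it. This is only a sketch with made-up helper names: it parses the column output of `pip list` into a dict and reports packages whose installed version differs, using the standard library's `importlib.metadata`.

```python
from importlib import metadata


def parse_pip_list(text):
    """Parse `pip list` column output into {package_name: version}."""
    versions = {}
    for line in text.splitlines():
        parts = line.split()
        # Skip blank lines, the header row, and the dashed separator row.
        if len(parts) < 2 or parts[0] == "Package" or set(parts[0]) == {"-"}:
            continue
        versions[parts[0]] = parts[1]
    return versions


def diff_environment(expected):
    """Return {name: (expected, installed_or_None)} for mismatched packages."""
    mismatches = {}
    for name, want in expected.items():
        try:
            have = metadata.version(name)
        except metadata.PackageNotFoundError:
            have = None  # package is not installed at all
        if have != want:
            mismatches[name] = (want, have)
    return mismatches
```

Save the list to a text file, pass its contents to `parse_pip_list`, and feed the result to `diff_environment`; an empty dict means the versions match.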
This post is licensed under CC BY 4.0 by the author.