Model Merging
Diagram
Does this really work? Can neural networks be added?
- Paper: Editing Models with Task Arithmetic
- Paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages
Learning via addition
- Tuning the ratio (scaling coefficient) used when adding usually gives better results
- Paper: Model Stock: All we need is just a few fine-tuned models
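
A minimal sketch of learning via addition, assuming the task arithmetic formulation: a task vector is the fine-tuned weights minus the pre-trained weights, and the merged model is the pre-trained weights plus a scaled sum of task vectors. The `Weights` dict, the function names, and the `scale` parameter are hypothetical stand-ins for illustration, not any library's API; real models would use tensors instead of plain lists.

```python
from typing import Dict, List

# Hypothetical stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Task vector: fine-tuned weights minus pre-trained weights, per parameter."""
    return {
        name: [f - p for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


def add_task_vectors(pretrained: Weights, vectors: List[Weights], scale: float = 1.0) -> Weights:
    """Add the scaled sum of task vectors to the pre-trained weights."""
    merged = {}
    for name, base in pretrained.items():
        # Element-wise sum of all task vectors for this parameter.
        total = [sum(v[name][i] for v in vectors) for i in range(len(base))]
        merged[name] = [b + scale * t for b, t in zip(base, total)]
    return merged


# Toy weights (assumed values, for illustration only).
pretrained = {"w": [1.0, 2.0]}
finetuned_a = {"w": [1.5, 2.0]}
finetuned_b = {"w": [1.0, 3.0]}

vectors = [task_vector(pretrained, finetuned_a), task_vector(pretrained, finetuned_b)]
merged = add_task_vectors(pretrained, vectors, scale=0.5)
print(merged)
```

In practice `scale` would be chosen on held-out validation data, which is where the note about tuning the ratio applies.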
Example
Forgetting via negation (Machine Unlearning)
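
A minimal sketch of forgetting via negation, assuming the task arithmetic formulation: the task vector of the behavior to be removed is subtracted from the pre-trained weights instead of added. The `Weights` dict, `negate_task`, and `scale` are hypothetical names used only for illustration.

```python
from typing import Dict, List

# Hypothetical stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def negate_task(pretrained: Weights, finetuned: Weights, scale: float = 1.0) -> Weights:
    """Subtract the scaled task vector (finetuned - pretrained) from the pre-trained weights."""
    return {
        name: [p - scale * (f - p) for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


# Toy weights (assumed values): `finetuned_unwanted` was tuned on the behavior to forget.
pretrained = {"w": [1.0, 2.0]}
finetuned_unwanted = {"w": [2.0, 1.0]}

edited = negate_task(pretrained, finetuned_unwanted, scale=0.5)
print(edited)
```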
Task analogies
- Task A : Task B = Task C : Task D
- When Task D is unavailable, Task D can be inferred from Tasks A, B, and C, so the model learns Task D
- Paper
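
The analogy above can be sketched as vector arithmetic, assuming the task arithmetic formulation in which the Task D vector is approximated as the Task C vector plus the difference between the Task B and Task A vectors. The `Weights` dict and all function names are hypothetical, chosen only for illustration.

```python
from typing import Dict, List

# Hypothetical stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Task vector: fine-tuned weights minus pre-trained weights, per parameter."""
    return {
        name: [f - p for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


def analogy_weights(pretrained: Weights, ft_a: Weights, ft_b: Weights,
                    ft_c: Weights, scale: float = 1.0) -> Weights:
    """Estimate Task D weights via tau_D ~= tau_C + (tau_B - tau_A)."""
    tau_a = task_vector(pretrained, ft_a)
    tau_b = task_vector(pretrained, ft_b)
    tau_c = task_vector(pretrained, ft_c)
    return {
        name: [
            p + scale * (c + (b - a))
            for p, a, b, c in zip(pretrained[name], tau_a[name], tau_b[name], tau_c[name])
        ]
        for name in pretrained
    }


# Toy weights (assumed values, for illustration only).
pretrained = {"w": [0.0, 0.0]}
ft_a = {"w": [1.0, 0.0]}
ft_b = {"w": [1.0, 1.0]}
ft_c = {"w": [2.0, 0.0]}

estimated_d = analogy_weights(pretrained, ft_a, ft_b, ft_c)
print(estimated_d)
```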