Allison Horst
SVD
\[\begin{array}{rcl} W_{1} & = & W_{\text{pre}} + \Delta W_{0} \\ W_{2} & = & W_{1} + \Delta W_{1} \\ W_{3} & = & W_{2} + \Delta W_{2} \\ ... & ~ & ... \\ W_{\text{out}} \end{array}\]
\[W_{\text{out}} = W_{\text{pre}} + \displaystyle\sum_{i = 0} \Delta W_{i}\]
fine tuning scheme
matrix factorization
smaller rank \(r\)
larger rank \(r\)
Weight-Decomposed Low-Rank Adaptation
vectors!
\[W' = m \cdot \frac{W_{0} + BA}{||W_{0} + BA||_{c}}\]
DoRA decomposition
DoRA workflow