tokenization of text
tokenization of images
images are partitioned into patches
each patch is flattened into a vector
image source: Shusen Wang
positional encoding of patches
audio abstraction
beat
timbre
pitch
harmony
ā¦
source: Valerio Velardo
signal domain
digital signal processing \(\rightarrow\) rule-based systems
traditional ML \(\rightarrow\) feature engineering
deep learning \(\rightarrow\) automatic feature engineering
source: Valerio Velardo