Evaluation
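We train a convnet on three acoustic feature representations: the constant-Q transform (CQT), time scattering coefficients (Scat1D), and joint time-frequency scattering (JTFS). For each representation, we report the classification accuracy and loss on the test set, along with a confusion matrix showing which playing techniques are mistaken for one another.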
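As a reminder of how the two evaluation quantities relate, the sketch below builds a confusion matrix from true and predicted class indices and derives the accuracy from its diagonal. It is a minimal, pure-Python illustration, not the evaluation code used to produce the results on this page; the function names and the toy labels are hypothetical.

```python
def confusion_matrix(y_true, y_pred, n_classes):
    """Count how often true class i (row) was predicted as class j (column)."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def accuracy(cm):
    """Fraction of examples on the diagonal, i.e. correctly classified."""
    correct = sum(cm[i][i] for i in range(len(cm)))
    total = sum(sum(row) for row in cm)
    return correct / total


# Hypothetical labels for a 3-class toy problem (not taken from the dataset)
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]

cm = confusion_matrix(y_true, y_pred, n_classes=3)
for row in cm:
    print(row)
print(f"accuracy = {accuracy(cm):.3f}")
```

Off-diagonal entries are the confusions: in the toy example, one item of class 0 is predicted as class 1 and one item of class 2 is predicted as class 0.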
CQT
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃        Test metric        ┃       DataLoader 0        ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│         test/acc          │     0.723718523979187     │
│         test/loss         │    0.6989585161209106     │
└───────────────────────────┴───────────────────────────┘
[Figure: Confusion matrix on the test set for the CQT model]
Scat1D
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃        Test metric        ┃       DataLoader 0        ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│         test/acc          │    0.9504778385162354     │
│         test/loss         │    0.14028948545455933    │
└───────────────────────────┴───────────────────────────┘
[Figure: Confusion matrix on the test set for the Scattering1D model]
JTFS
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃        Test metric        ┃       DataLoader 0        ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│         test/acc          │    0.9704604744911194     │
│         test/loss         │    0.07301218807697296    │
└───────────────────────────┴───────────────────────────┘
[Figure: Confusion matrix on the test set for the JTFS model]
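To put the three runs on a common scale, the reported test accuracies can be converted to error rates. The sketch below uses only the `test/acc` values reported in this section; the dictionary and variable names are illustrative, and the figures come from a single test run per representation, so small differences should be read with caution.

```python
# test/acc values copied from the three test tables in this section
test_acc = {
    "CQT": 0.723718523979187,
    "Scat1D": 0.9504778385162354,
    "JTFS": 0.9704604744911194,
}

# Error rate is the complement of accuracy
error_rate = {name: 1.0 - acc for name, acc in test_acc.items()}

for name, err in error_rate.items():
    print(f"{name}: {100 * err:.1f}% test error")

# Representation with the highest reported test accuracy
best = max(test_acc, key=test_acc.get)
print(f"highest test accuracy: {best}")
```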