Sandwich Batch Normalization less than 1 minute read Published: February 26, 2021SaBN: address feature distribution heterogeneityfeatures in a batch is not always the same and not always separatea set of linear layers instead of one… after a shared linear layerso essentially a bn with a categorial bn?Share on Twitter Facebook LinkedIn Previous Next