Sandwich Batch Normalization

SaBN: addresses heterogeneity in feature distributions

  • features in a batch do not always follow the same distribution, and are not always cleanly separable either
  • a set of linear (affine) layers instead of one… placed after a shared linear layer
    • so essentially a BN followed by a categorical BN?
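
To make the "shared layer, then a set of layers" idea concrete, here is a minimal NumPy sketch of how I read the notes above. It is not the authors' implementation. The function name `sandwich_bn`, the argument names, and the choice to select the second affine layer by a per-sample class label are all assumptions made for illustration, and it uses batch statistics only (no running averages).

```python
import numpy as np


def sandwich_bn(x, labels, shared_gamma, shared_beta, class_gamma, class_beta, eps=1e-5):
    """Hypothetical sketch: normalize, apply a shared affine, then a per-class affine.

    x:            (N, C) batch of features
    labels:       (N,) integer class id per sample (assumed conditioning signal)
    shared_gamma: (C,) scale shared by all samples
    shared_beta:  (C,) shift shared by all samples
    class_gamma:  (K, C) one scale vector per class
    class_beta:   (K, C) one shift vector per class
    """
    # Step 1: plain batch normalization, per channel over the batch.
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)

    # Step 2: the shared affine layer sitting in the middle of the "sandwich".
    h = shared_gamma * x_hat + shared_beta

    # Step 3: one of K independent affine layers, picked per sample by its label.
    return class_gamma[labels] * h + class_beta[labels]


# Toy usage: 8 samples, 4 channels, 3 classes, identity-initialized parameters.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))
labels = np.array([0, 1, 0, 1, 2, 2, 0, 1])
out = sandwich_bn(
    x, labels,
    shared_gamma=np.ones(4), shared_beta=np.zeros(4),
    class_gamma=np.ones((3, 4)), class_beta=np.zeros((3, 4)),
)
```

With identity parameters this reduces to ordinary batch normalization. The per-class parameters in step 3 are what let different groups of samples end up with different scales and shifts.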