1. Vector: Embedding, Latent Representation, Latent Code

2. Binary Classifier 评估 Encoder


3. Feature Disentangle 特征拆解


3.1 声音变声



3.2 IN & AdaIN
IN = Instance Normalization (remove global information)
AdaIN = Adaptive Instance Normalization (only influence global information)

4. Discrete Representation

Binary vector (参数较少,还可以识别没有见到的样本)


参考文献
Machine Learning (2019,Spring)
Voice Conversion
网友评论