Publications -> Conference Papers

Local partial least square regression for spectral mapping in voice conversion


Authors: X. Tian, Z. Wu and E. S. Chng
Title: Local partial least square regression for spectral mapping in voice conversion
Abstract: Joint density Gaussian mixture model (JD-GMM) based method has been widely used in voice conversion task due to its flexible implementation. However, the statistical averaging effect during estimating the model parameters will result in over-smoothing the target spectral trajectories. Motivated by the local linear transformation method, which uses neighboring data rather than all the training data to estimate the transformation function for each feature vector, we proposed a local partial least square method to avoid the over-smoothing problem of JD-GMM and the over-fitting problem of local linear transformation when training data are limited. We conducted experiments using the VOICES database and measure both spectral distortion and correlation coefficient of the spectral parameter trajectory. The experimental results show that our proposed method obtain better performance as compared to baseline methods.
Keywords: Gaussian processes; Least squares approximations; Regression analysis; Speech processing
Conference Name: 5th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA'13)
Location: Kaohsiung, Taiwan
Publisher: IEEE
Year: 2013
Accepted PDF File: Local_partial_least_square_regression_for_spectral_mapping_in_voice_conversion_accepted.pdf
Permanent Link: http://dx.doi.org/10.1109/APSIPA.2013.6694332
Reference: X. Tian, Z. Wu, and E. S. Chng, “Local partial least square regression for spectral mapping in voice conversion,” in Proceedings of the 5th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA’13). IEEE, October–November 2013, pp. 1–6.
bibtex: 
@inproceedings{LILY-c12, 
   author	= {Tian, Xiaohai and Wu, Zhizheng and Chng, Eng Siong},
   title	= {Local partial least square regression for spectral mapping in voice conversion},  
   booktitle	= {Proceedings of the 5th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA'13)}, 
   year		= {2013}, 
   month	= {October--November}, 
   pages	= {1-6}, 
   location	= {Kaohsiung, Taiwan},
   publisher	= {IEEE},
}