Voice Conversion using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks

来源 :第七届国际博士生论坛暨青年教师发展研讨会 | 被引量 : 0次 | 上传用户:huishouzhong2
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  This paper investigates the use of Deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks(DBLSTM-RNNs)for voice conversion.Temporal correlations across speech frames are not directly modeled in frame-based methods using conventional Deep Neural Networks(DNNs),which results in a limited quality of the converted speech.To improve the naturalness and continuity of the speech output in voice conversion,we propose a sequence-based conversion method using DBLSTM-RNNs to model not only the frame-wised relationship between the source and the target voice,but also the long-range context-dependencies in the acoustic trajectory.
其他文献
随着计算机技术的不断发展,多媒体现代教育技术走进中学课堂已经成为了现代教育的趋势.多媒体技术走进课堂,给数学教师带来了挑战的同时也带来了新的机遇.如何抓住这一机会,
甲醇是一种极其重要的化工基础原料,也是一种极具潜力的车用燃料和燃料电池的原料。在工业上,甲醇的生产主要采用气相法,以合成气为原料,在高温、高压条件下,使用Cu/ZnO基催化剂催
辣椒碱是辣椒产生辛辣味的主要物质。高纯度的辣椒碱具有生物活性,在医药、军事、农业等领域有广泛的应用。目前溶剂提取纯化辣椒碱具有设备投资小,处理量大的优点;但是辣椒碱的
  Named Data Networking(NDN)is a novel net-working architecture in which packets are transmitted based on data names instead of IP addresses.The new networkin
会议
  A new iterative robust adaptive beam forming(RAB)is introduced in this paper,which is robust against covariance matrix uncertainty and arbitrary steering ve
会议
目前玉米深加工行业仍然存在着综合利用程度不够,产品收益低,污染大等问题。究其原因是现有玉米加工行业是建立在传统提胚法分离的基础上,关键技术没有根本性突破,难以经济有
  This paper describes our SeemGo system for the task of Aspect Based Sentiment Analysis in SemEval-2014.The subtask of aspect term extraction is cast as a se
会议
本文通过试验研究扦插时间、扦插基质以及插穗规格对金缘连翘扦插生根率和根系质量的影响。结果表明:5月5日的扦插生根率最高,为76.7%;以蛭石和河沙比1:2的混合基质能显著提
  High-Density(HD)Head-related transfer function(HRTF)measurements are extremely time consuming and complicated.This problem can be solved by continuously rot
会议
东亚三角涡虫(Dujesia japonica)是扁形动物门的代表动物,是首次出现两侧对称、三胚层的类群。由于其具有较强的再生能力,可作为一种研究再生的模型生物,在研究动物起源、进化、