公共文化服务平台

图像角点亚像素坐标提取研究: 2015年; 在角点检测技术的研究中融入一些改进的CSS算法判别曲率偏大像素点,提出一种新的基于支撑像素点曲线拟合的角点亚像素重定位算法,通过实验证明该算法会在CSS角点判别的基础上更加精确的提取角点坐标值,在精度性能上相对于改进CSS算法提高了17%左右.; 夏菽兰孙明泽张炜宇赵力; 关键词：图像测量亚像素

Intelligibility evaluation of enhanced whisper in joint time-frequency domain被引量：1: 2014年; Some factors influencing the intelligibility of the enhanced whisper in the joint time-frequency domain are evaluated. Specifically, both the spectrum density and different regions of the enhanced spectrum are analyzed. Experimental results show that for a spectrum of some density, the joint time-frequency gain-modification based speech enhancement algorithm achieves significant improvement in intelligibility. Additionally, the spectrum region where the estimated spectrum is smaller than the clean spectrum, is the most important region contributing to intelligibility improvement for the enhanced whisper. The spectrum region where the estimated spectrum is larger than twice the size of the clean spectrum is detrimental to speech intelligibility perception within the whisper context.; 周健魏昕梁瑞宇赵力

基于HMM的中英文语音合成技术研究被引量：2: 2014年; 在SYN6658的中文TTS基础上,结合改进的英文合成技术,经过分析比较中文TTS和英文TTS的特性之后,提出了一种构建简单快速、占用空间小的中英文语音合成系统的方法。构建的系统能够实时快速地合成出中英文语音,系统简单、易实现且合成效果较好,是中英文语音合成产品的较好选择。; 纪正飚王吉林赵力; 关键词：语音合成隐马尔可夫模型文语转换系统

基于局部区域灰度亚像素法电缆护套厚度精确测量被引量：6: 2016年; 图像测量技术作为一种新兴的高性能测量技术,近年来在测量领域中占据越来越重要的地位。它具有非接触,精度高,速度快等优点,如今已广泛被用在几何尺寸测量领域。文中提出了一种基于局部区域灰度的亚像素边缘检测算法,并将其应用到电缆护套材料的厚度图像测量系统中,实现了精确快速的测量。; 沈舷周锋赵力

An adaptive multichannel loudness compensation method: 2016年; To alleviate the conflict between audibility and distortion in the conventional loudness compensation method, an adaptive multichannel loudness compensation method is proposed for hearing aids. The linear and wide dynamic range compression （WDRC） methods are alternately employed according to the dynamic range of the band-passed signal and the hearing range （HR） of the patient. To further reduce the distortion caused by the WDRC and improve the output signal to noise ratio （SNR） under noise conditions, an adaptive adjustment of the compression ratio is presented. Experimental results demonstrate that the output SNR of the proposed method in babble noise is improved by at least 1.73 dB compared to the WDRC compensation method, and the average speech intelligibility is improved by 6.0% and 5. 7%, respectively, compared to the linear and WDRC compensation methods.; 王侠梁瑞宇王青云申红明赵力邹采荣

Speech emotion recognition via discriminant-cascading dimensionality reduction被引量：1: 2016年; In order to accurately identify speech emotion information, the discriminant-cascading effect in dimensionality reduction of speech emotion recognition is investigated. Based on the existing locality preserving projections and graph embedding framework, a novel discriminant-cascading dimensionality reduction method is proposed, which is named discriminant-cascading locality preserving projections （DCLPP）. The proposed method specifically utilizes supervised embedding graphs and it keeps the original space for the inner products of samples to maintain enough information for speech emotion recognition. Then, the kernel DCLPP （KDCLPP） is also proposed to extend the mapping form. Validated by the experiments on the corpus of EMO-DB and eNTERFACE＇05, the proposed method can clearly outperform the existing common dimensionality reduction methods, such as principal component analysis （PCA）, linear discriminant analysis （LDA）, locality preserving projections （LPP）, local discriminant embedding （LDE）, graph-based Fisher analysis （GbFA） and so on, with different categories of classifiers.; 王如刚徐新洲黄程韦吴尘张昕然赵力; 关键词：DISCRIMINANTANALYSIS

A novel speech emotion recognition algorithm based on combination of emotion data field and ant colony search strategy被引量：3: 2016年; In order to effectively conduct emotion recognition from spontaneous, non-prototypical and unsegmented speech so as to create a more natural human-machine interaction; a novel speech emotion recognition algorithm based on the combination of the emotional data field （EDF） and the ant colony search （ACS） strategy, called the EDF-ACS algorithm, is proposed. More specifically, the inter- relationship among the turn-based acoustic feature vectors of different labels are established by using the potential function in the EDF. To perform the spontaneous speech emotion recognition, the artificial colony is used to mimic the turn- based acoustic feature vectors. Then, the canonical ACS strategy is used to investigate the movement direction of each artificial ant in the EDF, which is regarded as the emotional label of the corresponding turn-based acoustic feature vector. The proposed EDF-ACS algorithm is evaluated on the continueous audio）＇visual emotion challenge （AVEC） 2012 dataset, which contains the spontaneous, non-prototypical and unsegmented speech emotion data. The experimental results show that the proposed EDF-ACS algorithm outperforms the existing state-of-the-art algorithm in turn-based speech emotion recognition.; 查诚陶华伟张昕然周琳赵力杨平

Cascaded projection of Gaussian mixture model for emotion recognition in speech and ECG signals被引量：1: 2015年; A cascaded projection of the Gaussian mixture model algorithm is proposed.First,the marginal distribution of the Gaussian mixture model is computed for different feature dimensions, and a number of sub-classifiers are generated using the marginal distribution model.Each sub-classifier is based on different feature sets.The cascaded structure is adopted to fuse the sub-classifiers dynamically to achieve sample adaptation ability.Secondly,the effectiveness of the proposed algorithm is verified on electrocardiogram emotional signal and speech emotional signal.Emotional data including fidgetiness,happiness and sadness is collected by induction experiments.Finally,the emotion feature extraction method is discussed,including heart rate variability, the chaotic electrocardiogram feature and utterance level static feature.The emotional feature reduction methods are studied, including principle component analysis,sequential forward selection, the Fisher discriminant ratio and maximal information coefficient.The experimental results show that the proposed classification algorithm can effectively improve recognition accuracy in two different scenarios.; 黄程韦吴迪张晓俊肖仲喆许宜申季晶晶陶智赵力

基于混合Gauss归一化的语音转换方法被引量：3: 2013年; 针对非对称语料库情况下的语音转换,提出了一种基于混合Gauss归一化的语音转换方法。通过背景说话人模型,分别自适应训练得到源说话人和目标说话人模型。利用训练得到的模型自适应参数,提出了基于Gauss归一化的特征映射方法,为了进一步提高转换效果,进而提出了混合Gauss归一化的方法。针对说话人模型中未被更新的参数,采用KL散度(Kullback-Leibler divergence)方法进行了优化。最后通过主客观实验对提出的算法的有效性进行了仿真和验证。实验结果表明:该文提出的基于混合Gauss归一化的语音转换方法,在倒谱失真度、转换语音的目标倾向度以及感知质量上,都获得了接近基于对称语音库的传统Gauss混合模型(Gaussian mixture model,GMM)方法的效果。; 宋鹏王浩赵力; 关键词：语音转换

Novel feature fusion method for speech emotion recognition based on multiple kernel learning: 2013年; In order to improve the performance of speech emotion recognition, a novel feature fusion method is proposed. Based on the global features, the local information of different kinds of features is utilized. Both the global and the local features are combined together. Moreover, the multiple kernel learning method is adopted. The global features and each kind of local feature are respectively associated with a kernel, and all these kernels are added together with different weights to obtain a mixed kernel for nonlinear mapping. In the reproducing kernel Hilbert space, different kinds of emotional features can be easily classified. In the experiments, the popular Berlin dataset is used, and the optimal parameters of the global and the local kernels are determined by cross-validation. After computing using multiple kernel learning, the weights of all the kernels are obtained, which shows that the formant and intensity features play a key role in speech emotion recognition. The classification results show that the recognition rate is 78. 74% by using the global kernel, and it is 81.10% by using the proposed method, which demonstrates the effectiveness of the proposed method.; 金赟宋鹏宋鹏郑文明

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(61273266)

文献类型

领域

主题

机构

作者

传媒

年份

用户反馈

国家自然科学基金(61273266)

文献类型

领域

主题

机构

作者

传媒

年份

用户登录

用户反馈