Multimodal emotion recognition: research on emotion recognition based on multimodal fusion
2022-06-25 01:58:00 【Full stack programmer webmaster】
Abstract :
Emotion is important information that people convey when they communicate, and changes in emotional state affect perception and decision-making. Emotion recognition is an important research area within pattern recognition; it brings an emotional dimension to human-computer interaction. Emotion is expressed through several modalities, including facial expression, voice, posture, physiological signals, and text, so emotion recognition is essentially a multimodal fusion problem. This work proposes a multimodal fusion algorithm for emotion recognition: facial expression and speech features are extracted from facial image sequences and speech signals, and an emotion classifier that fuses the expression and speech modalities is built from hidden Markov models and a multilayer perceptron. An active appearance model of the facial expression images is established to locate and track facial feature points, and facial animation parameters are computed from the displacements of those points to serve as expression features. The speech signal is analyzed in the time and frequency domains, and the short-time average energy, pitch frequency, and formants of each frame are extracted as speech features. Using the extracted expression and speech features, the Viterbi algorithm is used to train a hidden Markov model for each facial expression and each speech emotion; the conditional probabilities of the feature vectors with respect to each hidden Markov model are then used to train a multilayer perceptron with the back-propagation learning algorithm. Experimental results show that the emotion recognition algorithm fusing expression and speech achieves high accuracy in recognizing emotional states such as sadness, anger, and disgust.
The proposed multimodal recognition algorithm makes good use of the emotional information in both video and audio. It improves substantially on recognition using the speech modality alone, improves somewhat on recognition using the expression modality alone, and is a practical emotion recognition algorithm.
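The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It shows one speech feature (short-time average energy per frame) and the fusion step, in which each sequence is scored against every emotion's hidden Markov model and the scores are concatenated into the input vector for a multilayer perceptron. The function names, the `(start, trans, emit)` model layout, and the use of discrete-observation HMMs (features quantized to symbol indices) are assumptions made for illustration; the abstract does not specify them, and log-likelihoods stand in here for the conditional probabilities it mentions.

```python
import numpy as np


def short_time_energy(signal, frame_len, hop):
    """Mean squared amplitude of each frame (short-time average energy)."""
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    return np.array([
        np.mean(signal[i * hop:i * hop + frame_len] ** 2)
        for i in range(n_frames)
    ])


def hmm_log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM.

    obs   : sequence of symbol indices (assumed quantized features)
    start : (S,) initial state probabilities
    trans : (S, S) state transition matrix
    emit  : (S, K) emission probabilities per state and symbol
    """
    start, trans, emit = (np.asarray(a, dtype=float) for a in (start, trans, emit))
    alpha = start * emit[:, obs[0]]
    log_p = 0.0
    for o in obs[1:]:
        scale = alpha.sum()          # rescale to avoid underflow
        log_p += np.log(scale)
        alpha = ((alpha / scale) @ trans) * emit[:, o]
    return log_p + np.log(alpha.sum())


def fusion_features(face_obs, speech_obs, face_hmms, speech_hmms):
    """Concatenate per-emotion log-likelihoods from both modalities.

    Each entry of face_hmms / speech_hmms is a hypothetical
    (start, trans, emit) tuple, one per emotion class. The returned
    vector would be the input to the multilayer perceptron.
    """
    face = [hmm_log_likelihood(face_obs, *m) for m in face_hmms]
    speech = [hmm_log_likelihood(speech_obs, *m) for m in speech_hmms]
    return np.array(face + speech)
```

With four emotion classes per modality, `fusion_features` would return an 8-dimensional vector for each sample; a classifier trained by back-propagation on those vectors would play the role of the fusion stage.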
Publisher: Full stack programmer webmaster. When reprinting, please credit the source: https://javaforall.cn/151800.html Original link: https://javaforall.cn