Towards Expressive Perception and Generation in Human-Computer Conversational Interaction
The construction of a natural interactive human-computer interaction has become an integral component of intelligent system development, constituting a core subject within the field of human-computer interaction. Grounded in the utilization of human perception theories, this study proposes a robust method for speech emotion recognition and a model for inferring user emotional state changes, thereby achieving a human-computer interaction experience characterized by both listening and articulating, and conveying sentiments effectively. Simultaneously, focusing on the audio-visual modalities within human-computer interaction, this research explores methods for constructing interactive feedback in auditory and visual modalities within real human-computer dialog scenarios. This aims to synchronize textual, verbal, and visual expressions in human-computer interaction, enhancing its naturalness, improving user experience, and augmenting satisfaction and pleasure during interaction. The research contributions are as follows:
Introducing a method that robustly identifies user emotional states in real human-computer speech dialogue scenarios through the utilization of local and global attention mechanisms. This method generates robust and distinguishable representations of speech emotion, thereby enhancing the robustness and accuracy of speech emotion recognition systems in authentic dialogue scenarios.
Sample Chapter(s)
Abstract (87 KB)
Components of the Book:
  • Abstract
  • Abbreviations
  • Chapter 1 Introduction
    • 1.1 Research Background and Significance
    • 1.2 Research Content and Contribution
  • Chapter 2 Related Work and Current Research
    • 2.1 Relevant Theories and Techniques
    • 2.2 Human-Machine Speech Dialogue Systems
    • 2.3 Speech Emotion Recognition
    • 2.4 Speech Synthesis
    • 2.5 Chapter Summary
  • Chapter 3 Speech Emotion Recognition in Human-Machine Dialogues
    • 3.1 Introduction of This Chapter
    • 3.2 Robust Speech Emotion Recognition in Complex Environments
    • 3.3 Experimental Results and Analysis
    • 3.4 Chapter Summary
  • Chapter 4 Predicting User Emotion Changes in Human-Computer Dialogue
    • 4.1 Introduction to This Chapter
    • 4.2 Definitions
    • 4.3 Data Observation and Analysis
    • 4.4 Predicting User Emotion Changes in Human-Computer Voice
    • 4.5 Experiment and Results Analysis
    • 4.6 Chapter Summary
  • Chapter 5 Personalized Expressive Natural Speech Generation
    • 5.1 Introduction to This Chapter
    • 5.2 Intrinsic Connection of Acoustic Parameters
    • 5.3 Natural Speech Synthesis Based on Structured Multi-Task Learning
    • 5.4 Expressive Speech Synthesis Based on Conditional Input Layer
    • 5.5 Personalized Speech Synthesis Based on Grid Modeling Method
    • 5.6 Experiments and Result Analysis
    • 5.7 Chapter Summary
  • Chapter 6 Generation of Visual Feedback in Human-Computer Interaction
    • 6.1 Introduction to This Chapter
    • 6.2 Speech-Driven Visual Feedback Method
    • 6.3 Robust Neural Network-Based Facial Video Rendering
    • 6.4 Chapter Summary
  • Chapter 7 Perception and Feedback of Expressiveness in Human-Computer Dialogue Systems
    • 7.1 Introduction to This Chapter
    • 7.2 Method for Generating System Feedback Based on Emotion Perception and Inference
    • 7.3 Framework of Human-Machine Voice Interaction System Based on “Emotional Intelligence”
    • 7.4 Experimental Results and Analysis
    • 7.5 Chapter Summary
  • Chapter 8 Conclusion and Future Work
    • 8.1 Work Summary
    • 8.2 Future Work Prospects
  • References
  • Acknowledgment
Readership: Students, academics, teachers and other people attending or interested in Human-Computer Conversational Interaction.

Abstract
Yaohua Bu, Runnan Li, Zhenwei You
PDF (87 KB)

Abbreviations
Yaohua Bu, Runnan Li, Zhenwei You
PDF (75 KB)

Chapter 1 Introduction
Yaohua Bu, Runnan Li, Zhenwei You
PDF (508 KB)

Chapter 2 Related Work and Current Research
Yaohua Bu, Runnan Li, Zhenwei You
PDF (45 KB)

Chapter 3 Speech Emotion Recognition in Human-Machine Dialogues
Yaohua Bu, Runnan Li, Zhenwei You
PDF (10221 KB)

Chapter 4 Predicting User Emotion Changes in Human-Computer Dialogue
Yaohua Bu, Runnan Li, Zhenwei You
PDF (21327 KB)

Chapter 5 Personalized Expressive Natural Speech Generation
Yaohua Bu, Runnan Li, Zhenwei You
PDF (60 KB)

Chapter 6 Generation of Visual Feedback in Human-Computer Interaction
Yaohua Bu, Runnan Li, Zhenwei You
PDF (45 KB)

Chapter 7 Perception and Feedback of Expressiveness in Human-Computer Dialogue Systems
Yaohua Bu, Runnan Li, Zhenwei You
PDF (1927 KB)

Chapter 8 Conclusion and Future Work
Yaohua Bu, Runnan Li, Zhenwei You
PDF (133 KB)

References
Yaohua Bu, Runnan Li, Zhenwei You
PDF (199 KB)

Acknowledgment
Yaohua Bu, Runnan Li, Zhenwei You
PDF (60 KB)
Yaohua Bu
Dr. Yaohua Bu, PhD in Design, Master of Science in Computer Science and Technology, is currently a lecturer in the Design Department of the School of Digital Media and Design Arts at Beijing University of Posts and Telecommunications. Her research interests include multimedia human-computer interaction and virtual human interaction design. She has published more than 10 academic papers at top international conferences such as ACM CHI, ACM MM, and AAAI.

Runnan Li
Dr. Runnan Li, Associate Professor in Beijing University of Posts and Telecommunications, member of the CCF Human-Computer Interaction Committee, member of the CCF Multimedia Technology Committee, has conducted extensive research in the field of human-computer interaction and has led the development and implementation of various HCI products in the industry, gaining deep insights into HCI applications.

Zhenwei You
Dr. Zhenwei You, Professor, member of the German Design Council, member of Japan Designers’ Association, member of Design Ergonomics Branch of Chinese Ergonomics Society, member of Beijing Mechanical Engineering Society, is currently the head of the design department in the School of Digital Media and Design Arts of Beijing University of Posts and Telecommunications, and the editor-in-chief of the American Journal Art and Design Review.

Copyright © 2006-2026 Scientific Research Publishing Inc. All Rights Reserved.
Top