Speech and Computer

Speech and Computer

26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25-28, 2024, Proceedings, Part II

Karpov, Alexey; Delic, Vlado

Springer International Publishing AG

01/2025

373

Mole

9783031780134

Pré-lançamento - envio 15 a 20 dias após a sua edição

Descrição não disponível.
1 Computational Paralinguistics.- A Cross-Multi-Modal Fusion Approach for Enhanced Engagement Recognition.- Automatic Assessment of Signs of Alcohol Dependency Syndrome from Spontaneous Speech.- An Enhanced Compact Convolution Transformer for Age, Gender and Emotion Detection in Egyptian Arabic Speech.- RAG and Few-Shot Prompting in Emotional Text Generation.- Sentiment Analysis for Egyptian Arabic-English Code-Switched Data using Traditional Neural Models and Advanced Language Models.- Automatic Detection of Irony Based on Acoustic Features and Facial Expressions.- Affective Computing.- Emotion Recognition by Vocalizations of Nonhuman Primates: Human and Automatic Classification.- MMHS: Multimodal Model for Hate Speech Intensity Prediction.- Multimodal Emotion Recognition using Compressed Graph Neural Networks.-Utilizing Speaker Models and Topic Markers for Emotion Recognition in Dialogues.- How Children Recognize Emotions from Video and Audio.- Speaker Recognition.- On the Influence of CNN-based Feature Learning Modules in Neural Speaker Verification Framework.- Voice Cloning and Mismatch Conditions in Forensic Automatic Speaker Recognition.- Transformation of Emotional Speech to Anger Speech to Reduce Mismatches in Testing and Enrollment Speech for Speaker Recognition System.- Investigating Data Requirements for Hindi Speaker Recognition: A Comparative Study with English.- Practical Evaluation and Validation of Methods for Automatic Speaker Identification (as Applied to Various Languages).- Digital Speech Processing.- In Pursuit for the Best Error Metric for Optimisation of Articulatory Vowel Synthesis.- Exploring MetaConformer for Speech Enhancement.- Integration of Short-Term and Long-Term Harmonic Peaks in a Two-Level Discriminative Weight Training Framework for Voice Activity Detection.- Separating Party Conversation by Applying Contrastive Learning Methodology.- DuFCALF: Instilling Sentience in Computerized Song Analysis.- Natural Language Processing.-Harnessing Knowledge Distillation for Enhanced Text-to-Text Translation in Low-Resource Languages.- Bias Unveiled: Enhancing Fairness in German Word Embeddings with Large Language Models.- Conformer LLM - Convolution Augmented Large Language Models.- How to Detect Imbalances in the Google Books Ngram Corpus?.- Predicting the Valence Rating of Russian Words Using Various Pre-Trained Word Embeddings.- 3 Ancient Egyptian Hieroglyphic Texts Structure Identification.
Este título pertence ao(s) assunto(s) indicados(s). Para ver outros títulos clique no assunto desejado.
artificial intelligence;natural language interfaces;natural language processing;cognitive science;speech recognition;discourse, dialogue and pragmatics;knowledge representation and reasoning;HCI theory, concepts and models;interactive systems and tools;user interface management systems;user interface programming;user models;collaborative interaction;graphical user interfaces;web-based interaction;mixed / augmented reality;interaction techniques;accessibility theory, concepts and paradigms;accessibility technologies;multimedia databases