This book constitutes the refereed proceedings of the 17th National Conference on Man-Machine Speech Communication, NCMMSC 2022, held in China, in December 2022.
The 21 full papers and 7 short papers included in this book were carefully reviewed and selected from 108 submissions. They were organized in topical sections as follows: MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation.- Baby Cry Recognition Based on Acoustic Segment Model, MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.
The information provided in the "Synopsis" section may refer to another edition of this title.
Seller: Books From California, Simi Valley, CA, United States
Paperback. Condition: Very Good. Seller ref. no. mon0003596167
Quantity available: 1
Seller: Brook Bookstore On Demand, Napoli, NA, Italy
Condition: New. This is a print-on-demand item. Seller ref. no. GBTYCH6YJP
Quantity available: More than 20
Seller: Ria Christie Collections, Uxbridge, United Kingdom
Condition: New. In. Seller ref. no. ria9789819924004_new
Quantity available: More than 20
Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany
Paperback. Condition: New. This item is printed on demand and takes 3-4 days longer. New stock. 344 pp. English. Seller ref. no. 9789819924004
Quantity available: 2
Seller: Books Puddle, New York, NY, United States
Condition: New. 1st ed. 2023 edition NO-PA16APR2015-KAP. Seller ref. no. 26396348631
Quantity available: 4
Seller: moluna, Greven, Germany
Softcover. Condition: New. Seller ref. no. 848781672
Quantity available: More than 20
Seller: Majestic Books, Hounslow, United Kingdom
Condition: New. Print on demand. Seller ref. no. 401109768
Quantity available: 4
Seller: Biblios, Frankfurt am Main, Hesse, Germany
Condition: New. Print on demand. Seller ref. no. 18396348637
Quantity available: 4
Seller: Revaluation Books, Exeter, United Kingdom
Paperback. Condition: Brand New. 343 pages. 9.25x6.10x0.72 inches. In stock. Seller ref. no. x-9819924006
Quantity available: 2
Seller: buchversandmimpf2000, Emtmannsberg, BAYE, Germany
Paperback. Condition: New. This item is printed on demand (print-on-demand title). New stock. Contents: MCPN: A Multiple Cross-Perception Network for Real-Time Emotion Recognition in Conversation.- Baby Cry Recognition Based on Acoustic Segment Model.- A Multi-feature Sets Fusion Strategy with Similar Samples Removal for Snore Sound Classification.- Multi-Hypergraph Neural Networks for Emotion Recognition in Multi-Party Conversations.- Using Emoji as an Emotion Modality in Text-Based Depression Detection.- Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis.- Semantic enhancement framework for robust speech recognition.- Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model.- Predictive AutoEncoders are Context-Aware Unsupervised Anomalous Sound Detectors.- A pipelined framework with serialized output training for overlapping speech recognition.- Adversarial Training Based on Meta-Learning in Unseen Domains for Speaker Verification.- Multi-Speaker Multi-Style Speech Synthesis with Timbre and Style Disentanglement.- Multiple Confidence Gates for Joint Training of SE and ASR.- Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Linguistic Information Fusion.- Pre-training Techniques For Improving Text-to-Speech Synthesis By Automatic Speech Recognition Based Data Enhancement.- A Time-Frequency Attention Mechanism with Subsidiary Information for Effective Speech Emotion Recognition.- Interplay between prosody and syntax-semantics: Evidence from the prosodic features of Mandarin tag questions.- Improving Fine-grained Emotion Control and Transfer with Gated Emotion Representations in Speech Synthesis.- Violence Detection through Fusing Visual Information to Auditory Scene.- Mongolian Text-to-Speech Challenge under Low-Resource Scenario for NCMMSC2022.- VC-AUG: Voice Conversion based Data Augmentation for Text-Dependent Speaker Verification.- Transformer-based potential emotional relation mining network for emotion recognition in conversation.- FastFoley: Non-Autoregressive Foley Sound Generation Based On Visual Semantics.- Structured Hierarchical Dialogue Policy with Graph Neural Networks.- Deep Reinforcement Learning for On-line Dialogue State Tracking.- Dual Learning for Dialogue State Tracking.- Automatic Stress Annotation and Prediction For Expressive Mandarin TTS.- MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset. Springer-Verlag KG, Sachsenplatz 4-6, 1201 Wien. 344 pp. English. Seller ref. no. 9789819924004
Quantity available: 1