![]() |
- nullCCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition
- null基于时间分段和重组聚类的说话人日志方法
- nullLeveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting
- nullTE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
- null基于多尺度距离矩阵的语音关键词检测与细粒度定位方法
- nullOn Local Temporal Embedding for Semi-Supervised Sound Event Detection
- nullReproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection
- nullObjective Class-based Micro-expression Recognition under Partial Occlusion via Region-inspired Relation Reasoning Network
- nullTime Domain Speech Enhancement Using SNR Prediction and Robust Speaker Classification
- nullLearning Device-invariant and Location-invariant Embedding for Speaker Verification Using Adversarial Multi-task Training