![]() |
- nullCCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition
- nullOn Local Temporal Embedding for Semi-Supervised Sound Event Detection
- nullTE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
- null基于多尺度距离矩阵的语音关键词检测与细粒度定位方法
- nullLeveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting
- null基于时间分段和重组聚类的说话人日志方法
- nullWeighted contrastive learning using pseudo labels for facial expression recognition
- nullTime Domain Speech Enhancement Using SNR Prediction and Robust Speaker Classification
- nullDCTCN:Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement
- nullAdaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection