|
- nullLeveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting
- nullTE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting
- null基于多尺度距离矩阵的语音关键词检测与细粒度定位方法
- nullReproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection
- nullTime Domain Speech Enhancement Using SNR Prediction and Robust Speaker Classification
- nullLearning Device-invariant and Location-invariant Embedding for Speaker Verification Using Adversarial Multi-task Training
- nullDual-path transformer network: Direct context-aware modeling for end-to-end monaural speech separation
- nullOn synthesis for supervised monaural speech separation in time domain
- nullWeakly Supervised Sentiment-Specific Region Discovery for VSA
- nullA context aware-based deep neural network approach for simultaneous speech denoising and dereverberation

