Multiple sound sources localization from coarse to fine R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 147 | 2020 |
Audio caption: Listen and tell M Wu, H Dinkel, K Yu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 74 | 2019 |
Investigating local and global information for automated audio captioning with transfer learning X Xu, H Dinkel, M Wu, Z Xie, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 57 | 2021 |
Towards duration robust weakly supervised sound event detection H Dinkel, M Wu, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 887-900, 2021 | 57 | 2021 |
What does a Car-ssette tape tell? X Xu, H Dinkel, M Wu, K Yu arXiv preprint arXiv:1905.13448v1, 2019 | 54* | 2019 |
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning. X Xu, H Dinkel, M Wu, K Yu DCASE, 225-229, 2020 | 46 | 2020 |
Depa: Self-supervised audio embedding for depression detection P Zhang, M Wu, H Dinkel, K Yu Proceedings of the 29th ACM international conference on multimedia, 135-143, 2021 | 41 | 2021 |
Voice activity detection in the wild: A data-driven approach using teacher-student training H Dinkel, S Wang, X Xu, M Wu, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1542-1555, 2021 | 40 | 2021 |
Building interpretable interaction trees for deep nlp models D Zhang, H Zhang, H Zhou, X Bao, D Huo, R Chen, X Cheng, M Wu, ... Proceedings of the AAAI conference on artificial intelligence 35 (16), 14328 …, 2021 | 38 | 2021 |
Can audio captions be evaluated with image caption metrics? Z Zhou, Z Zhang, X Xu, Z Xie, M Wu, KQ Zhu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 33 | 2022 |
Voice activity detection in the wild via weakly supervised sound event detection H Dinkel, Y Chen, M Wu, K Yu arXiv preprint arXiv:2003.12222, 2020 | 30 | 2020 |
The SJTU system for DCASE2022 challenge task 6: Audio captioning with audio-text retrieval pre-training X Xu, Z Xie, M Wu, K Yu DCASE 2022 Challenge, Tech. Rep., 2022 | 29 | 2022 |
Text-based depression detection on sparse data H Dinkel, M Wu, K Yu arXiv preprint arXiv:1904.05154, 2019 | 25 | 2019 |
Audio-text retrieval in context S Lou, X Xu, M Wu, K Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Decoupled dialogue modeling and semantic parsing for multi-turn text-to-SQL Z Chen, L Chen, H Li, R Cao, D Ma, M Wu, K Yu arXiv preprint arXiv:2106.02282, 2021 | 21 | 2021 |
LLM-empowered chatbots for psychiatrist and patient simulation: application and evaluation S Chen, M Wu, KQ Zhu, K Lan, Z Zhang, L Cui arXiv preprint arXiv:2305.13614, 2023 | 18 | 2023 |
Text-to-audio grounding: Building correspondence between captions and sound events X Xu, H Dinkel, M Wu, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 18 | 2021 |
Psychiatric scale guided risky post screening for early detection of depression Z Zhang, S Chen, M Wu, KQ Zhu arXiv preprint arXiv:2205.09497, 2022 | 17 | 2022 |
Audio caption in a car setting with a sentence-level loss X Xu, H Dinkel, M Wu, K Yu 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 17 | 2021 |
Kunyao Lan, Zhiling Zhang, and Lyuchun Cui. 2023. LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation S Chen, M Wu, KQ Zhu arXiv preprint arXiv:2305.13614, 2023 | 16 | 2023 |