Transformer-based online CTC/attention end-to-end speech recognition architecture H Miao, G Cheng, C Gao, P Zhang, Y Yan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 129 | 2020 |
Using neural network front-ends on far field multiple microphones based speech recognition Y Liu, P Zhang, T Hain 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 107 | 2014 |
Integrating the data augmentation scheme with various classifiers for acoustic scene modeling H Chen, Z Liu, Z Liu, P Zhang, Y Yan arXiv preprint arXiv:1907.06639, 2019 | 84 | 2019 |
DPT-FSNet: Dual-path transformer based full-band and sub-band fusion network for speech enhancement F Dang, H Chen, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 82 | 2022 |
The effect of silence and dual-band fusion in anti-spoofing system Y Zhang12, W Wang12, P Zhang12 Proc. Interspeech, 2021 | 67 | 2021 |
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. H Miao, G Cheng, P Zhang, T Li, Y Yan Interspeech, 2623-2627, 2019 | 58 | 2019 |
Online hybrid CTC/attention end-to-end automatic speech recognition architecture H Miao, G Cheng, P Zhang, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1452-1465, 2020 | 49 | 2020 |
Semi-supervised DNN training in meeting recognition P Zhang, Y Liu, T Hain 2014 IEEE Spoken Language Technology Workshop (SLT), 141-146, 2014 | 38 | 2014 |
Self-attention based prosodic boundary prediction for chinese speech synthesis C Lu, P Zhang, Y Yan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 31 | 2019 |
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition. Y Zhang, P Zhang, Y Yan Interspeech, 3857-3861, 2017 | 31 | 2017 |
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling. H Chen, P Zhang, H Bai, Q Yuan, X Bao, Y Yan Interspeech, 3304-3308, 2018 | 29 | 2018 |
Improving ctc-based speech recognition via knowledge transferring from pre-trained language models K Deng, S Cao, Y Zhang, L Ma, G Cheng, J Xu, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 26 | 2022 |
Open source magicdata-ramc: A rich annotated mandarin conversational (ramc) speech dataset Z Yang, Y Chen, L Luo, R Yang, L Ye, G Cheng, J Xu, Y Jin, Q Zhang, ... arXiv preprint arXiv:2203.16844, 2022 | 26 | 2022 |
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Multi-accent adaptation based on gate mechanism H Zhu, L Wang, P Zhang, Y Yan arXiv preprint arXiv:2011.02774, 2020 | 22 | 2020 |
An audio scene classification framework with embedded filters and a DCT-based temporal module H Chen, P Zhang, Y Yan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 20 | 2019 |
A fast fuzzy keyword spotting algorithm based on syllable confusion network J Shao, Q Zhao, P Zhang, Z Liu, Y Yan eps 2 (q1), q3, 2007 | 20 | 2007 |
Pre-training transformer decoder for end-to-end asr model with unpaired text data C Gao, G Cheng, R Yang, H Zhu, P Zhang, Y Yan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 18 | 2021 |
Keyword spotting based on phoneme confusion matrix P Zhang, J Shao, J Han, Z Liu, Y Yan Proc. of ISCSLP 2, 408-419, 2006 | 17 | 2006 |
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech. Z Shang, Z Huang, H Zhang, P Zhang, Y Yan Interspeech, 1619-1623, 2021 | 15 | 2021 |