zhang pengyuan

Cited by

	All	Since 2019
Citations	1524	1310
h-index	18	16
i10-index	32	29

440

220

110

330

2007200820092010201120122013201420152016201720182019202020212022202320245 7 4 8 8 15 11 13 29 30 38 43 63 119 205 331 430 158

Public access

View all

55 articles

32 articles

available

not available

Based on funding mandates

Co-authors

Jian ShaoZhejiang UniversityVerified email at zju.edu.cn
Thomas HainProfessor of Speech Technology, University of SheffieldVerified email at sheffield.ac.uk
Xiaofei WangMicrosoftVerified email at jhu.edu

zhang pengyuan

Institute of Acoustics, Chinese Academy of Sciences

Verified email at hccl.ioa.ac.cn

speech processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transformer-based online CTC/attention end-to-end speech recognition architecture H Miao, G Cheng, C Gao, P Zhang, Y Yan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	129	2020
Using neural network front-ends on far field multiple microphones based speech recognition Y Liu, P Zhang, T Hain 2014 IEEE international conference on acoustics, speech and signal …, 2014	107	2014
Integrating the data augmentation scheme with various classifiers for acoustic scene modeling H Chen, Z Liu, Z Liu, P Zhang, Y Yan arXiv preprint arXiv:1907.06639, 2019	84	2019
DPT-FSNet: Dual-path transformer based full-band and sub-band fusion network for speech enhancement F Dang, H Chen, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	82	2022
The effect of silence and dual-band fusion in anti-spoofing system Y Zhang12, W Wang12, P Zhang12 Proc. Interspeech, 2021	67	2021
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. H Miao, G Cheng, P Zhang, T Li, Y Yan Interspeech, 2623-2627, 2019	58	2019
Online hybrid CTC/attention end-to-end automatic speech recognition architecture H Miao, G Cheng, P Zhang, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1452-1465, 2020	49	2020
Semi-supervised DNN training in meeting recognition P Zhang, Y Liu, T Hain 2014 IEEE Spoken Language Technology Workshop (SLT), 141-146, 2014	38	2014
Self-attention based prosodic boundary prediction for chinese speech synthesis C Lu, P Zhang, Y Yan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	31	2019
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition. Y Zhang, P Zhang, Y Yan Interspeech, 3857-3861, 2017	31	2017
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling. H Chen, P Zhang, H Bai, Q Yuan, X Bao, Y Yan Interspeech, 3304-3308, 2018	29	2018
Improving ctc-based speech recognition via knowledge transferring from pre-trained language models K Deng, S Cao, Y Zhang, L Ma, G Cheng, J Xu, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	26	2022
Open source magicdata-ramc: A rich annotated mandarin conversational (ramc) speech dataset Z Yang, Y Chen, L Luo, R Yang, L Ye, G Cheng, J Xu, Y Jin, Q Zhang, ... arXiv preprint arXiv:2203.16844, 2022	26	2022
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	25	2022
Multi-accent adaptation based on gate mechanism H Zhu, L Wang, P Zhang, Y Yan arXiv preprint arXiv:2011.02774, 2020	22	2020
An audio scene classification framework with embedded filters and a DCT-based temporal module H Chen, P Zhang, Y Yan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	20	2019
A fast fuzzy keyword spotting algorithm based on syllable confusion network J Shao, Q Zhao, P Zhang, Z Liu, Y Yan eps 2 (q1), q3, 2007	20	2007
Pre-training transformer decoder for end-to-end asr model with unpaired text data C Gao, G Cheng, R Yang, H Zhu, P Zhang, Y Yan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	18	2021
Keyword spotting based on phoneme confusion matrix P Zhang, J Shao, J Han, Z Liu, Y Yan Proc. of ISCSLP 2, 408-419, 2006	17	2006
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech. Z Shang, Z Huang, H Zhang, P Zhang, Y Yan Interspeech, 1619-1623, 2021	15	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors