Tom Ko

Cited by

	All	Since 2019
Citations	3622	3204
h-index	17	17
i10-index	31	27

840

420

210

630

20132014201520162017201820192020202120222023202418 23 15 42 94 189 312 441 551 755 839 302

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Tom Ko

ByteDance AI Lab Hong Kong

Verified email at bytedance.com - Homepage


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Audio augmentation for speech recognition. T Ko, V Peddinti, D Povey, S Khudanpur Interspeech 2015, 3586, 2015	1320	2015
A study on data augmentation of reverberant speech for robust speech recognition T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur 2017 IEEE international conference on acoustics, speech and signal …, 2017	986	2017
Self-attentive speaker embeddings for text-independent speaker verification. Y Zhu, T Ko, D Snyder, B Mak, D Povey Interspeech 2018, 3573-3577, 2018	289	2018
Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ... arXiv preprint arXiv:2110.07205, 2021	152	2021
Jhu aspire system: Robust lvcsr with tdnns, ivector adaptation and rnn-lms V Peddinti, G Chen, V Manohar, T Ko, D Povey, S Khudanpur 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	132	2015
An empirical exploration of CTC acoustic models Y Miao, M Gowayyed, X Na, T Ko, F Metze, A Waibel 2016 IEEE international conference on acoustics, speech and signal …, 2016	102	2016
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ... arXiv preprint arXiv:2303.17395, 2023	57	2023
Lighthubert: Lightweight and configurable speech representation learning with once-for-all hidden-unit bert R Wang, Q Bai, J Ao, L Zhou, Z Xiong, Z Wei, Y Zhang, T Ko, H Li arXiv preprint arXiv:2203.15610, 2022	51	2022
An encoder-decoder based audio captioning system with transfer and reinforcement learning X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... arXiv preprint arXiv:2108.02752, 2021	45	2021
M³ST: Mix at Three Levels for Speech Translation X Cheng, Q Dong, F Yue, T Ko, M Wang, Y Zou ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	40	2023
Multi-view self-attention based transformer for speaker recognition R Wang, J Ao, L Zhou, S Liu, Z Wei, T Ko, Q Li, Y Zhang ICASSP 2022-2022 IEEE international conference on acoustics, speech and …, 2022	38	2022
Mixup Learning Strategies for Text-Independent Speaker Verification. Y Zhu, T Ko, B Mak Interspeech, 4345-4349, 2019	31	2019
An investigation of few-shot learning in spoken term classification Y Chen, T Ko, L Shang, X Chen, X Jiang, Q Li arXiv preprint arXiv:1812.10233, 2018	31*	2018
Findings of the IWSLT 2023 evaluation campaign M Agarwal, S Agarwal, A Anastasopoulos, L Bentivogli, O Bojar, C Borg, ... Association for Computational Linguistics, 2023	25	2023
CL4AC: A contrastive loss for audio captioning X Liu, Q Huang, X Mei, T Ko, HL Tang, MD Plumbley, W Wang arXiv preprint arXiv:2107.09990, 2021	24	2021
Token-level supervised contrastive learning for punctuation restoration Q Huang, T Ko, HL Tang, X Liu, B Wu arXiv preprint arXiv:2107.09099, 2021	22	2021
Prototypical networks for small footprint text-independent speaker verification T Ko, Y Chen, Q Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	19	2020
Pre-training transformer decoder for end-to-end asr model with unpaired speech data J Ao, Z Zhang, L Zhou, S Liu, H Li, T Ko, L Dai, J Li, Y Qian, F Wei arXiv preprint arXiv:2203.17113, 2022	17	2022
Leveraging pseudo-labeled data to improve direct speech-to-speech translation Q Dong, F Yue, T Ko, M Wang, Q Bai, Y Zhang arXiv preprint arXiv:2205.08993, 2022	15	2022
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6 X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021	15	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by