Qing Li

Cited by

	All	Since 2019
Citations	1423	1292
h-index	18	16
i10-index	18	17

Citations per year
Year	Citations
2016	8
2017	55
2018	60
2019	99
2020	149
2021	220
2022	298
2023	354
2024	172

Public access

9 articles available, 0 articles not available (based on funding mandates)

Co-authors

Siyuan Huang	Beijing Institute for General Artificial Intelligence (BIGAI)	Verified email at ucla.edu
Song-Chun Zhu	Professor of Statistics and Computer Science, UCLA	Verified email at stat.ucla.edu
Jiebo Luo	Albert Arendt Hopeman Professor of Engineering, University of Rochester	Verified email at cs.rochester.edu
Yining Hong	University of California, Los Angeles	Verified email at cs.ucla.edu
Danna Gurari	Assistant Professor, University of Colorado Boulder - Director of Image and Video Computing Group	Verified email at colorado.edu
Tao Mei	HiDream.ai; Fellow of CAE/IEEE/IAPR	Verified email at hidream.ai
Yixin Chen	University of California, Los Angeles	Verified email at g.ucla.edu
Zhaofan Qiu	AI Research, JD.COM	Verified email at mail.ustc.edu.cn
Ting Yao	JD.com and previously Microsoft Research	Verified email at jd.com
Yong Rui, Fellow of ACM, IEEE, AAAS,...	Lenovo Research and previously Microsoft Research	Verified email at lenovo.com
Kristen Grauman	Professor of Computer Science, University of Texas at Austin	Verified email at cs.utexas.edu
Ying Nian Wu	UCLA Department of Statistics	Verified email at stat.ucla.edu
Shafiq Joty	Research Director at Salesforce Research, Assoc. Prof. at NTU (on leave)	Verified email at ntu.edu.sg

Qing Li

UCLA

Verified email at ucla.edu

Neural-Symbolic Learning, Vision and Language, Machine Learning


Title	Cited by	Year
VizWiz grand challenge: Answering visual questions from blind people. D Gurari, Q Li, AJ Stangl, A Guo, C Lin, K Grauman, J Luo, JP Bigham. Proceedings of the IEEE conference on computer vision and pattern …, 2018	530	2018
Action recognition by learning deep multi-granular spatio-temporal video representation. Q Li, Z Qiu, T Yao, T Mei, Y Rui, J Luo. Proceedings of the 2016 ACM on international conference on multimedia …, 2016	147	2016
VQA-E: Explaining, elaborating, and enhancing your answers for visual questions. Q Li, Q Tao, S Joty, J Cai, J Luo. Proceedings of the European Conference on Computer Vision (ECCV), 552-567, 2018	107	2018
VizWiz-Priv: A dataset for recognizing the presence and purpose of private visual information in images taken by blind people. D Gurari, Q Li, C Lin, Y Zhao, A Guo, A Stangl, JP Bigham. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	87	2019
Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning. Q Li, S Huang, Y Hong, Y Chen, YN Wu, SC Zhu. ICML, 2020	76	2020
Tell-and-answer: Towards explainable visual question answering using attributes and captions. Q Li, J Fu, D Yu, T Mei, J Luo. EMNLP, 2018	66	2018
Learning by fixing: Solving math word problems with weak supervision. Y Hong, Q Li, D Ciao, S Huang, SC Zhu. Proceedings of the AAAI conference on artificial intelligence 35 (6), 4959-4967, 2021	50	2021
Why does a visual question have different answers? N Bhattacharya, Q Li, D Gurari. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	50	2019
SQA3D: Situated question answering in 3D scenes. X Ma, S Yong, Z Zheng, Q Li, Y Liang, SC Zhu, S Huang. arXiv preprint arXiv:2210.07474, 2022	44	2022
VIREO @ TRECVID 2017: Video-to-text, ad-hoc video search and video hyperlinking. PA Nguyen, Q Li, ZQ Cheng, YJ Lu, H Zhang, X Wu, CW Ngo. IEEE, 2017	36	2017
A Competence-aware Curriculum for Visual Concepts Learning via Question Answering. Q Li, S Huang, Y Hong, SC Zhu. ECCV, 2020	32	2020
SMART: A situation model for algebra story problems via attributed grammar. Y Hong, Q Li, R Gong, D Ciao, S Huang, SC Zhu. Proceedings of the AAAI conference on artificial intelligence 35 (14), 13009 …, 2021	30	2021
YouRefIt: Embodied reference understanding with language and gesture. Y Chen, Q Li, D Kong, YL Kei, SC Zhu, T Gao, Y Zhu, S Huang. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	28	2021
VLGrammar: Grounded grammar induction of vision and language. Y Hong, Q Li, SC Zhu, S Huang. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	28	2021
Learning hierarchical video representation for action recognition. Q Li, Z Qiu, T Yao, T Mei, Y Rui, J Luo. International Journal of Multimedia Information Retrieval 6, 85-98, 2017	24	2017
MSR Asia MSM at THUMOS challenge 2015. Z Qiu, Q Li, T Yao, T Mei, Y Rui. CVPR workshop 8, 2015	23	2015
3D-VisTA: Pre-trained transformer for 3D vision and text alignment. Z Zhu, X Ma, Y Chen, Z Deng, S Huang, Q Li. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	20	2023
Towards a unified foundation model: Jointly pre-training transformers on unpaired images and text. Q Li, B Gong, Y Cui, D Kondratyuk, X Du, MH Yang, M Brown. arXiv preprint arXiv:2112.07074, 2021	20	2021
An embodied generalist agent in 3D world. J Huang, S Yong, X Ma, X Linghu, P Li, Y Wang, Q Li, SC Zhu, B Jia, ... arXiv preprint arXiv:2311.12871, 2023	5	2023
A minimalist dataset for systematic generalization of perception, syntax, and semantics. Q Li, S Huang, Y Hong, Y Zhu, YN Wu, SC Zhu. ICLR, 2023	5	2023
