Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems T Chen, M Li, Y Li, M Lin, N Wang, M Wang, T Xiao, B Xu, C Zhang, ... arXiv preprint arXiv:1512.01274, 2015 | 1760 | 2015 |
Empirical evaluation of rectified activations in convolutional network B Xu, N Wang, T Chen, M Li arXiv preprint arXiv:1505.00853, 2015 | 1714 | 2015 |
Scaling distributed machine learning with the parameter server M Li, DG Andersen, JW Park, AJ Smola, A Ahmed, V Josifovski, J Long, ... 11th {USENIX} Symposium on Operating Systems Design and Implementation …, 2014 | 1237 | 2014 |
Efficient mini-batch training for stochastic optimization M Li, T Zhang, Y Chen, AJ Smola Proceedings of the 20th ACM SIGKDD international conference on Knowledge …, 2014 | 552 | 2014 |
Communication Efficient Distributed Machine Learning with the Parameter Server. M Li, DG Andersen, AJ Smola, K Yu NIPS 2, 1.4-2.2, 2014 | 413 | 2014 |
Bag of tricks for image classification with convolutional neural networks T He, Z Zhang, H Zhang, Z Zhang, J Xie, M Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 361 | 2019 |
Emotion classification based on gamma-band EEG M Li, BL Lu 2009 Annual International Conference of the IEEE Engineering in medicine and …, 2009 | 351 | 2009 |
Resnest: Split-attention networks H Zhang, C Wu, Z Zhang, Y Zhu, Z Zhang, H Lin, Y Sun, T He, J Mueller, ... arXiv preprint arXiv:2004.08955, 2020 | 165 | 2020 |
Parameter Server for Distributed Machine Learning M Li, L Zhou, Z Yang, A Li, F Xia, DG Andersen, A Smola | 153 | 2013 |
Dive into deep learning JM Czum Journal of the American College of Radiology: JACR 17 (5), 637-638, 2020 | 150* | 2020 |
Making large-scale Nyström approximation possible M Li, JTY Kwok, B Lü ICML 2010-Proceedings, 27th International Conference on Machine Learning, 631, 2010 | 129 | 2010 |
Large-scale Nyström kernel matrix approximation using randomized SVD M Li, W Bi, JT Kwok, BL Lu IEEE transactions on neural networks and learning systems 26 (1), 152-164, 2014 | 91 | 2014 |
Iterative row sampling M Li, GL Miller, R Peng 2013 IEEE 54th Annual Symposium on Foundations of Computer Science, 127-136, 2013 | 81 | 2013 |
Bag of freebies for training object detection neural networks Z Zhang, T He, H Zhang, Z Zhang, J Xie, M Li arXiv preprint arXiv:1902.04103, 2019 | 67 | 2019 |
Time and space efficient spectral clustering via column sampling M Li, XC Lian, JT Kwok, BL Lu CVPR 2011, 2297-2304, 2011 | 67 | 2011 |
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing. J Guo, H He, T He, L Lausen, M Li, H Lin, X Shi, C Wang, J Xie, S Zha, ... Journal of Machine Learning Research 21 (23), 1-7, 2020 | 57 | 2020 |
Difacto: Distributed factorization machines M Li, Z Liu, AJ Smola, YX Wang Proceedings of the Ninth ACM International Conference on Web Search and Data …, 2016 | 55 | 2016 |
Distributed delayed proximal gradient methods M Li, DG Andersen, A Smola NIPS Workshop on Optimization for Machine Learning 3, 3, 2013 | 54 | 2013 |
Optimizing {CNN} model inference on cpus Y Liu, Y Wang, R Yu, M Li, V Sharma, Y Wang 2019 {USENIX} Annual Technical Conference ({USENIX}{ATC} 19), 1025-1040, 2019 | 52 | 2019 |
Revise saturated activation functions B Xu, R Huang, M Li arXiv preprint arXiv:1602.05980, 2016 | 52 | 2016 |