Rohan Anil
Rohan Anil
Senior Staff Software Engineer, Google Brain
Verified email at google.com
Title
Cited by
Cited by
Year
Wide & deep learning for recommender systems
HT Cheng, L Koc, J Harmsen, T Shaked, T Chandra, H Aradhye, ...
Proceedings of the 1st workshop on deep learning for recommender systems, 7-10, 2016
17672016
Large scale distributed neural network training through online distillation
R Anil, G Pereyra, AT Passos, R Ormandi, G Dahl, G Hinton
Sixth International Conference on Learning Representations, 2018
2212018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
1042019
Tf-ranking: Scalable tensorflow library for learning-to-rank
RK Pasumarthi, S Bruch, X Wang, C Li, M Bendersky, M Najork, J Pfeifer, ...
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019
722019
Robust bi-tempered logistic loss based on bregman divergences
E Amid, MK Warmuth, R Anil, T Koren
2019 Conference on Neural Information Processing Systems, 2019
482019
Memory-efficient adaptive optimization for large-scale learning
R Anil, V Gupta, T Koren, Y Singer
2019 Conference on Neural Information Processing Systems, 2019
22*2019
Scalable Second Order Optimization for Deep Learning
R Anil, V Gupta, T Koren, K Regan, Y Singer
NeurIPS’19 Workshop on “Beyond First Order Methods in ML", Spotlight, 2020
17*2020
Disentangling adaptive gradient methods from learning rates
N Agarwal, R Anil, E Hazan, T Koren, C Zhang
arXiv preprint arXiv:2002.11803, 2020
13*2020
Wide and deep machine learning models
T Shaked, R Anil, HB Aradhye, G Anderson, W Chai, ML Koc, J Harmsen, ...
US Patent 10,762,422, 2020
102020
A large batch optimizer reality check: Traditional, generic optimizers suffice across batch sizes
Z Nado, JM Gilmer, CJ Shallue, R Anil, GE Dahl
arXiv preprint arXiv:2102.06356, 2021
52021
Stochastic Optimization with Laggard Data Pipelines
N Agarwal, R Anil, T Koren, K Talwar, C Zhang
2020 Conference on Neural Information Processing Systems, 2020
32020
Knowledge distillation: A good teacher is patient and consistent
L Beyer, X Zhai, A Royer, L Markeeva, R Anil, A Kolesnikov
arXiv preprint arXiv:2106.05237, 2021
22021
Large-Scale Differentially Private BERT
R Anil, B Ghazi, V Gupta, R Kumar, P Manurangsi
Privacy Preserving Machine Learning 2021, 2021
12021
LocoProp: Enhancing BackProp via Local Loss Optimization
E Amid, R Anil, MK Warmuth
arXiv preprint arXiv:2106.06199, 2021
12021
Measuring and Harnessing Transference in Multi-Task Learning
C Fifty, E Amid, Z Zhao, T Yu, R Anil, C Finn
arXiv preprint arXiv:2010.15413, 2020
12020
Step-size Adaptation Using Exponentiated Gradient Updates
E Amid, R Anil, C Fifty, MK Warmuth
ICML’20 Workshop on “Beyond First Order Methods in ML", Spotlight, 2020
1*2020
Efficiently Identifying Task Groupings for Multi-Task Learning
C Fifty, E Amid, Z Zhao, T Yu, R Anil, C Finn
2021 Conference on Neural Information Processing Systems, Spotlight, 2021
2021
Wide and deep machine learning models
T Shaked, R Anil, HB Aradhye, G Anderson, W Chai, ML Koc, JJ Harmsen, ...
US Patent App. 16/991,258, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–18