Nikhil Vyas
nikhil at g.harvard.edu
vyasnikhil96 at gmail.com (Preferred)
Twitter
I am a postdoc at Harvard hosted by Prof. Sham Kakade. I finished my PhD in 2022 in the TOC group at MIT, where I was advised by Prof. Ryan Williams.
My current research focuses on deep learning, most recently on optimization of neural networks.
Publications (ML)
SOAP: Improving and Stabilizing Shampoo using Adam.
Nikhil Vyas*, Depen Morwani*, Rosie Zhao, Itai Shapira, David Brandfonbrener, Lucas Janson, Sham Kakade
Optimization for Machine Learning Workshop, NeurIPS 2024; under submission to a conference
A New Perspective on Shampoo's Preconditioner.
Depen Morwani*, Itai Shapira*, Nikhil Vyas*, Eran Malach, Sham Kakade, Lucas Janson
Deconstructing What Makes a Good Optimizer for Language Models.
Rosie Zhao*, Depen Morwani*, David Brandfonbrener*, Nikhil Vyas*, Sham Kakade
Optimization for Machine Learning Workshop, NeurIPS 2024
Mixture of Parrots: Experts improve memorization more than reasoning.
Samy Jelassi, Clara Mohri, David Brandfonbrener, Alex Gu, Nikhil Vyas, Nikhil Anand, David Alvarez-Melis, Yuanzhi Li, Sham Kakade, Eran Malach
Workshop on Mathematics of Modern Machine Learning, NeurIPS 2024
Transformers can reinforcement learn to approximate Gittins Index.
Vladimir Petrov, Nikhil Vyas, Lucas Janson
Workshop on Scientific Methods for Understanding Deep Learning, NeurIPS 2024
Connections between Schedule-Free SGD, Accelerated SGD Variants, and Weight Averaging.
Depen Morwani*, Nikhil Vyas*, Hanlin Zhang, Sham Kakade
Optimization for Machine Learning Workshop, NeurIPS 2024
Loss-to-Loss Prediction: Language model scaling laws across datasets.
David Brandfonbrener, Nikhil Anand, Nikhil Vyas, Eran Malach, Sham Kakade
Workshop on Attributing Model Behavior at Scale, NeurIPS 2024
How Does Critical Batch Size Scale in Pre-training?
Hanlin Zhang, Depen Morwani, Nikhil Vyas, Jingfeng Wu, Difan Zou, Udaya Ghai, Dean Foster, Sham Kakade
Optimization for Machine Learning Workshop, NeurIPS 2024
AdaMeM: Memory Efficient Momentum for Adafactor.
Nikhil Vyas, Depen Morwani, Sham Kakade
Oral at the 2nd Workshop on Advancing Neural Network Training (WANT@ICML 2024)
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning.
Nikhil Vyas*, Depen Morwani*, Rosie Zhao*, Gal Kaplun*, Sham Kakade, Boaz Barak
ICML 2024 (Spotlight)
Distinguishing the Knowable from the Unknowable with Language Models.
Gustaf Ahdritz*, Tian Qin*, Nikhil Vyas, Boaz Barak, Benjamin L. Edelman
ICML 2024
On the benefits of learning to route in mixture-of-experts models.
Nishanth Dikkala*, Nikhil Ghosh*, Raghu Meka*, Rina Panigrahy*, Nikhil Vyas*, Xin Wang*
EMNLP 2023
Feature-Learning Networks Are Consistent Across Widths At Realistic Scales.
Nikhil Vyas*, Alexander Atanasov*, Blake Bordelon*, Depen Morwani, Sabarish Sainathan, Cengiz Pehlevan
NeurIPS 2023
On Privileged and Convergent Bases in Neural Network Representations.
Davis Brown*, Nikhil Vyas*, Yamini Bansal
Workshop on High-dimensional Learning Dynamics at ICML 2023
Provable Copyright Protection for Generative Models.
Nikhil Vyas, Sham Kakade, Boaz Barak
ICML 2023
Limitations of the NTK for Understanding Generalization in Deep Learning.
Nikhil Vyas, Yamini Bansal, Preetum Nakkiran
TMLR
Thwarting Adversarial Examples: An L_0-Robust Sparse Fourier Transform.
Mitali Bafna*, Jack Murtagh*, Nikhil Vyas*
NeurIPS 2018
Publications (Theory)
Quasi-Linear Size PCPs with Small Soundness from HDX.
Mitali Bafna, Dor Minzer, Nikhil Vyas
On Oracles and Algorithmic Methods for Proving Lower Bounds.
Nikhil Vyas, Ryan Williams
ITCS 2023
On the Number of Quantifiers as a Complexity Measure.
Ronald Fagin, Jonathan Lenchner, Nikhil Vyas, Ryan Williams
MFCS 2022
Optimal Fine-grained Hardness of Approximation of Linear Equations.
Mitali Bafna, Nikhil Vyas
ICALP 2021
Multi-Structural Games and Number of Quantifiers.
Ronald Fagin, Jonathan Lenchner, Kenneth W. Regan, Nikhil Vyas
LICS 2021
Fast Low-Space Algorithms for Subset Sum.
Ce Jin, Nikhil Vyas, R. Ryan Williams
SODA 2021
Lower Bounds Against Sparse Symmetric Functions of ACC Circuits: Expanding the Reach of #SAT Algorithms.
Nikhil Vyas, R. Ryan Williams
STACS 2020
Near-Optimal Complexity Bounds for Fragments of the Skolem Problem.
S. Akshay, Nikhil Balaji, Aniket Murhekar, Rohith Varma, Nikhil Vyas
STACS 2020
Algorithms and Lower Bounds for Cycles and Walks: Small Space and Sparse Graphs.
Andrea Lincoln, Nikhil Vyas
ITCS 2020
Efficient Constructions for Almost-Everywhere Secure Computation.
Siddhartha Jayanti, Srinivasan Raghuraman, Nikhil Vyas
EUROCRYPT 2020
On Super Strong ETH.
Nikhil Vyas, Ryan Williams
SAT 2019 (Best Paper Award)
Imperfect Gaps in Gap-ETH and PCPs.
Mitali Bafna, Nikhil Vyas
CCC 2019
Approximation Algorithms for Min-Distance Problems.
Mina Dalirrooyfard, Virginia Vassilevska Williams, Nikhil Vyas, Nicole Wein, Yinzhan Xu, Yuancheng Yu
ICALP 2019
Tight Approximation Algorithms for Bichromatic Graph Diameter and Related Problems.
Mina Dalirrooyfard, Virginia Vassilevska Williams, Nikhil Vyas, Nicole Wein
ICALP 2019
Distribution-based objectives for Markov Decision Processes.
S. Akshay, Blaise Genest, Nikhil Vyas
LICS 2018
Complexity of Restricted Variants of Skolem and Related Problems.
S. Akshay, Nikhil Balaji, Nikhil Vyas
MFCS 2017
Faster space-efficient algorithms for subset sum and k-sum.
Nikhil Bansal, Shashwat Garg, Jesper Nederlof, Nikhil Vyas
STOC 2017
On Regularity of Unary Probabilistic Automata.
S. Akshay, Blaise Genest, Bruno Karelovic, Nikhil Vyas
STACS 2016
Teaching
Advanced Complexity Theory. (MIT 6.841, Spring 2022)
Teaching assistant to Professor Ryan Williams
Math for Computer Science. (MIT 6.042, Spring 2021)
Teaching assistant to Zachary Abel and Professors Nancy Lynch and Ryan Williams
Learning Augmented Algorithms. (MIT 6.890, Spring 2019)
Teaching assistant to Professors Costis Daskalakis and Piotr Indyk
Introduction to Computational Complexity. (IIT Bombay CS 721, Fall 2016)
Teaching assistant to Professor Nutan Limaye
Discrete Structures. (IIT Bombay CS 207, Fall 2015)
Teaching assistant to Professor S. Akshay