Manish Nagaraj
Uber Technologies, Purdue University, Nano(Neuro) Electronics Laboratory.
Data Efficiency · Efficient Fine-tuning · Large Language Models
I am currently working as a Machine Learning Engineer at Uber Technologies. Broadly, I’m interested in techniques that make foundation models, LLMs, vision models, and multimodal systems, more compact, accurate, and deployable in real applications.
I received my Ph.D. in Electrical and Computer Engineering at Purdue University, working with Professor Kaushik Roy. I also received my M.S. in Electrical and Computer Engineering from Purdue University and my B.E. in Electronics and Communications from PES Institute of Technology, Bangalore, India.
My doctoral dissertation, “Exploring Data Efficiency for Deep Learning Systems” looked at how to make modern deep learning, especially large language and vision models, more practical and scalable. I worked on methods that identify which data actually matters for training, so that we can fine-tune and deploy large models with less compute and without sacrificing performance. This has included:
- ‘TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning’ - a forward-only, attention-based approach for selecting instruction-tuning data for LLMs, accepted for publication at ICML 2026.
- ‘Coresets from Trajectories: Selecting Data via Correlation of Loss Differences’ - a gradient-free coreset method for large-scale vision training, accepted at TMLR.
- ‘TOFU: Federated Learning with Data and Communication Efficiency’ - improving data and communication efficiency in federated learning, published in IEEE Access.
- ‘DOTIE: Energy-Efficient Object Detection Using Event Cameras’ - event-based object detection with spiking neural networks, demonstrated at the 2023 IEEE International Conference on Robotics and Automation (ICRA) and CVPR workshops.
Across these projects, the common thread was data efficiency for large models: selecting informative subsets, scaling training under real-world constraints, and making models usable in settings like federated learning, robotics, and resource-limited hardware.
News
| May 18, 2026 | I’m happy to share that I’m starting a new position as Machine Learning Engineer at Uber Technologies!! |
|---|---|
| May 04, 2026 | I completed my PhD! I want to specially thank my committee members and friends for all the support! |
| Apr 30, 2026 | Thrilled to share that our paper TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning has been accepted to ICML 2026! 🎉 |
| Nov 18, 2025 | Coresets from Trajectories: Selecting Data via Correlation of Loss Differences got accepted for publication at TMLR! |
| Oct 25, 2024 | I passed my preliminary examination! |
Latest Posts
| Jul 10, 2023 | Blog post on Federated Learning |
|---|