Shresth Verma

I am a second-year PhD student at Harvard University advised by Prof. Milind Tambe. I’m interested in reinforcement learning and multi-agent systems for planning and optimization with applications in public health. My current work focuses on
- Fair and robust algorithms for sequential resource allocation
- Transparency and alignment in LLMs for planning tasks
- Decision-Focused Learning
Previously, I spent two wonderful years at Google Research India, working in the AI for Social Good lab where I was grateful to be advised by Dr. Aparna Taneja. I developed robust bandit algorithms for planning targeted health interventions which in turn improve health literacy and medical adherence in underserved communities in India.
Before that, I was a Data Scientist at United Health Group where I worked in the Chief Medical Officer’s team for modeling readmission risks for millions of beneficiaries. I also developed tools to visualize patient’s wellness journey using data obtained from the world’s largest healthcare graph database.
![]() | ![]() | ![]() |
2023 - Present | 2021 - 2023 | 2020 - 2021 |
News
Jul 24, 2024 | Our work on Group Fairness in Decision-Focused Learning has been accepted at UAI 2024! |
---|---|
Mar 24, 2024 | I’ll be attending Data Study Group at The Alan Turing Institute as a Facilitator! |
Feb 5, 2024 | Got accepted into Harvard’s Spring 2024 Technical AI Safety Fellowship! |
Nov 10, 2023 | Our paper about sequential allocation of multiple kinds of resources got accepted at IAAI 2024! |
Jun 15, 2023 | I’m starting PhD in CS at Harvard University advised by Prof. Milind Tambe! |
Mar 1, 2023 | Our work on developing an index for sequential resource allocation in non-markovian environments is accepted at IJCAI 2023! |
Jan 1, 2023 | Our work on field study of Decision Focused Learning has been accepted at AAMAS 2023! |
Selected Publications
2024
- IAAI’24Improving Health Information Access in the World’s Largest Maternal Mobile Health Program via Bandit AlgorithmsIn AAAI Conference on Artificial Intelligence, 2024
2023
- IJCAI’23Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child HealthcareIn International Joint Conference on Artificial Intelligence, 2023
- AAAI’23Scalable decision-focused learning in restless multi-armed bandits with application to maternal and child healthIn AAAI Conference on Artificial Intelligence, 2023
- AAAI’23Robust planning over restless groups: engagement interventions for a large-scale maternal telehealth programIn AAAI Conference on Artificial Intelligence, 2023
- AAMAS’23Restless Multi-Armed Bandits for Maternal and Child Health: Results from Decision-Focused Learning.In International Conference on Autonomous Agents and Multi Agent Systems, 2023
- IAAI’23Increasing impact of mobile health programs: SAHELI for maternal and child care🏆Best Deployed Application🏆In AAAI Conference on Artificial Intelligence, 2023
2022
- AAAI’22Field study in deploying restless multi-armed bandits: Assisting non-profits in improving maternal and child healthIn AAAI Conference on Artificial Intelligence, 2022
- TSRML-NeurIPS’22Case study: Applying decision focused learning in the real worldIn Workshop on Trustworthy and Socially Responsible Machine Learning at NeurIPS, 2022
2021
- AAMAS’21Towards Sample Efficient Learners in Population based Referential Games through Action AdvisingIn International Conference on Autonomous Agents and Multi Agent Systems, 2021
2020
- CoDS-COMAD’20Deep reinforcement learning for single-shot diagnosis and adaptation in damaged robotsIn ACM International Joint Conference on Data Science and Management of Data, 2020
- LaREL-ICML’20Emergence of Multilingualism in Population based Referential GamesIn Workshop on Language in Reinforcement Learning at ICML, 2020
- AAAI’20Emergence of Writing Systems through Multi-Agent Cooperation (Student Abstract)In AAAI Conference on Artificial Intelligence, 2020