
M.S. Thesis Defense: Bala Subramanyam Duggirala
Wednesday, July 23, 2025
11:00 AM
347 Avery Hall
Zoom: https://unl.zoom.us/j/98970713104
"Investigating The Impact Of Agent Openness On Multi-Agent Planning"
Multi-agent systems (MAS) possess significant potential for modeling real-world scenarios requiring coordinated actions (like wildfire fighting or ridesharing) among autonomous entities or agents (e.g., wildfire fighting agents) in complex, dynamic environments. Effective decision-theoretic planning (where each agent must carefully consider both the immediate and the future situations or states, and coordinate with the other agents (neighbors) to evaluate what needs to be done at present) within MAS, especially multiagent planning, where the planning agent directly models its neighbors in order to estimate their optimal actions, is critical, yet challenged by factors like partial observability, openness, and diverse agent types with different goals, capabilities, or both. Current state-of-the-art frameworks for this planning come with certain drawbacks. The standard Interactive Partially Observable Markov Decision Process (I-POMDP) suffers from scalability as it becomes intractable for the planning agent to maintain an interactive belief over beliefs for each of its neighbors. The relatively faster I-POMDP-Lite suffers from incorrectly assuming full observability to approximately solve a nested MDP for each neighbor. Although solvers like Partially Observable Monte Carlo Planning (POMCP) enhance I-POMDP by accounting for partial observability and approximating the resulting belief updates—planning ad hoc for each neighbor still renders real-time computation impractical for large domains. Offline solvers such as Point Based Value Iteration (PBVI) improve scalability as well as account for partial observability but have not been previously analyzed regarding agent openness or adapted for mixed observability – a hybrid of partial and full observability. Addressing these gaps, this thesis investigates PBVI as a scalable offline solver for mixed observability and integrates PBVI-generated neighbor action policies into the I-POMDP-based planning, while simultaneously adopting a model-aware belief update algorithm for efficient belief updates. Our contributions include algorithmic adaptations for mixed observability, efficient belief updates, adaptable implementations, and comprehensive empirical analysis highlighting agent behavior under varying agent openness levels in environments with different agent types (cooperating and competing). These modifications make effective MAS planning computationally feasible, fostering practical applicability across complex, real-world domains.
Committee:
Dr. Leen-Kiat Soh, Advisor
Dr. Nirnimesh Ghose
Dr. Hau Chan
Dr. Adam Eck