Neurosymbolic Transformers for Multi-Agent Communication

Inala, Jeevana Priya; Yang, Yichen; Paulos, James; Pu, Yewen; Bastani, Osbert; Kumar, Vijay; Rinard, Martin; Solar-Lezama, Armando

Javascript is disabled or not supported in your browser. JavaScript must be enabled in order for you to use WIKINDX fully. Enable JavaScript through your browser options then try again, otherwise, try using a different browser.

AI Bibliography

WIKINDX Resources

Inala, J. P., Yang, Y., Paulos, J., Pu, Y., Bastani, O., & Kumar, V., et al.. (2020). Neurosymbolic transformers for multi-agent communication. Advances in Neural Information Processing Systems, 33.

Resource type: Journal Article
BibTeX citation key: Inala2020
View all bibliographic details

Categories: Artificial Intelligence, Biological Science, Cognitive Science, Complexity Science, Computer Science, Data Sciences, Decision Theory, General, Mathematics
Subcategories: Autonomous systems, Big data, Decision making, Edge AI, Internet of things, Machine learning, Machine recognition, Markov models, Neural nets, Neurosymbolic, Systems theory
Creators: Bastani, Inala, Kumar, Paulos, Pu, Rinard, Solar-Lezama, Yang
Publisher:
Collection: Advances in Neural Information Processing Systems

Attachments

Abstract

We study the problem of inferring communication structures that can solve cooperative multi-agent planning problems while minimizing the amount of communication. We quantify the amount of communication as the maximum degree of the communication graph; this metric captures settings where agents have limited bandwidth. Minimizing communication is challenging due to the combinatorial nature of both the decision space and the objective; for instance, we cannot solve this problem by training neural networks using gradient descent. We propose a novel algorithm that synthesizes a control policy that combines a programmatic communication policy used to generate the communication graph with a transformer policy network used to choose actions. Our algorithm first trains the transformer policy, which implicitly generates a "soft" communication graph; then, it synthesizes a programmatic communication policy that "hardens" this graph, forming a neurosymbolic transformer. Our experiments demonstrate how our approach can synthesize policies that generate low-degree communication graphs while maintaining near-optimal performance.

WIKINDX 6.7.0 | Total resources: 1621 | Username: -- | Bibliography: WIKINDX Master Bibliography | Style: American Psychological Association (APA)