| International Journal of Computer Applications |
| Foundation of Computer Science (FCS), NY, USA |
| Volume 187 - Number 56 |
| Year of Publication: 2025 |
| Authors: Prudhvi Ratna Badri Satya, Ajay Guyyala, Vijay Putta, Krishna Teja Areti |
10.5120/ijca2025925957
|
Prudhvi Ratna Badri Satya, Ajay Guyyala, Vijay Putta, Krishna Teja Areti . Robustness of Automated AI Agents Against Adversarial Context Injection in MCP. International Journal of Computer Applications. 187, 56 ( Nov 2025), 1-14. DOI=10.5120/ijca2025925957
Multi-agent systems based on the Model Context Protocol enable agents to share information, tool outputs, and memory across distributed servers. While this design supports complex tasks such as browsing, coding, and data entry, it also expands the attack surface through adversarial context injection. Malicious inputs can enter through web pages, APIs, files, or memory and persist across steps, making detection difficult. Existing defenses often target single prompts and fail to address multi-step persistence or crossserver propagation. To address this gap, a defense stack was introduced that combines schema checks, anomaly detection, trustweighted arbitration, and quarantine. Evaluation was conducted on WebArena, Mind2Web, and InjectBench using reproducible trials with clean and injected runs. Results showed a reduction in ASR from over 60% to as low as 16.3% and improvements in decision accuracy up to 62.7%, with modest overhead of 2.6–3.0 seconds per task. The findings highlight the importance of layered defenses, reproducible testing, and transparent reporting for safe deployment of automated agent networks.