AI Research

AI research

Papers and publications

Recent papers and long-form publications, newest first. Where a public arXiv version exists, it is linked directly.

The current thread running through the work is AI safety, agent security, model unlearning, LLM robustness, confidential systems, and infrastructure at cloud scale.

Google Scholar profile

GitHub

2026

2026 | arXiv:2602.11416

Optimizing Agent Planning for Security and Autonomy

Authors: Aashish Kolluri, Rishi Sharma, Manuel Costa, Boris Kopf, Tobias Niessen, Mark Russinovich, Shruti Tople, Santiago Zanella-Beguelin | Venue: arXiv preprint

This paper argues that deterministic, information-flow-based defenses for AI agents become much more practical once planning is optimized correctly. The work focuses on preserving strong security guarantees against indirect prompt injection without paying unnecessary costs in task completion or token usage.

Papers and publications

2026

Optimizing Agent Planning for Security and Autonomy

GRP-Obliteration: Unaligning LLMs With a Single Unlabeled Prompt

Hey, That’s My Model! Introducing Chain & Hash, an LLM Fingerprinting Technique

Redefining the Software Engineering Profession for AI

2025

The Price of Intelligence

A Representation Engineering Perspective on the Effectiveness of Multi-Turn Jailbreaks

LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs

LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge

Securing AI Agents with Information-Flow Control

Jailbreaking is (Mostly) Simpler Than You Think

Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models

Lessons From Red Teaming 100 Generative AI Products

Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack

2024

The Price of Intelligence: Three Risks Inherent in LLMs

Confidential Computing Proofs

2023

Why Should I Trust Your Code?

Confidential Computing: Elevating Cloud Security and Privacy

Confidential Consortium Framework: Secure Multiparty Applications with Confidentiality, Integrity, and High Availability

Who’s Harry Potter? Approximate Unlearning in LLMs

Why Should I Trust Your Code? Confidential Computing Enables Users to Authenticate Code Running in TEEs, but Users Also Need Evidence This Code Is Trustworthy.

Confidential Computing: Elevating Cloud Security and Privacy: Working toward a More Secure and Innovative Future

2022

Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads

IA-CCF: Individual Accountability for Permissioned Ledgers

2021

Toward Confidential Cloud Computing

Virtual Machine Preserving Host Updates for Zero Day Patching in Public Cloud

Toward Confidential Cloud Computing: Extending Hardware-Enforced Cryptographic Protection to Data While in Use

2020

Toward ML-Centric Cloud Platforms

Protean: VM Allocation Service at Scale

Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider

Selected repositories

RefChecker

gRPC shared memory transport

Other public projects