AI Supply Chain Blindspot: When 'Safe' Agent Skills Go Rogue
Just read the report on AIR's experiment with a fake agent skill. They created a harmless skill that just collected emails, pushed it to a marketplace and via Instagram ads, and it reached ~26,000 agents—including corporate instances.
The kicker? Every security scanner they tested marked it as "safe."
This highlights a massive gap in how we're securing Agentic AI. We're treating these like traditional apps, scanning for known bad signatures or obvious malicious strings, but the logic flow within the agent's "skill" definition is going unchecked. If a skill is designed to exfiltrate data slowly or perform lateral movement, will static scanners catch it? Probably not.
I put together a quick Python snippet to check a skill manifest for suspicious outbound endpoints, as a stop-gap:
import yaml
import re
def scan_skill_manifest(manifest_path):
with open(manifest_path, 'r') as f:
config = yaml.safe_load(f)
risky_endpoints = []
# Look for non-internal API calls
for api in config.get('api_endpoints', []):
if not api.startswith('https://internal.corp.com'):
risky_endpoints.append(api)
return risky_endpoints
We need to start treating agent skills like untrusted code. How is your org handling the vetting of third-party AI skills? Are you allowing marketplace skills in your corporate environment, or is it a total ban?
We're struggling with this exact visibility issue. We don't have a dedicated tool yet, so we're relying on SIEM correlation for now. We're looking for agents that trigger specific tool-use tags but don't correlate with a known user ticket. It's noisy, but it's better than nothing. The AIR report proves we need agent-aware firewall rules.
This is purely a social engineering attack vector targeting the AI, not the user. The user thinks they are adding a productivity tool, the AI thinks it's executing a standard function. I've been testing prompts to see if agents will execute skills based on ad-hoc suggestions rather than verified intents. The success rate is terrifying.
Zero trust for agents is the only way forward. No marketplace skills in production. If a team wants a skill, they build it in-house, we audit the YAML/JSON definition, and host it in a private registry. It slows down deployment, but after seeing that 26k number, I don't care.
Verified Access Required
To maintain the integrity of our intelligence feeds, only verified partners and security professionals can post replies.
Request Access