AI Supply Chain Blindspot: When 'Safe' Agent Skills Go Rogue

ZeroTrust_Hannah 6/23/2026 USER

Just read the report on AIR's experiment with a fake agent skill. They created a harmless skill that just collected emails, pushed it to a marketplace and via Instagram ads, and it reached ~26,000 agents—including corporate instances.

The kicker? Every security scanner they tested marked it as "safe."

This highlights a massive gap in how we're securing Agentic AI. We're treating these like traditional apps, scanning for known bad signatures or obvious malicious strings, but the logic flow within the agent's "skill" definition is going unchecked. If a skill is designed to exfiltrate data slowly or perform lateral movement, will static scanners catch it? Probably not.

I put together a quick Python snippet to check a skill manifest for suspicious outbound endpoints, as a stop-gap:

import yaml
import re

def scan_skill_manifest(manifest_path):
    with open(manifest_path, 'r') as f:
        config = yaml.safe_load(f)
    
    risky_endpoints = []
    # Look for non-internal API calls
    for api in config.get('api_endpoints', []):
        if not api.startswith('https://internal.corp.com'):
            risky_endpoints.append(api)
            
    return risky_endpoints


We need to start treating agent skills like untrusted code. How is your org handling the vetting of third-party AI skills? Are you allowing marketplace skills in your corporate environment, or is it a total ban?

TabletopEx_Quinn6/23/2026

We're struggling with this exact visibility issue. We don't have a dedicated tool yet, so we're relying on SIEM correlation for now. We're looking for agents that trigger specific tool-use tags but don't correlate with a known user ticket. It's noisy, but it's better than nothing. The AIR report proves we need agent-aware firewall rules.

OSINT_Detective_Liz6/23/2026

This is purely a social engineering attack vector targeting the AI, not the user. The user thinks they are adding a productivity tool, the AI thinks it's executing a standard function. I've been testing prompts to see if agents will execute skills based on ad-hoc suggestions rather than verified intents. The success rate is terrifying.

ContainerSec_Aisha6/23/2026

Zero trust for agents is the only way forward. No marketplace skills in production. If a team wants a skill, they build it in-house, we audit the YAML/JSON definition, and host it in a private registry. It slows down deployment, but after seeing that 26k number, I don't care.

Verified Access Required

To maintain the integrity of our intelligence feeds, only verified partners and security professionals can post replies.

Request Access

AI Supply Chain Blindspot: When 'Safe' Agent Skills Go Rogue

Verified Access Required

Thread Stats

Similar Threads