Tag

#agent security

2026/07/24 AI Native 实践

Issue Agent 不能只靠审批按钮：把提议、批准和授权拆成三层控制面

基于 GitHub Issues 新增的 rationale、confidence 和 approvals 控制，本文设计一套可落地的 Issue Agent 工作流：用意图对象、状态机、最小权限与人工原因码，让自动分流可审计、可回滚，而不是把自动标签当作安全治理。

#AI Native #agentic workflow #GitHub Issues #human-in-the-loop #AI governance #workflow automation #developer productivity #agent security

2026/07/19 记忆安全

Agent 拒绝一次还不够：记忆投毒需要状态修复与跨会话验收

MemPoison 与 Bad Memory 共同暴露了一个常被忽略的缺口：当前会话拒绝恶意指令，不代表持久状态已恢复安全。本文把记忆投毒治理拆成写入、检索、消费、隔离、修复与跨会话验收，并给出可落地的数据模型、状态机和回归协议。

#Agent Memory #memory poisoning #prompt injection #persistent memory #AGENTS.md #CLAUDE.md #incident response #agent security

2026/07/13 安全工程

安全调查 Agent 要在证据图上回溯，而不是在日志里自由聊天

SherAgent 的生产实践表明，SOC 调查自动化的关键不是让 LLM 直接判断告警，而是用受约束查询、语义剪枝、调查树和显式失败终态，在日志缺失与依赖爆炸之间建立可审计的 query-filter-backtracking 控制环。

#SOC #attack investigation #provenance graph #security automation #agent security #LLM #OCSF #human-in-the-loop

2026/07/10 记忆安全

Agent 记忆安全要保护推理痕迹

FARMA 和 GhostWriter 把记忆投毒从污染事实条目推进到污染推理历史和个人助理工作状态。生产 Agent 不能把历史 reasoning、decision log 和经验摘要默认当成可信证据，而要在写入、检索、行动前同时做来源绑定、推理痕迹完整性检查和高风险动作授权。

#AI memory #agent memory #long-term memory #memory security #memory poisoning #reasoning trace #personal agents #agent security

2026/06/30 安全工程

CLAWAUDIT 把本地 LLM Agent 的 prompt builder、tool dispatcher、skill loader、memory writer、network client 和 permission gate 定义为新的静态审计边界。本文拆解它的五类运行时边界、Semgrep/CodeQL 双后端评测、语义盲区，以及一条可落地的 Agent runtime SAST 流程。

#agent security #white-box scanning #static analysis #Semgrep #CodeQL #SAST #Agent runtime #security automation

2026/06/24 安全工程

Agent 应用上线前需要一条白盒安全审计关口

Agent Audit、CodeBadger、CodeQL model packs 与 Semgrep Custom Workflows 共同指向一个工程判断：Agent 安全不能只靠提示词防护，而要把工具代码、MCP 配置、身份权限、记忆/上下文和 CI 证据做成可审计的发布关口。

#agent security #white-box scanning #MCP #static analysis #CodeQL #Joern #Semgrep #CI/CD #security automation

2026/06/15 安全分析

运行时记忆投毒防御：证书要绑定写路径，而不是只靠检索过滤

SMSR、MemVenom 和长期记忆安全综述把 Agent 记忆安全推到可验证治理阶段：生产系统不能只做 prompt filter，而要把来源签名、随机化检索、证书复算、回滚和工具调用审计放进同一条验收链。

#AI memory #agent memory #long-term memory #memory poisoning #memory security #RAG #agent security #memory evaluation

2026/06/15 安全工程

Agent 编排在网络安全里的正确位置：从告警流水线到可审计的安全工作流

Agent 编排不是让一个大模型直接接管安全运营，而是把 triage、证据收集、静态分析、威胁情报、检测工程、修复验证和人工审批组织成有状态、有权限边界、可回放的安全工作流。本文给出一套面向 SOC 与白盒扫描的工程方案。

#agent security #security automation #SOC #agent orchestration #white-box scanning #CodeQL #SARIF #human-in-the-loop

2026/06/11 安全分析

相似不等于可信：Agent 记忆检索需要准入门，而不只是向量召回

arXiv:2606.06054 MemGate 把个人 Agent 的长期记忆检索定义为信任边界。工程上，记忆读路径不能只按相似度把候选片段塞进上下文，而要在检索和注入之间增加任务条件准入、来源权威、作用域隔离和工具副作用绑定。

#AI memory #agent memory #long-term memory #memory security #RAG #agent security #memory evaluation #personalization

2026/06/08 安全分析

MPBench 的价值不是攻击库，而是 Agent 记忆写入面的安全地图

arXiv:2606.04329 把 Agent 记忆投毒从零散案例整理成写入通道、结构性漏洞和 ASR/RSR 评测问题。工程上真正该落地的是记忆写入面的资产清单、来源权威、写后审计和跨会话回归测试。

#AI memory #agent memory #long-term memory #memory poisoning #memory security #agent security #memory evaluation #prompt injection

2026/06/05 安全工程

Agent libOS：长期运行 Agent 的安全边界应该下沉到运行时原语

从 arXiv:2606.03895 Agent libOS 看，长期运行 Agent 的风险不只在 prompt、工具描述或扫描规则里，而在调度、对象记忆、权限授予、人类审批、恢复和审计这些运行时原语能否成为真正的授权边界。

#agent security #runtime security #capability #audit #MCP #white-box scanning #AI memory

2026/06/02 安全工程

没有证书，就不要执行：Agent 安全审计需要从日志转向可认证轨迹

从 arXiv:2605.24462 的 Certified Traces、AgentSecBench、Agent-BOM 和当前 Agent SDK/Bedrock 工程接口看，安全 Agent 的关键不是让模型解释得更像人，而是让每次工具调用、白盒扫描、修复和部署动作在执行前携带可检查的权限、来源、证据和回放条件。

#agent security #security audit #tool use #certified traces #white-box scanner #AgentSecBench #prompt injection #memory security

2026/05/18 网络安全

Agent + CPG + LFP：怎样构建一个可验证的白盒扫描器

本文把 Agent、Code Property Graph、Low False Positive Control Layer、规则引擎、数据流分析和验证沙箱合成一个白盒扫描器方案：不是让大模型直接猜漏洞，而是让它围绕代码图、低误报控制、证据链和 PoC 验证来工作。

#white-box scanner #CPG #LFP #static analysis #agent security #CodeQL #Joern