#paper-review — quidproquo

ai deep-dive 2026年6月6日

LLM Agent 的技能管理革命：從 Voyager 到 MUSE-Autoskill 的 Skill Lifecycle 全景

MUSE-Autoskill（2026）提出五階段 skill 生命週期框架，自創 skill 在 SkillsBench 達 60.35%（+7.16%），成功生成 skill 的任務上更達 87.94%，超越人工撰寫上限。本文整合六篇 arXiv 論文，梳理 skill evolution 研究全景。

#agent-skills #ai-agent #llm #self-refinement #memory #arxiv #paper-review

ai deep-dive 2026年5月24日

把 LLM Agent 的 skills / tools / code interpreter 真正組裝起來：一份論文導讀地圖

LLM agent 的難點不是把 function calling、skill、code interpreter、文件工具各自做出來，而是把它們組成一個會選工具、會寫程式、會拆任務、會驗證結果、又不會被 prompt injection 打穿的系統。這篇把代表論文整理成六個工程決策：function calling 可靠度、tool/skill selection、code-as-action、多步 planning、skill 系統、安全與文件生成。

#llm #agents #tool-use #skills #code-interpreter #function-calling #paper-review