AI summary 2 แหล่ง · 2 วันก่อน

วิจัยใหม่แก้ปัญหา LLM Agent ในงานยาว: จัดการ Context อัจฉริยะ ตรวจสอบความน่าเชื่อถือ

นักวิจัยเผยวิธีแก้ปัญหาหลักของ LLM agents ในงานระยะยาว — context degradation, distribution shift, และ prompt drift ที่ทำให้ agent ล้มเหลว งานวิจัยใหม่ๆ เสนอ AdaCoM (adaptive context management), event-sourced architecture, runtime verification, และ write-time intelligence เพื่อให้ agents ทำงานได้เสถียรและตรวจสอบได้ในระบบจริง ปัญหาเดิมคือ context ยาวขึ้น agent ใจลอย หรือ prompt เปลี่ยนแปลงเงียบๆ ตอนนี้มีวิธีควบคุมและตรวจสอบแบบ fine-grained แล้ว

แหล่งข่าว

ประเด็น

2 วันก่อน

อัปเดต

Context management ต้องปรับตัวตามแต่ละ agent — fixed strategy (summarization) ไม่พอ AdaCoM ทำได้โดยไม่ต้อง retrain closed-source models
Distribution shift ในหลายรอบสนทนาเพิ่มขึ้นแบบ quadratic — ต้องใช้ calibrated interactive RL แทน static offline logs
Runtime verification + event-sourced logs ให้ trace ได้ว่า agent เปลี่ยนใจเพราะอะไร (evidence, anchoring, prompt drift) ไม่ใช่ blackbox

แหล่งต้นทาง · 15

ลิงก์ต้นทางอยู่ครบ เพื่อให้เปิดอ่านเต็มและเทียบข้อมูลเองได้

arXiv — cs.AI 2 วันก่อน

Grokers: Bottom-Up Inductive Comprehension and Write-Time Intelligence over Typed Knowledge Graphs

arXiv — cs.AI 3 วันก่อน

Learning Agent-Compatible Context Management for Long-Horizon Tasks

arXiv — cs.AI 27 พ.ค.

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

arXiv — cs.AI 26 พ.ค.

Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction

arXiv — cs.AI 26 พ.ค.

BODHI: Precise OS Kernel Specification Inference