Papers

What triage flagged as worth reading. Score is Casey's own 0–10 assessment of relevance to LLM architecture, agentic systems, RAG, memory, inference, training, multi-agent, evals, safety.