27
1
State of the Agent: Do coding agents know what they don't know? (thinkwright.ai)
2
Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents (thinkwright.ai)
2
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents (thinkwright.ai)
1
Show HN: Impulse Cycler – Transient Motion Resynthesizer (aftertone.co)
1