Articles by ammar_x
17

DeepSWE: A contamination-free benchmark for long-horizon coding agents (datacurve.ai)

4

Xiaomi Mimo-v2.5 pricing is now permanently reduced (twitter.com/xiaomimimo)

2

Exploring Goodreads data: Analysis of 10M books (ammar-alyousfi.com)

1

A study confirms: Big changes in GPT-4 performance since its launch (twitter.com/emollick)

1

What's your favorite interface for GPT API?

5

Ask HN: What's your go-to platform for written discussions, and why?

7

Ask HN: Have you noticed decreased quality in GPT-4 reasoning recently?

1

Ask HN: What is the main communication channel at your remote company?