105
6
How to replicate the Claude Code attack with Promptfoo (promptfoo.dev)
315
Questions censored by DeepSeek (promptfoo.dev)
18
Llama 3.2 (huggingface.co)
3
Automated jailbreaking techniques with DALL-E (promptfoo.dev)
8
Show HN: Automated red teaming for your LLM app (promptfoo.dev)
2
Benchmark Command R vs. GPT/Claude on your own data (promptfoo.dev)
1
DBRX vs. Mixtral vs. GPT: create your own benchmark (promptfoo.dev)
0
How to benchmark Gemini vs. GPT with your own data (promptfoo.dev)
3
A collection of LLM evaluation tools (ianww.com)
2
How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs (promptfoo.dev)
1
Benchmark Llama 2 vs. GPT on your own data (promptfoo.dev)
3
Show HN: CLI for testing and evaluating LLM prompts and outputs (github.com/promptfoo)
3
An open-source framework for prompt engineering (ianww.com)
5
Show HN: Promptfoo – CLI for testing & improving LLM prompt quality (github.com/typpo)
3
Show HN: Text-to-Chart – embeddable natural language charts (quickchart.io)
53
The Circumnavigators (2017) (qrp-labs.com)
1
Show HN: A discord bot that remixes your friends' profile pictures (ianww.com)
4
Show HN: Generate SQL Queries from English (querymuse.com)
2