Cynddl - Hacker News

11

Thomson Reuters built its own AI model that now ranks among the best (thomsonreuters.com)

23 hours ago Cynddl thomsonreuters.com

1

Hijacking Defensive Cyber AI Agents for Remote Code Execution (ainowinstitute.org)

3 weeks ago Cynddl ainowinstitute.org

1

Double Agents: Defensive AI Agents Magnify Cyber Risks (ainowinstitute.org)

3 weeks ago Cynddl ainowinstitute.org

2

Our evaluation of OpenAI's GPT-5.5 cyber capabilities (aisi.gov.uk)

3 months ago Cynddl aisi.gov.uk

54

Making AI chatbots friendly leads to mistakes and support of conspiracy theories (theguardian.com)

4 months ago Cynddl theguardian.com

44

UK Biobank health data keeps ending up on GitHub (rocher.lc)

4 months ago Cynddl rocher.lc

1

Tracking takedown notices filed by UK Biobank (rocher.lc)

4 months ago Cynddl rocher.lc

2

ChatGPT Edu feature reveals researchers' project metadata across universities (fastcompany.com)

5 months ago Cynddl fastcompany.com

1

AI no better than other methods for patients seeking medical advice, study shows (reuters.com)

6 months ago Cynddl reuters.com

4

AI chatbots pose 'dangerous' risk when giving medical advice, study suggests (bbc.co.uk)

6 months ago Cynddl bbc.co.uk

1

Show HN: Small, anonymous app for teams to do retrospective sessions (rocher.lc)

6 months ago Cynddl rocher.lc

1

Measuring What Matters: Construct Validity in Large Language Model Benchmarks (arxiv.org)

9 months ago Cynddl arxiv.org

14

AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds (gizmodo.com)

9 months ago Cynddl gizmodo.com

3

AI's capabilities may be exaggerated by flawed tests, according to new study (nbcnews.com)

9 months ago Cynddl nbcnews.com

2

Experts find flaws in tests that check AI safety and effectiveness (theguardian.com)

9 months ago Cynddl theguardian.com

1

Measuring What Matters: Construct Validity in Large Language Model Benchmarks (oxrml.com)

9 months ago Cynddl oxrml.com

2

The quiet software tooling Renaissance (pdx.su)

11 months ago Cynddl pdx.su

4

Facial recognition works better in the lab than on the street, researchers show (theregister.com)

12 months ago Cynddl theregister.com

1

We Shouldn't Trust Facial Recognition's Glowing Test Scores (techpolicy.press)

12 months ago Cynddl techpolicy.press

135

Training language models to be warm and empathetic makes them less reliable (arxiv.org)

12 months ago Cynddl arxiv.org

3

AI's limited understanding of gender puts health equity at risk (ox.ac.uk)

a year ago Cynddl ox.ac.uk

1

Establishing meaningful data access for algorithm audits (ox.ac.uk)

a year ago Cynddl ox.ac.uk

1

Alpha Lyrae: This font 'randomly' pixelates characters in a block of text (vegaprotocol.github.io)

a year ago Cynddl github.io

1

Data anonymity methods and privacy safeguards unfit for modern data (ox.ac.uk)

2 years ago Cynddl ox.ac.uk