Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
How we helped a YC company (Upsolve) catch a GPT-5 regression (arthur.ai)
17 points by fryz 60 days ago | past
The Agent Development Lifecycle (ADLC) – A new way to build reliable Agents (arthur.ai)
6 points by ianmcgraw 70 days ago | past | 2 comments
Guide to LLM Experimentation and Development in 2024 (arthur.ai)
2 points by fryz on June 10, 2024 | past
The Generative Assessment Program (arthur.ai)
3 points by anonymousiam on Aug 17, 2023 | past
Hallucination Experiment (arthur.ai)
4 points by mfiguiere on Aug 17, 2023 | past
Arthur Bench: The Most Robust Way to Evaluate LLMs (arthur.ai)
14 points by rntn on Aug 17, 2023 | past
How to Think About Production Performance of Generative Text (arthur.ai)
1 point by fryz on April 26, 2023 | past
Machine Learning: Multivariate Data Drift with Tabular Data (arthur.ai)
1 point by chsirabella on Aug 17, 2022 | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: