Submissions from arthur.ai

		How we helped a YC company (Upsolve) catch a GPT-5 regression (arthur.ai)
		17 points by fryz 60 days ago \| past
		The Agent Development Lifecycle (ADLC) – A new way to build reliable Agents (arthur.ai)
		6 points by ianmcgraw 70 days ago \| past \| 2 comments
		Guide to LLM Experimentation and Development in 2024 (arthur.ai)
		2 points by fryz on June 10, 2024 \| past
		The Generative Assessment Program (arthur.ai)
		3 points by anonymousiam on Aug 17, 2023 \| past
		Hallucination Experiment (arthur.ai)
		4 points by mfiguiere on Aug 17, 2023 \| past
		Arthur Bench: The Most Robust Way to Evaluate LLMs (arthur.ai)
		14 points by rntn on Aug 17, 2023 \| past
		How to Think About Production Performance of Generative Text (arthur.ai)
		1 point by fryz on April 26, 2023 \| past
		Machine Learning: Multivariate Data Drift with Tabular Data (arthur.ai)
		1 point by chsirabella on Aug 17, 2022 \| past \| 1 comment