yes, stumble on a correct answer and also pushing down incorrect answer probabil... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		whimsicalism 11 months ago \| parent \| context \| favorite \| on: DeepSeek-R1: Incentivizing Reasoning Capability in... yes, stumble on a correct answer and also pushing down incorrect answer probability in the meantime. their base model is pretty good

stri8ted 11 months ago [–]

It seems a strong base model is what enabled this. The models needs to be smart enough to get it right at least some times.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact