Hacker News

Serious question for people who are concerned about security here.

Do you believe that AI browser automation like this will lead to more or less information exfiltration overall (including phishing)?

I work at Anthropic, so maybe I'm biased, but it's not clear to me that this is worse than the status quo.





Well, instead of one agent (the user) to phish, there are two (the user and the browser agent), and you only need to convince one.

I claim that is not true, because very soon AI agents (probably built into Chrome) will detect phishing attempts and warn the user. In that case you need to phish the agent; tricking the human won't be enough.

If the human is much easier to phish than the agent (which I believe is true in most cases), then this would be a win.


Yet you add another attack vector: something that is very willing to do stuff, as long as you prompt it right…

As Simon Willison clearly laid out, 99% secure isn’t secure, and you think you can fix it by adding more/better prompts?

Which methods do you have planned outside of “better prompting/fine tuning”?



