Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sure - I guess what I was asking is how to make sure everything is okay in the unstructured -> structured conversion.

"My name is John and I'm 40 years old" -> {name:"John", age:40}

How can you gain confidence that the AI doesn't spit out {name:"John", age:41}

The only thing I do currently is have a massive test suite to gain some statistical confidence it works, but I worry about situations like a person having a rare unicode character in their name (not to even speak of people intentionally trying to trick the system)



Don't have the AI do the data parsing. Have the AI write a parser and have the parser do the parsing. Think about how a person would parse vasts amounts of data. They write a parser to do it. Devil is of course in the details.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: