Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Check out the simd JSON project if you’re interested in a super fast JSON parser:

https://github.com/lemire/simdjson

I’ve been using to process and maintain giant JSON structures and it’s faster than any other parser I’ve tried. I was able to replace my previous batch job with this as it gives real-time performance.



This seems to have nothing to do with the article though?


It’s a JSON parser?


How does it do on the article's test suite?


[ Original designer of much of simdjson here ]

We haven't used that particular suite, but almost everything in that suite is something we've thought about. In many cases we do the right thing by not innovating and randomly allowing stuff that isn't in the spec.

I see exactly one thing we didn't think about, as our construction of a parse tree is pretty basic and we don't build an associative structure even when building up an object - thus we would not register an error when confronted with the malformed input listed under "2.4 Objects Duplicated Keys", but happily build a parse tree with duplicated keys (which will be built up strictly as a linear structure, not an associative one).

There seems to be leeway on this point as to what an implementation should do. It certainly doesn't fit our usage model very well to build a associative structure right there on the spot - some of our users wouldn't want that much complexity/overhead.


I haven’t tested it but it parses all my JSON just fine




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: