When I played around with tree-sitter a bit, I noticed there were situations wher...

maxbrunsfeld · on Feb 22, 2021

Most tree-sitter grammars just parse comments as a single token. Can you give an example of what you mean when you say "contents of the comment parsed out"?

Are you talking about conventions like JSDoc, for putting structured data inside of comments? On GitHub, we handle that by parsing JSDoc comments in a separate pass, using a separate parser. We do it this way because JSDoc isn't really part of the JavaScript language, not all projects use JSDoc, and not all applications are interested in parsing the text inside of comments.
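To make the "separate pass, separate parser" idea concrete, here is a minimal sketch. It is not GitHub's pipeline and does not use tree-sitter: the regex in pass 1 is only a stand-in for taking comment nodes from the host-language parse, and `extract_doc_comments` and `parse_tags` are hypothetical names invented for this illustration.

```python
import re

# Pass 1 stand-in (assumption): a real setup would take comment nodes from
# the host-language parser; a regex finding /** ... */ blocks plays that role.
DOC_COMMENT = re.compile(r"/\*\*(.*?)\*/", re.DOTALL)

# Pass 2: a tiny, hypothetical tag parser for lines like "@param a description".
TAG = re.compile(r"@(\w+)\s+(.*)")


def extract_doc_comments(source: str) -> list[str]:
    """Return the raw bodies of all /** ... */ comments in the source."""
    return [m.group(1) for m in DOC_COMMENT.finditer(source)]


def parse_tags(comment_body: str) -> list[tuple[str, str]]:
    """Parse (tag, text) pairs out of one comment body, ignoring other lines."""
    tags = []
    for line in comment_body.splitlines():
        # Drop indentation and the leading "*" that decorates each line.
        line = line.strip().lstrip("*").strip()
        m = TAG.match(line)
        if m:
            tags.append((m.group(1), m.group(2)))
    return tags


source = """
/**
 * Adds two numbers.
 * @param a first operand
 * @returns the sum
 */
function add(a, b) { return a + b; }
"""

# The second pass only runs for applications that care about comment contents.
docs = [parse_tags(body) for body in extract_doc_comments(source)]
```

The point of the split is that the first pass never needs to know what is inside a comment, so projects that do not use JSDoc pay nothing for it.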

gugagore · on Feb 22, 2021

My guess is that they meant parsing code that has been "commented out".

rattray · on Feb 22, 2021

I interpreted it to mean, "Remove the *s from code like this:"

    /* This comment
     * Should just be alphanumeric.
     */

gravypod · on Feb 23, 2021

Yep, this is exactly what I meant. Turning

    /* Something */

or

    { Something }

into:

    " Something "

Or, even better, into:

    "Something"
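A rough sketch of that transformation, done as post-processing on a comment token's text rather than inside the grammar. The function name `comment_text` and the list of delimiter pairs are assumptions for illustration; a real tool would pick the delimiters per language.

```python
import re


def comment_text(comment: str) -> str:
    """Strip comment delimiters and leading '*' decoration from one comment."""
    text = comment.strip()
    # Hypothetical delimiter table: C-style block comments and Pascal-style braces.
    for opener, closer in (("/*", "*/"), ("{", "}")):
        long_enough = len(text) >= len(opener) + len(closer)
        if long_enough and text.startswith(opener) and text.endswith(closer):
            text = text[len(opener):len(text) - len(closer)]
            break
    # Remove the " * " prefix that often decorates continuation lines,
    # then trim each line and drop the ones left empty.
    lines = [re.sub(r"^\s*\*\s?", "", line).strip() for line in text.splitlines()]
    return "\n".join(line for line in lines if line)


print(comment_text("/* Something */"))
print(comment_text("{ Something }"))
```

Both calls print `Something`, and a multi-line block comment with leading asterisks comes back as plain lines of text.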