feat: support writing the tests in two separate input and output files #1536

aminya · 2021-12-15T11:16:23Z

This adds support for writing the tests in separate neighbour files
The input file name should match *.tsin.*.
The output file name should match *.tsout.scm

For example, test1.tsin.js and test1.tsout.scm

The output file is automatically updated if the --update flag is passed.

maxbrunsfeld · 2021-12-23T18:47:03Z

I don't think I want to support a separate alternative way of writing tests; I'd rather just have one way of doing things. Is there a problem with the current test file format that you're hitting?

This reminds me that we never documented this feature, which allows you to write tests for source code containing --- or ===.

aminya · 2021-12-23T18:53:40Z

I don't think I want to support a separate alternative way of writing tests; I'd rather just have one way of doing things. Is there a problem with the current test file format that you're hitting?

Yes. The main reason is that testing existing code is hard with the current format. Tree sitter invents its own format, which is different than the actual usage of the language. This means that manual work is involved in writing such tests.
The format I am proposing also allows things like snapshot tests. Tree sitter can automatically generate the output if requested, and then use that as the expected output.

maxbrunsfeld · 2021-12-23T19:00:35Z

Tree sitter can automatically generate the output if requested, and then use that as the expected output.

That's already supported in the current system using the tree-sitter test --update command.

aminya · 2021-12-26T07:42:24Z

Tree sitter can automatically generate the output if requested, and then use that as the expected output.

That's already supported in the current system using the tree-sitter test --update command.

Yeah, but it is hard for the files of a language in some external project. People should do this by hand.

This adds support for writing the tests in separate neighbor files The input file name should match `*.tstest.*` The output file name should match `*.tstest.scm` For example, `test1.tstest.js` and `test1.tstest.scm`

sogaiu · 2023-02-23T10:22:41Z

I think one of the original points of the PR was that other projects already might have tests in some existing format and that it would be nice to be able to work with an existing setup without having to change too much.

Apart from that though a few things I've noticed while using the built-in corpus tests include:

Upon test failure, I don't see information about start and end positions of nodes (or field names) displayed (the green and red text is pretty though)
To get that kind of information I typically go searching for the file the test that failed was in, look through the file to find the relevant input, copy that input to another file, and then run the parse subcommand (may be I'm unaware of a better method)

I'm trying out an alternative arrangement where I have one file for each input with a descriptive name and a corresponding file that contains output from the parse subcommand (typically the s-expression output with field and position info).

Now when there is a failure I am presented with a file path (no searching necessary) but also the expected and actual parse information. Here's a bit of a sample (some parts are elided as ... for brevity):

1..44
ok 1
ok 2
ok 3
...
ok 19
not ok 20 - test/input/sym_lit-unihan.janet
  ---
  found:
    (source [0, 0] - [1, 0]
      (sym_lit [0, 0] - [0, 6]))
  wanted:
    (source [0, 0] - [1, 0]
      (sym_lit [0, 0] - [0, 5]))
  ...
ok 21
...

I chose TAP (well, something close enough) so I can feed the output to a TAP consumer (of which there are a number to choose from) and see a concise summary.

Below is some sample output assuming the whole output from the example above is fed to it:

...................F........................
======================================================================
FAIL: <file=stream>
- test/input/sym_lit-unihan.janet
----------------------------------------------------------------------

----------------------------------------------------------------------
Ran 44 tests in 0.000s

FAILED (failures=1)

As the path is presented in such a way that no extra info needs to be added to it (e.g. like corpus or test/corpus), it's straight-forward to view files that lead to test failures as well as use parse on them.

It seems that this type of approach could be applied to larger ("real-world") files relatively easily as well. Something I presume many folks are already doing as hand-crafted small examples, though useful in more than one way, don't seem sufficient for testing purposes.

aminya · 2023-10-25T17:26:33Z

@sogaiu Yes, the approach I added in this merge request is very crucial for me as I don't need to do any preprocessing to make the tests ready, and any file with the correct extension can be a test input. I treat the tests as snapshot tests, and the generated grammar structure is like the snapshot. This has facilitated testing significantly, and it is a great addition to the tree-sitter.

aminya added 2 commits January 21, 2022 22:41

feat: support writing the tests in two separate input and output files

ec58463

This adds support for writing the tests in separate neighbor files The input file name should match `*.tstest.*` The output file name should match `*.tstest.scm` For example, `test1.tstest.js` and `test1.tstest.scm`

fix: use tsin and tsout as extensions

7e3268e

aminya force-pushed the test-file branch from c3ba052 to 7e3268e Compare January 22, 2022 06:41

aminya added 4 commits November 11, 2022 15:40

Merge remote-tracking branch 'upstream/master' into test-file

a875d12

fix: directly make the test entry

45ad15d

fix: support updating the tsout.scm files on --update

adbb9a7

fix: do not pop the first char for constructed inputs

1ded95c

aminya force-pushed the test-file branch from 6969800 to 1ded95c Compare November 12, 2022 00:45

maxbrunsfeld force-pushed the master branch from 3707f48 to 125503f Compare February 14, 2023 07:40

sogaiu mentioned this pull request Feb 24, 2023

Formatting of grammar.js is unusual sogaiu/tree-sitter-clojure#39

Closed

sogaiu mentioned this pull request Oct 25, 2023

Add a --test-input <test_number> parameter for the CLI parse command #2726

Closed

sogaiu mentioned this pull request Nov 20, 2023

Include field names in test logs #2772

Closed

amaanq force-pushed the master branch from c206aad to 0a5a564 Compare March 10, 2024 21:15

amaanq force-pushed the master branch from 16be3ee to d569d0e Compare March 17, 2024 23:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support writing the tests in two separate input and output files #1536

feat: support writing the tests in two separate input and output files #1536

aminya commented Dec 15, 2021 •

edited

Loading

maxbrunsfeld commented Dec 23, 2021

aminya commented Dec 23, 2021

maxbrunsfeld commented Dec 23, 2021

aminya commented Dec 26, 2021 •

edited

Loading

sogaiu commented Feb 23, 2023 •

edited

Loading

aminya commented Oct 25, 2023

feat: support writing the tests in two separate input and output files #1536

Are you sure you want to change the base?

feat: support writing the tests in two separate input and output files #1536

Conversation

aminya commented Dec 15, 2021 • edited Loading

maxbrunsfeld commented Dec 23, 2021

aminya commented Dec 23, 2021

maxbrunsfeld commented Dec 23, 2021

aminya commented Dec 26, 2021 • edited Loading

sogaiu commented Feb 23, 2023 • edited Loading

aminya commented Oct 25, 2023

aminya commented Dec 15, 2021 •

edited

Loading

aminya commented Dec 26, 2021 •

edited

Loading

sogaiu commented Feb 23, 2023 •

edited

Loading