Fix bug in `isColumnSeparator()` parsing logic #61

nekno · 2022-03-13T22:49:56Z

The `isColumnSeparator()` logic is intended to check that each row contains the same (non-space) character in the same position.

Instead of checking the rows all the way down, however, it was only checking the first and second rows with a call to `slice(0,1)`.

Additionally, it was checking every character position, even though each column separator should be padded by a space, which makes the minimum distance between separators two positions.

Take the following table as an example:

AAA	BBB
BAB	ABA
ABA	BAB

If you enter that table as input, and then click the Parse button to parse the output, it considers any position where the characters in the same column position match across two rows to be a column separator.

So it interprets the middle column of each group as a separator, producing the following:

A	A	B	B
B	B	A	A
A	A	B	B

That stripped out the columns of:

A
A
B

and

B
B
A

because they have matching characters in the same column position in the first two rows.
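
To make the false positives concrete, here is a minimal sketch that compares a two-row check against an all-rows check on the example table. The helper names are hypothetical and are not the project's functions, and the rendered row layout (`| cell | cell |` with one space of padding and border lines already removed) is an assumption made for illustration.

```javascript
// Hypothetical helpers for illustration; these are not the project's functions.

// Rendered data rows for the example table, assuming a "| cell | cell |"
// layout with one space of padding and border lines already removed.
const rows = [
  '| AAA | BBB |',
  '| BAB | ABA |',
  '| ABA | BAB |',
];

// Flawed check: only the first two rows are compared.
function matchesFirstTwoRows(rows, index) {
  const ch = rows[0][index];
  return ch !== ' ' && rows[1][index] === ch;
}

// Intended check: the same non-space character appears in every row.
function matchesAllRows(rows, index) {
  const ch = rows[0][index];
  return ch !== ' ' && rows.every((row) => row[index] === ch);
}

// List every position that a given check accepts as a separator.
function separatorPositions(rows, check) {
  const positions = [];
  for (let i = 0; i < rows[0].length; i++) {
    if (check(rows, i)) positions.push(i);
  }
  return positions;
}

console.log(separatorPositions(rows, matchesFirstTwoRows)); // [0, 3, 6, 9, 12]
console.log(separatorPositions(rows, matchesAllRows));      // [0, 6, 12]
```

Under these assumptions, positions 3 and 9 are the middle characters of each group: they match in the first two rows (`A`/`A` and `B`/`B`) but not in the third.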

Changes

Checks each row for a matching character in the same column position all the way to the last row (not just the first two rows), by recursively evaluating all remaining rows with `slice(1)` on each iteration.
Checks that each column separator (aside from the first column at index 0) was preceded by a space.
Because each column separator is padded by a space, two column separators should be a minimum of two columns apart. Adding this criterion ensures that empty columns with no value and columns containing a single character can be parsed correctly.
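
The first two changes above could be sketched roughly as follows. This is an illustrative reconstruction under stated assumptions, not the code in this PR: the function names are hypothetical, and the rows are assumed to be rendered data lines in a `| cell | cell |` layout.

```javascript
// Illustrative sketch; function names are hypothetical, not taken from the PR.

// Recursively confirm that every row has `ch` at `index`,
// dropping the first row with slice(1) on each call.
function sameCharInEveryRow(rows, index, ch) {
  if (rows.length === 0) return true;
  if (rows[0][index] !== ch) return false;
  return sameCharInEveryRow(rows.slice(1), index, ch);
}

// A position is a separator candidate when its character is non-space,
// repeats in every row, and (except at index 0) follows a space in every row.
function isSeparatorAt(rows, index) {
  const ch = rows[0][index];
  if (ch === undefined || ch === ' ') return false;
  const precededBySpace =
    index === 0 || rows.every((row) => row[index - 1] === ' ');
  return precededBySpace && sameCharInEveryRow(rows, index, ch);
}

const table = ['| AAA | BBB |', '| BAB | ABA |', '| ABA | BAB |'];
console.log(isSeparatorAt(table, 6)); // true: "|" follows a space in every row
console.log(isSeparatorAt(table, 3)); // false: the third row has "B" here
```

The sketch deliberately leaves out the minimum-distance rule from the third change, so it should not be read as covering every test case listed in this description.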

Tests

Input the following tables, then click the Parse button to parse the output back into the input.
Observe that each table parses incorrectly at https://ozh.github.io/ascii-tables
Verify that each table parses correctly at https://nekno.github.io/ascii-tables (where the code for this PR is live).

Single value columns
```
A	B
A	B
```
Empty columns
```
A	B	C
A		C
```
```
A		C
A		C
```
Columns with repeating values
```
AAA	BBB
BAB	ABA
ABA	BAB
```

ozh · 2022-03-14T21:45:06Z

Hi, thanks for this PR. Could you provide an example of a bug that would be fixed with this code? I'm not sure I'm getting it :)

nekno · 2022-03-18T06:00:17Z

Hi @ozh — Your request to produce a test that demonstrated the fix challenged me to come up with a good, representative test, and I found that my code changes didn't address a lot of similar scenarios.

So I added some additional changes to more thoroughly evaluate the conditions for column separators, in order to handle the test cases I added in the updated PR description.

I'm hoping this is good-to-go now. 🤞

ozh · 2022-03-19T11:21:58Z

Super clear explanation. Thanks! :)

Fix bug in isColumnSeparator() logic

514c501

nekno added 2 commits March 17, 2022 02:48

Enhance isColumnSeparator() to check surrounding chars

97b4a1a

Skip minimum distance between columns

2762be9

nekno force-pushed the gh-pages branch from 7793322 to 2762be9 Compare March 18, 2022 05:29

ozh merged commit d92c1f0 into ozh:gh-pages Mar 19, 2022
