docs(csv): more examples for `parse` and `CsvParseStream` #5605

magurotuna · 2024-08-01T15:30:46Z

This commit adds more examples to parse function and CsvParseStream class to cover all the provided options.

Also fixes a few other things:

Replace stale description ParseError with the correct SyntaxError.
Fix the default value of comment property. The old comment says the default value is #, but this was wrong.
Get negative value in fieldsPerRecord option working in parse as documented (closes csv: fieldsPerRecord: -1 doesn't seem working #5616)

codecov · 2024-08-01T15:39:33Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.37%. Comparing base (cd0bc9f) to head (01e3d68).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #5605   +/-   ##
=======================================
  Coverage   96.37%   96.37%           
=======================================
  Files         466      466           
  Lines       37572    37574    +2     
  Branches     5539     5539           
=======================================
+ Hits        36211    36213    +2     
  Misses       1319     1319           
  Partials       42       42

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

magurotuna · 2024-08-01T15:39:38Z

csv/parse.ts

+ * const result = parse(string, { skipFirstRow: true });
+ *
+ * assertEquals(result, [{ a: "d", b: "e", c: "f" }]);
+ * assertType<IsExact<typeof result, Record<string, string | undefined>[]>>(true);


I added assertType here because I think this is a very good document for users given that the return type varies depending on the option we pass in.

While I was writing these assertions, though, I was unsure in what cases the value in the return value becomes undefined. Does anyone know why we have Record<string, string | undefined> instead of Record<string, string> here?

I think you can get an undefined value if you had the following input a,b,c\nd,,f. Perhaps, we should add another test for that case.

I tried the following three test cases and these pass. I couldn't find a case where the parsed value becomes undefined

await t.step({ name: "BlankField2", fn() { const input = "a,b,c\nd,,f"; assertEquals( parse(input, { skipFirstRow: true }), [{ a: "d", b: "", c: "f" }], ); }, }); await t.step({ name: "Exessive fields with skipFirstRow: true", fn() { const input = "a,b,c\nd,,f,g"; assertThrows( () => parse(input, { skipFirstRow: true }), Error, "Error number of fields line: 1\nNumber of fields found: 3\nExpected number of fields: 4", ); }, }); await t.step({ name: "Insufficient fields with skipFirstRow: true", fn() { const input = "a,b,c\nd,e"; assertThrows( () => parse(input, { skipFirstRow: true }), Error, "Error number of fields line: 1\nNumber of fields found: 3\nExpected number of fields: 2", ); }, });

Added (or fixed the existing) tests on what happens if the number of fields in records doesn't match that of the header. This fix contains the change in how we display the line number is error messages (which was 0-based but now changed to 1-based).

a91806b

iuioiua

Great work

iuioiua · 2024-08-01T23:47:53Z

csv/parse.ts

+ * const result = parse(string, { skipFirstRow: true });
+ *
+ * assertEquals(result, [{ a: "d", b: "e", c: "f" }]);
+ * assertType<IsExact<typeof result, Record<string, string | undefined>[]>>(true);


I think you can get an undefined value if you had the following input a,b,c\nd,,f. Perhaps, we should add another test for that case.

iuioiua · 2024-08-02T00:03:16Z

csv/parse.ts

+ * assertEquals(result, [["a", "b", "c"], ["d", "e", "f"]]);
+ * ```
+ *
+ * @example Trim leading space with `trimLeadingSpace: true`


Is there a reason there's an option to trim only leading spaces? Why not just trim the end too using String.prototype.trim()?

I'm not sure either, but according to the head comment of parse.ts, many parts of the implementation was ported from Go which has the same option.
https://github.com/golang/go/blob/go1.12.5/src/encoding/csv/reader.go#L136

I can't think of a reason to only trim the start... I suggest changing to trim: boolean. Not a strong opinion. What do we think?

Asked ChatGPT, it answered some application may add leading whitespaces when exporting to CSV. I don't know if this is true at all 😄 Anyway trim: boolean should be useful in some scenarios and I think we could add that after 1.0 without breaking things?

I think trailing spaces are usually considered as a part of value of the last column (For example, excel handles it in that way)

I couldn't find the case when this returns Record<string, string | undefined>, but I also noticed that fieldsPerRecord option doesn't seem working as described.

It says:

If negative, no check is made and records may have a variable number of fields.

But parse doesn't seem allowing wrong number of fields even when I passed negative number to this option. Maybe this current state is related to it?

But parse doesn't seem allowing wrong number of fields even when I passed negative number to this option. Maybe this current state is related to it?

Oh that's a good find. I'll add a test case and fix it

… into magurotuna/csv

kt3k

LGTM

…#5617) As we discussed in #5605 (comment), it seems like we never get `undefined` as a parse result of fields. If there is a mismatch in the number of fields across rows, the parse just throws an error. To better reflect this in typing, this commit removes `undefined` from the record value type.

magurotuna added 8 commits August 1, 2024 16:05

wrap the result in a code block

35944e1

fix the wrong default value

f75240e

more examples for parse

df4b749

link to MDN for SyntaxError

bfe0837

more specific import

39a3d17

wip

9aa83cc

Merge branch 'main' into magurotuna/csv

deac6d4

fieldsPerRecord example

1257b55

magurotuna requested a review from kt3k as a code owner August 1, 2024 15:30

github-actions bot added the csv label Aug 1, 2024

fix

23309d7

magurotuna commented Aug 1, 2024

View reviewed changes

magurotuna and others added 2 commits August 2, 2024 01:10

example

02030a4

tweaks

dcdd83f

iuioiua approved these changes Aug 2, 2024

View reviewed changes

This was referenced Aug 2, 2024

chore(csv): release csv@1.0.0 #5219

Merged

docs(csv): more examples for stringify and CsvStringifyStream #5606

Merged

magurotuna added 4 commits August 2, 2024 11:53

Merge branch 'main' into magurotuna/csv

97b1fc4

Merge branch 'magurotuna/csv' of https://github.com/magurotuna/deno_std…

55306c7

… into magurotuna/csv

show 1-based line number in header and record length mismatch

a91806b

Merge branch 'main' into magurotuna/csv

01e3d68

kt3k approved these changes Aug 2, 2024

View reviewed changes

fix negative fieldsPerRecord in parse

cd43e78

magurotuna merged commit b0f2088 into denoland:main Aug 2, 2024
13 checks passed

magurotuna deleted the magurotuna/csv branch August 2, 2024 04:24

magurotuna mentioned this pull request Aug 2, 2024

fix(csv): remove undefined from possible value type of parse result #5617

Merged

magurotuna mentioned this pull request Aug 2, 2024

csv: fieldsPerRecord: -1 doesn't seem working #5616

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(csv): more examples for `parse` and `CsvParseStream` #5605

docs(csv): more examples for `parse` and `CsvParseStream` #5605

magurotuna commented Aug 1, 2024 •

edited

Loading

codecov bot commented Aug 1, 2024 •

edited

Loading

magurotuna Aug 1, 2024

iuioiua Aug 1, 2024

magurotuna Aug 2, 2024

magurotuna Aug 2, 2024

iuioiua left a comment

iuioiua Aug 1, 2024

iuioiua Aug 2, 2024

magurotuna Aug 2, 2024

iuioiua Aug 2, 2024

magurotuna Aug 2, 2024

kt3k Aug 2, 2024

kt3k Aug 2, 2024

magurotuna Aug 2, 2024

kt3k left a comment

docs(csv): more examples for parse and CsvParseStream #5605

docs(csv): more examples for parse and CsvParseStream #5605

Conversation

magurotuna commented Aug 1, 2024 • edited Loading

codecov bot commented Aug 1, 2024 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

iuioiua left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kt3k left a comment

Choose a reason for hiding this comment

docs(csv): more examples for `parse` and `CsvParseStream` #5605

docs(csv): more examples for `parse` and `CsvParseStream` #5605

magurotuna commented Aug 1, 2024 •

edited

Loading

codecov bot commented Aug 1, 2024 •

edited

Loading