Commit

header in multiple file chunks bugfix
Tagar committed Nov 5, 2017
1 parent c6c3cb2 commit 5ffe084
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion abalon/spark/sparkutils.py
@@ -186,7 +186,8 @@ def dataframeToHDFSfile (dataframe, dst_file, overwrite=False,

 (dataframe
 .write
-.option('header', header)
+.option('header', False) # always save without header as if there are multiple partitions,
+# each datafile will have a header - not good.
 .option('delimiter', delimiter)
 .option('quoteMode', quoteMode)
 .mode('overwrite') # temp directory will always be overwritten
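The comment in the hunk above gives the reason for the change: with several partitions, Spark writes one part file per partition, and with `header` enabled every part file starts with its own header row, so a concatenation of those files repeats the header in the middle of the data. The sketch below illustrates that reasoning in plain Python, without Spark or HDFS. It is not the code in `sparkutils.py`; `merge_parts_with_header`, the part file names, and the sample rows are hypothetical and exist only for this illustration.

```python
import os
import tempfile


def merge_parts_with_header(part_dir, dst_path, header_line=None):
    """Concatenate part-* files into dst_path, writing header_line once at the top.

    Hypothetical helper for illustration only; it is not part of abalon.
    """
    # Only data files are merged; marker files such as _SUCCESS are skipped.
    part_names = sorted(n for n in os.listdir(part_dir) if n.startswith("part-"))
    with open(dst_path, "w", newline="") as dst:
        if header_line is not None:
            dst.write(header_line + "\n")
        for name in part_names:
            with open(os.path.join(part_dir, name), newline="") as src:
                dst.write(src.read())
    return dst_path


# Simulate a temp directory holding two headerless part files,
# as a writer configured with header=False would produce.
tmp = tempfile.mkdtemp()
parts = {"part-00000": "1,a\n2,b\n", "part-00001": "3,c\n"}
for name, body in parts.items():
    with open(os.path.join(tmp, name), "w", newline="") as f:
        f.write(body)

merged = merge_parts_with_header(tmp, os.path.join(tmp, "merged.csv"), "id,value")
with open(merged) as f:
    merged_text = f.read()
```

Because the part files carry no header, the merged file contains the header row exactly once, regardless of how many partitions were written.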
