-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add new table format presets. Add support for custom format #23
Conversation
docs/CHANGELOG.md
Outdated
@@ -3,6 +3,11 @@ | |||
|
|||
## 1.0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That could use an update as well :)
docs/CHANGELOG.md
Outdated
@@ -3,6 +3,11 @@ | |||
|
|||
## 1.0 | |||
|
|||
### 2.0.0 | |||
|
|||
- **[breaking change]** Make `:orc` table format to have `orc.column.index.access` disabled by default. `:orc_legacy` preset replaces the previous one |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- **[breaking change]** Make `:orc` table format to have `orc.column.index.access` disabled by default. `:orc_legacy` preset replaces the previous one | |
- **[breaking change]** Make `:orc` table format use column names instead of indexes by default. `:orc_legacy` preset preserves existing behavior. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Ciebiada I like the direction 👍
lib/egis/table_ddl_generator.rb
Outdated
STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat' | ||
OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat' | ||
SQL | ||
when :orc_legacy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
legacy
has a negative connotation. How about we call it explicitly like orc_with_index_access
?
How about we also support json format, we use it in Socialguide A+ query
when :json
'ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'
lib/egis/table_ddl_generator.rb
Outdated
|
||
def serde_properties(format) | ||
return '' unless format.key?(:serde_properties) | ||
return "ROW FORMAT #{format}" if format.is_a?(String) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As discussed the String option should also allow to define STORED AS option. I'd remove the "ROW FORMAT" from here and allow user to fill it in or not, for example if he just want to use "STORED AS PARQUET" option
@mkrawc I've merged this. However. If you have any further comments please let me know and I will get back to it |
Checklist
lib/egis/version.rb