-
Notifications
You must be signed in to change notification settings - Fork 212
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No. Row group? #2540
Comments
According to the lance file layout, the current lance V2 cancels the concept of row group. What is the relationship between DataFragment and row group in the code? |
DataFragment is a table-level concept. It has a fixed number of rows. When you first write data, it typically corresponds to a single data file. This is different than a row group. Row groups are inside files; as in, there are multiple row groups in a file. But Lance V2 doesn't have row groups. The layout of data fragments is described here: https://lancedb.github.io/lance/format.html#fragments |
Thank you. Here's another question. If lance supports different number of rows for different columns, and DataFragment needs to have the same number of rows, how is this DataFragment represented? Is this expressed in one DataFragment, or different DataFragments? |
Each file must have the same number of rows per column. No row groups means there isn't a smaller unit that is required to have the same number of rows per column. |
According to the lance file layout, the current lance V2 cancels the concept of row group. What is the relationship between DataFragment and row group in the code?
The DataFragment concept describes how to express different numbers of rows in different columns of the same row. Is this function implemented?
The text was updated successfully, but these errors were encountered: