Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tfrecords to parquet #1085

Merged
merged 34 commits into from
Sep 17, 2021
Merged
Show file tree
Hide file tree
Changes from 31 commits
Commits
Show all changes
34 commits
Select commit Hold shift + click to select a range
d708172
API Overhaul
benfred Dec 14, 2020
abc0cb4
remove debug print statement
benfred Dec 14, 2020
b69ad67
Fix test_io unittest
benfred Dec 14, 2020
2fd3db9
Merge remote-tracking branch 'origin/main' into new_api
benfred Dec 14, 2020
2e918cd
Handle multi-column joint/combo categorify
benfred Dec 15, 2020
23a4eb9
Update JoinGroupby
benfred Dec 15, 2020
711efd8
Fix differencelag
benfred Dec 15, 2020
fd3f35a
add dependencies method (#498)
rnyak Dec 16, 2020
9b78a46
Convert TargetEncoding op
benfred Dec 16, 2020
82f1c17
Merge branch 'new_api' of github.com:NVIDIA/NVTabular into new_api
benfred Dec 16, 2020
5c28e85
Update nvtabular/workflow.py
benfred Dec 16, 2020
6a84e7d
Update nvtabular/workflow.py
benfred Dec 16, 2020
b21ffd8
Remove workflow code from dataloaders
benfred Dec 16, 2020
5467d86
Merge branch 'new_api' of github.com:NVIDIA/NVTabular into new_api
benfred Dec 16, 2020
f216edf
Unittest ops + bugfix in Bucketize (#496)
bschifferer Dec 16, 2020
b44dfa6
First draft get_embedding_sizes support
benfred Dec 16, 2020
27b4e33
isort
benfred Dec 16, 2020
a95a4d9
Remove groupbystatistics
benfred Dec 16, 2020
0e55c2a
implement serialization of statistics
benfred Dec 17, 2020
ee9367c
Fix TF dataloader unittests
benfred Dec 17, 2020
7bf624f
test_torch_dataloader fixes
benfred Dec 17, 2020
4c99186
doc strings
bschifferer Dec 17, 2020
aee86e2
resovled
bschifferer Jan 6, 2021
ef52f5a
Merge remote-tracking branch 'upstream/main'
bschifferer Feb 14, 2021
a2cdcc6
Merge remote-tracking branch 'upstream/main'
bschifferer Feb 22, 2021
1783f8c
merge
bschifferer Mar 29, 2021
3ebf331
t checkout master
bschifferer May 19, 2021
e51e75c
Merge branch 'main' of https://github.com/NVIDIA/NVTabular
bschifferer Aug 31, 2021
4125637
tfrecords to parquet
bschifferer Aug 31, 2021
f7669ba
leverage pandas-tfrecords
bschifferer Sep 1, 2021
390dacb
write to one parquet
Sep 10, 2021
797e575
updates
Sep 14, 2021
8183370
Merge branch 'main' into tfrecords_to_parquet
benfred Sep 16, 2021
f8f630f
Merge branch 'main' into tfrecords_to_parquet
benfred Sep 16, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading