Fix JSON serialization error when creating flow runs #7385

peytonrunyan · 2022-10-31T12:57:06Z

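flow.serialize_parameters is called when the client creates a flow run. In certain cases, such as when serializing dataframes, it can return a value that is not valid JSON because a dictionary key is not a string. For example, pd.DataFrame({'col':['1']}) becomes {"col": {0: "1"}}, where the key 0 is an integer.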

This resulted in an error when calling FlowRunCreate.dict(json_compatible=True).

This pull request modifies FlowRunCreate to use orjson.dumps with OPT_NON_STR_KEYS enabled to handle these cases. The change is made at the child level because allowing non-string keys has performance implications, so this limits the performance hit to creating flow runs.

I tried a very simple flow with print statements to check that this is behaving as expected in aggregate (the combination of parameter serialization + flow run serialization), but I'd like a sanity check from @madkinsz to make sure that I'm not missing something.

from prefect import task, flow
import pandas as pd

@task
def task1():
    return 'complete'
    
@flow
def flow1(df):
    print(df.dtypes)
    print(df.head())
    task()
    
df_works = pd.DataFrame({'col':[1]})
flow1(df_works)

df_used_to_error = pd.DataFrame({'col':['1']})
flow1(df_used_to_error)

08:56:01.701 | INFO    | prefect.engine - Created flow run 'masterful-lorikeet' for flow 'flow1'
col    int64
dtype: object
   col
0    1
08:56:01.921 | INFO    | Flow run 'masterful-lorikeet' - Finished in state Completed()
08:56:02.029 | INFO    | prefect.engine - Created flow run 'mysterious-jackdaw' for flow 'flow1'
col    object
dtype: object
  col
0   1
08:56:02.201 | INFO    | Flow run 'mysterious-jackdaw' - Finished in state Completed()

Example

Checklist

This pull request references any related issue by including "closes <link to issue>"
- If no issue exists and your change is not a small fix, please create an issue first.
This pull request includes tests or only affects documentation.
This pull request includes a label categorizing the change e.g. fix, feature, enhancement

netlify · 2022-10-31T12:57:10Z

✅ Deploy Preview for prefect-orion ready!

Name	Link
🔨 Latest commit	`4fcadcf`
🔍 Latest deploy log	https://app.netlify.com/sites/prefect-orion/deploys/635fc6801e2d7c0008cff964
😎 Deploy Preview	https://deploy-preview-7385--prefect-orion.netlify.app

anna-geller · 2022-10-31T13:25:01Z

Quick QA:

Version:             2.6.5
API version:         0.8.3
Python version:      3.10.6
Git commit:          9fc2658f
Built:               Thu, Oct 27, 2022 2:24 PM
OS/Arch:             darwin/arm64
Profile:             dev
Server type:         cloud

Using this branch, works:

zanieb · 2022-11-02T17:44:01Z

@zangell44 any qualms?

I like that this is scoped to the create action!

peytonrunyan added 3 commits October 28, 2022 14:05

update FlowRunCreate encoder

d1160ca

add tests

1b37fe9

update tests

c30b3e1

peytonrunyan added the v2 and fix (A fix for a bug in an existing feature) labels Oct 31, 2022

Merge branch 'main' into serializer-representation

4fcadcf

peytonrunyan marked this pull request as ready for review October 31, 2022 13:43

peytonrunyan requested review from zangell44 and zanieb as code owners October 31, 2022 13:43

zanieb approved these changes Nov 2, 2022


zanieb merged commit f25f035 into main Nov 3, 2022

zanieb deleted the serializer-representation branch November 3, 2022 16:31

zanieb pushed a commit that referenced this pull request Nov 3, 2022

Fix JSON serialization error when creating flow runs (#7385)

53e63c5

peytonrunyan mentioned this pull request Dec 15, 2022

Float64-only pandas Dataframes not supported as flow arguments #7910

Closed

4 tasks
