Don't Split Frames for UCX #3584
Conversation
```python
@pytest.mark.asyncio
async def test_serialize_no_splitting():
    cp = pytest.importorskip("cupy")
```
I am not super happy that this requires cupy, but I was unable to trigger the issue with numpy. Open to suggestions if folks have any.
Yeah, there isn't one really, as this is done correctly when using the "dask" serializer. We have to use something that uses the "cuda" serializer.
@mrocklin @jakirkham changed the PR around based on comments to not split frames when using UCX
LGTM. Thanks Ben! 😄
LGTM, verified that Naive Bayes is working as expected with these changes. Thanks again, Ben!
Also, thank you to @jakirkham for spending the time to help me narrow this down.
distributed/protocol/core.py
Outdated
```diff
-frames = frame_split_size(frames)
+# splitting frames is not the default behavior for UCX
+if split_frames:
+    frames = frame_split_size(frames)
```
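A runnable sketch of the conditional above; `split_frames` follows the diff, while `maybe_split` and the small shard size `n` are made up here for illustration (dask's real splitter shards at a much larger size):

```python
def maybe_split(frames, split_frames=True, n=4):
    """Shard each frame into chunks of at most n bytes, unless splitting
    is disabled (as the UCX comm does, so device buffers stay whole)."""
    if not split_frames:
        return list(frames)
    out = []
    for frame in frames:
        view = memoryview(frame)
        out.extend(bytes(view[i : i + n]) for i in range(0, len(view), n))
    return out
```

With `split_frames=False` the frames pass through untouched, which is exactly the behavior UCX needs so that deserialization sees whole device buffers.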
It looks like we only consider splitting frames due to compression. I wonder if we can make this a bit simpler and avoid the keyword argument if we always include "compression": False in the header?
That would probably work but there may be a GPU compression story in the future. I suppose we could handle that then if/when it becomes available
Give me a couple minutes to look things over
I still suggest that we go with setting compression=False for now rather than introduce the new keyword argument.
Tying frame splitting to compression is not entirely clean, but it does happen to be the only reason to split frames today. In the future if that continues to be the case then we might have compressors register a maximum frame size. If that doesn't continue to be the case (perhaps some comms also have a maximum size limit) then we will have to rejigger this code regardless.
However, looking at this a bit more, I notice that cuda_serialize already specifies a compression value. My guess is that this is coming from serializing tuples/lists/dicts of objects, and that we don't move compression values through in that case. I'll take a look at this later tonight.
Yeah ok, so coupling compression with serialization gets complicated, at least with our current logic.
distributed/distributed/protocol/core.py
Lines 51 to 57 in 511427b
```python
if "compression" not in head:
    frames = frame_split_size(frames)
    if frames:
        compression, frames = zip(*map(maybe_compress, frames))
    else:
        compression = []
    head["compression"] = compression
```
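A self-contained sketch of that gate, with a toy `maybe_compress` (zlib is used purely for illustration; dask's real helper picks lz4/snappy and also skips incompressible data):

```python
import zlib


def maybe_compress(frame, min_size=16):
    # Toy stand-in: compress only frames large enough to be worth it.
    if len(frame) >= min_size:
        return "zlib", zlib.compress(frame)
    return None, frame


def finalize(head, frames):
    # Mirrors the snippet above: only compress (and thus consider
    # splitting) when a serializer hasn't already claimed "compression".
    if "compression" not in head:
        if frames:
            compression, frames = zip(*map(maybe_compress, frames))
            frames = list(frames)
        else:
            compression = []
        head["compression"] = list(compression)
    return head, frames
```

This is why a serializer that pre-sets `compression` (as the CUDA serializers do) bypasses both compression and frame splitting entirely.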
Currently a serializer can choose to pre-compress data. If so, it adds the compression used to the header. Great, we can pass on compression (and thus frame splitting). The CUDA serializers avoid this by setting compression=None, claiming "I've already handled compression, don't bother" to downstream. Great. If we send a single cupy array or something along, then we'll be fine.
However, if we happen to include a few things in a small dict/tuple/list then we lose this information. The headers for the individual bits get thrown into a subheader, and so we lose direct access to the compression.
distributed/distributed/protocol/serialize.py
Lines 144 to 181 in 511427b
```python
if (
    type(x) in (list, set, tuple)
    and len(x) <= 5
    or type(x) is dict
    and len(x) <= 5
    and dict_safe
):
    if isinstance(x, dict):
        headers_frames = []
        for k, v in x.items():
            _header, _frames = serialize(
                v, serializers=serializers, on_error=on_error, context=context
            )
            _header["key"] = k
            headers_frames.append((_header, _frames))
    else:
        headers_frames = [
            serialize(
                obj, serializers=serializers, on_error=on_error, context=context
            )
            for obj in x
        ]

    frames = []
    lengths = []
    for _header, _frames in headers_frames:
        frames.extend(_frames)
        length = len(_frames)
        lengths.append(length)

    headers = [obj[0] for obj in headers_frames]
    headers = {
        "sub-headers": headers,
        "is-collection": True,
        "frame-lengths": lengths,
        "type-serialized": type(x).__name__,
    }
    return headers, frames
```
Currently our logic though is "If you specify any compression anywhere, don't do anything". But really we want to handle this on a frame-by-frame basis. Otherwise the first time someone sends a combined result of [numpy_array, cupy_array], things will get messy.
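A toy illustration of the problem (the header shapes follow the collection code above; the serializer and compression values are made up for a hypothetical [numpy_array, cupy_array] pair): the mixed collection's top-level header carries no "compression" key at all, so the framing code can't make a per-frame decision.

```python
# Hypothetical sub-headers for [numpy_array, cupy_array]:
sub_headers = [
    {"serializer": "dask", "compression": ["lz4"]},  # host frame: compressible
    {"serializer": "cuda", "compression": [None]},   # device frame: leave alone
]

# The collection header buries them one level down, as serialize() does:
top_header = {
    "sub-headers": sub_headers,
    "is-collection": True,
    "frame-lengths": [1, 1],
    "type-serialized": "list",
}

# Per-frame compression info exists, but not where the framing code looks.
assert "compression" not in top_header
```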
I think I'm shaving a yak here, but it could be that the right thing to do here is to rework how we handle compression on a frame by frame basis. Submitting PR now.
Well, the other thing I was thinking about was checking head for "cuda" not in "serializer", which would also be easy and avoid this.
Sorry, I missed the discussion as I was sorting through the same issues of collections. Looking at #3586 now.
Closing in favor of #3586.
PR fixes #3580.

Previously, during merge_frames, dask would call ensure_bytes on a list of objects. This would trigger a conversion to host memory and would invalidate deserialization logic downstream. This PR fixes the issue by checking whether the list is composed of cuda objects; if it is, we simply extend the output list.

Update: PR adds functionality which allows frames to not be split during serialization. Additionally, communication with UCX will no longer split frames.
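A minimal sketch of the merge_frames behavior described above. The `__cuda_array_interface__` attribute check is an assumption standing in for the PR's actual device-object test, and the host path is simplified to a single join:

```python
def merge_subframes(subframes):
    """Sketch: pass device frames through untouched; coerce host frames.

    Hypothetical simplification of the fix described in the PR text:
    calling bytes() on device objects would copy them to host memory and
    break downstream deserialization, so when every frame is a cuda
    object we just extend the output list instead.
    """
    if subframes and all(
        hasattr(f, "__cuda_array_interface__") for f in subframes
    ):
        out = []
        out.extend(subframes)  # device objects stay as-is
        return out
    # Host path: merge into a single bytes blob, as ensure_bytes would.
    return [b"".join(bytes(f) for f in subframes)]
```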