[Java] Add job name to GCS custom audit info #31316
Merged
This PR implements feature request #31299 for the Java SDK.
We also try to make the Java user-agent consistent with the Python one (see #31300) by inserting "apache-beam/VERSION".
Some more notes:
In the Java SDK, GCSIO (from com.google.cloud.hadoop.gcsio) is used as the access layer for communicating with GCS. Internally, it depends on the storage client (from com.google.api.services.storage) to assemble and handle HTTP requests.
GCSIO can either create a new Google Cloud Storage client during initialization or use an existing one; Beam takes the latter approach. Ideally, Beam would use the first code path, so that all GCS connection details are left to GCSIO.
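Because Beam hands GCSIO a pre-built client, Beam itself has to decide which extra headers go on each request. The sketch below is not the code in this PR; it only illustrates the two values discussed here using plain Java. The class and method names are hypothetical, the "x-goog-custom-audit-" prefix and the "job" key are assumptions based on how GCS custom audit headers are commonly described, and appending the Beam token at the end of the user-agent is likewise an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Illustrative only: none of these names come from Beam or GCSIO. */
public class AuditHeaderSketch {
    // Assumed header prefix for GCS custom audit entries.
    static final String AUDIT_HEADER_PREFIX = "x-goog-custom-audit-";

    /** Adds an "apache-beam/VERSION" token to an existing user-agent string. */
    static String withBeamToken(String baseUserAgent, String beamVersion) {
        String token = "apache-beam/" + beamVersion;
        if (baseUserAgent == null || baseUserAgent.isEmpty()) {
            return token;
        }
        // Assumption: the token is appended; the real position may differ.
        return baseUserAgent + " " + token;
    }

    /** Builds the extra headers a pre-built client could attach to each request. */
    static Map<String, String> auditHeaders(String jobName) {
        Map<String, String> headers = new LinkedHashMap<>();
        if (jobName != null && !jobName.isEmpty()) {
            // Assumption: the job name is stored under the key "job".
            headers.put(AUDIT_HEADER_PREFIX + "job", jobName);
        }
        return headers;
    }

    public static void main(String[] args) {
        // Placeholder values, not real versions or job names.
        System.out.println(withBeamToken("example-agent/1.0", "0.0.0"));
        System.out.println(auditHeaders("example-job"));
    }
}
```

In a real pipeline these values would be applied through whatever request initializer the storage client is built with, which is the part that would move into GCSIO if Beam switched to the first code path.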
Follow-up tasks