Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rose_bunch: on failure, print bunch err to stderr #2126

Conversation

benfitzpatrick
Copy link
Contributor

This is a small change to print out the individual bunch stderr to the job.err on failure so that debugging is easier.

@matthewrmshin matthewrmshin added this to the next-release milestone Oct 25, 2017
@oliver-sanders
Copy link
Member

The test failures are unrelated to this change and will be fixed by #2128 and #2123.

@benfitzpatrick
Copy link
Contributor Author

Great, I did wonder!

@arjclark
Copy link
Contributor

arjclark commented Nov 2, 2017

@benfitzpatrick - is this going to result in the entire stderr contents from the bunch-.err file getting dumped in the stderr file?

@benfitzpatrick
Copy link
Contributor Author

Yep, just for the ones that failed

@arjclark
Copy link
Contributor

arjclark commented Nov 2, 2017

Yep, just for the ones that failed

Not massively ideal then as the sort of things that should be being run through rose bunch should be variants on a command so, for the general case, one fails: all fail. I'm not sure duplicating the entire .stderr file is the way to go - particularly if you consider something like the reconfiguration (original use case) where it chucks out hundreds of lines of warnings to stderr during even a successful run. It may be better to resurrect item 2 of #1876 instead.

@benfitzpatrick
Copy link
Contributor Author

A system's pollution of their output and error is their own problem!

Better to see too much than nothing at all - and this is mainly targeted at users of the cylc gui rather than Rose Bush.

@arjclark
Copy link
Contributor

arjclark commented Nov 2, 2017

Better to see too much than nothing at all

So you don't see creation of redundant files as a bad thing to do? No point in having the bunch.err file if you're going to shove it in the job.err.

@benfitzpatrick
Copy link
Contributor Author

Given that neither of us are in the Rose team any more, our opinions are partly moot.

The point of having the bunch.err would be in your situation marked above - drilling down to specifics, particularly if there is a lot of error. Also in your situation above, if the system produces lots of warnings you may want to look at the bunch.err even when the job does not fail. I don't think it's redundant at all.

At the moment everyone does a two-step dance - look at the job.err to see what failed, then look at one of the bunch errors. If they are doing it in the cylc gui, they have to load up Rose Bush to get the bunch error. I can actually find out what happened in my failed rose_bunch tasks from the cylc gui! Or in one step on Rose Bush!

@arjclark
Copy link
Contributor

arjclark commented Nov 2, 2017

Don't think we're going to agree on this so I'll leave this one to @matthewrmshin to think about.

I wonder if a COMBINE_ERR_ON_FAIL type option might be the middle ground for turning on/off this behaviour though?

@matthewrmshin
Copy link
Member

🤔

@benfitzpatrick
Copy link
Contributor Author

This is a hard option to name and describe correctly, given that I think this new behaviour should be the default (I reckon everyone wants it except you Andy :p). disable-combined-stderr-on-failure (default false of course) is the best name I can come up with.

If we can't name it correctly, is it worth having at all? Just pass this in as-is?

@matthewrmshin
Copy link
Member

Are we going for the old or new behaviour as the default?

@benfitzpatrick
Copy link
Contributor Author

new new new

@arjclark
Copy link
Contributor

Discussed offline with @benfitzpatrick and suggested some alternative names for the setting. @benfitzpatrick to follow up with other major users of rose bunch and double check they'd be ok with a change in behaviour.

@arjclark
Copy link
Contributor

arjclark commented Jan 2, 2018

@benfitzpatrick - does the extra log files setting mentioned here: cylc/cylc-flow#2515 make this change redundant?

@benfitzpatrick
Copy link
Contributor Author

No, although it would be more useful if it supported wildcards...

@arjclark
Copy link
Contributor

Might it be worth extending the cylc functionality then instead?

@matthewrmshin matthewrmshin modified the milestones: next-release, soon Jan 31, 2018
@matthewrmshin
Copy link
Member

Now that it is spring, I think it is time for a spring clean. Please raise a new pull request when we are ready to have another crack at this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants