-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
executor: revert test case introduced in #18819 to solve the data race on release-4.0 #19722
executor: revert test case introduced in #18819 to solve the data race on release-4.0 #19722
Conversation
No release note, Please follow https://github.com/pingcap/community/blob/master/contributors/release-note-checker.md |
/cc @lzmhhh123 |
/cc @zz-jason |
/run-all-tests |
/cc @SunRunAway |
/run-all-tests |
What's the root cause of the CI broken? |
I am currently not sure, I still need more time to investigate about this (hugely because of PointGet under parallel execution), but it can be obvious that this PR causes the broken CI, and in order not to block PRs to release-4.0, I proposed this revert PR. |
another question is, why the CI in the master branch isn't broken 😂 |
It might be fixed, but the fix is not ported to 4.0. I will look into it. |
/run-all-tests |
1 similar comment
/run-all-tests |
- implement Execute using ExecuteStmt - Execute is only for internal usage and accept only one statement - clean up the old execute function
After investigation, the cause of this data race is similar with this one #16817 (comment) , both are due to point get at the outer side of look up merge join, which after execution, enters Line 144 in 104d372
It does not cause panic, just because it only checks if The reason it does not happen on master is because @tiancaiamao moved this check from I do not have a fix for this right now, I plan to port this PR to 4.0-release to see if it could solve the problem, however this PR changes too much code, it would be splendid if @tiancaiamao could help with this. |
01b5098
to
8368bee
Compare
@zz-jason After discussion with @tiancaiamao , we decided to temporarily comment out the offending test case, once we solved the race condition, we will add it back. |
/run-unit-test |
OK. Does it only affect the unit tests? |
/rebuild |
As for now, only this unit test will cause the data race, but this is indeed a bug that need to be fixed no matter what, and it is already fixed on master, just not yet ported to release-4.0. The port (or cherry-pick) is complicated, and need more time, therefore we would like to temporarily fix it by revert the test. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can use
if israce.RaceEnabled {
c.Skip("skip race test")
}
instead of comment out the cases.
Seems we only need to fix the test? Just confirm, is there a potential data race issue when users file queries? |
Yes, it is possible, therefore we need to fix it. |
@zz-jason @SunRunAway let's merge this revert. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/merge |
/run-all-tests |
@ichn-hu merge failed. |
/run-tics-test |
Reverts #18819 , partially solved #19700
This PR causes a data race that blocks all subsequent PRs, therefore it should consider a revert.
See: https://github.com/pingcap/tidb/pulls?q=is%3Apr+base%3Arelease-4.0+is%3Aclosed+sort%3Aupdated-desc
starting from #18819 , all CIs are broken.
Release Note