-
-
Notifications
You must be signed in to change notification settings - Fork 6.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI/azure: disable flaky tests in the msys2 builds #5954
Conversation
@mback2k don't think we can just as well disable them until we have a better idea on how to make them somewhat reliable? |
No, this is not correct. They don't fail every time currently due to the flakiness, but something broke the RTSP test 571 while dropping WinSock 1 support. I wanted to investigate that this week(end), but was occupied until today. |
Isn't it strange that only msys2 builds fail that test if its winsock related? |
Yes, indeed that is very strange. That is why I triggered some CI re-runs at the beginning of the week to identify the problematic commit via some manual CI bisecting. Turns out this is the last commit that worked according to the CI history: This is currently the first commit that failed according to the CI history: After that test 571 never seems to have worked again on the msys2 based CI builds. I am guess that the following commit is problematic, but since it was pushed with the commit above there was no dedicated CI run yet: 728f8d3 I will trigger one now to see if this guess is true. |
I manually bisected with my local msys2 and found commit 728f8d3 to be the root cause of test 571 failing with msys2. The CI run for that commit also confirms this: https://github.com/curl/curl/runs/1103146941 |
This is an example output of the failing test run and the shown diff is consistent over multiple test runs:
|
That makes me wonder why connection reuse would be problematic there and nowhere else? Perhaps the verbose output differences from a working vs a non-working case says something? If not, sprinkle |
The first difference is this only showing for the working case:
And the trailer of the stderr output logfile is basically completely different after that. The failing case also shows:
|
I believe the key is the |
Seems like there is really something different with the connection. This is the diff of the RTSP test server logs (without timestamps):
|
And here is another set of complete log data and corresponding packet dumps stored as dump.pcap in the files: |
I don't think you get the answer to this by looking at PCAPs. The answer most likely lies within |
I have the feeling that the socket is readable, because there is still data pending to be read. I will try to add some more debug output and check if that is the case. |
Yeah, and if that really is the case, the test is flawed... |
Since I don't know how fast I can further investigate the issue this week, you may also go ahead and just ignore the result of test 571 for the msys2 builds by adding |
Good idea! |
af92abe
to
ab94f8b
Compare
|
ab94f8b
to
14bf6b9
Compare
it looks better. Out of the 4 msys2 builds that often fails, now only two failed. One on test 190 and one on test 1527... |
@mback2k I presume you're 👍 with merging this as at least a step along the way? |
👍 with me, I would just put the |
They fail almost every CI job.