
fix those damn sporadically false positive usage tests #342

Closed
wants to merge 1 commit

Conversation

Dieterbe
Contributor

After lots of experimentation I figured out that the mock clock sometimes
simply doesn't trigger properly, so that in Usage.Report() sometimes
nothing is received on the tick channel, despite advancing the fake
clock by more than strictly necessary (I tried with an extra ms),
despite calling runtime.Gosched() ourselves, and despite sleeping
20 ms with the real clock.
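
For illustration, here is a minimal sketch of the kind of pattern that
proved flaky, assuming the github.com/benbjohnson/clock package; the
actual test code in this repo is more involved and the names below are
only illustrative:

```go
// A sketch of the flaky pattern, assuming github.com/benbjohnson/clock.
// Advancing the mock clock does not guarantee that the goroutine feeding
// the ticker channel has actually run, so the receive below can miss.
package usage_test

import (
	"runtime"
	"testing"
	"time"

	"github.com/benbjohnson/clock"
)

func TestTickSometimesMissing(t *testing.T) {
	mock := clock.NewMock()
	ticker := mock.Ticker(time.Second)
	defer ticker.Stop()

	mock.Add(time.Second + time.Millisecond) // advance past the interval, plus an extra ms
	runtime.Gosched()                        // yield, hoping the ticker goroutine gets scheduled

	select {
	case <-ticker.C:
		// usually the tick arrives...
	case <-time.After(20 * time.Millisecond):
		// ...but depending on goroutine scheduling it sometimes doesn't
		t.Fatal("no tick received despite advancing the mock clock")
	}
}
```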

The author of the clock package confirms that due to the way the
runtime schedules goroutines, there's no way around the fake clock
sometimes not working. See
https://gophers.slack.com/archives/general/p1462238960008162

Furthermore, the discussion with the Go developers at golang/go#8869
makes it clear that we're unlikely to have a fakeable clock anytime
soon.

Ben Johnson (the clock author) suggests in the above-mentioned gophers
thread that we could mock out the tick function and pass in a different
function in tests. However, that changes so much of the time logic
that it becomes pointless to do any time-based testing in this design.
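
For clarity, a rough sketch of what that suggestion amounts to; the
Usage type and its fields here are hypothetical stand-ins, not the
actual code in this repo:

```go
package usage

import "time"

// Usage is a simplified, hypothetical stand-in; only the injected tick
// source matters for this illustration.
type Usage struct {
	// tick delivers report triggers. Production code would wire it to a
	// real time.Ticker's channel; a test would feed it by hand instead.
	tick <-chan time.Time
}

// Report consumes ticks until the channel is closed, reporting once per tick.
func (u *Usage) Report() {
	for range u.tick {
		// ... compute and emit the usage metrics for this interval ...
	}
}
```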

We could also switch to simply testing the basics, without anything
time-based, since the timing code is pretty simple.

However, before we go that route, I wanted to try working with the real
clock: basically run the usage reporting in real time, but scaled down
to millisecond level instead of second level, so it finishes fairly
quickly.

So now some semantics are changing a bit:

  • we allow up to <period> ms for the usage report to be in the state we
    need it
  • the test now works with steps, which don't happen at exact predictable
    timestamps; rather, they have to happen within a timeframe (see the
    sketch after this list)
  • checking the timestamps would have gotten more complicated, so I just
    removed those checks. It's easy to reason that if the updates come
    within the allotted times, then the timestamps should also be set
    correctly.
  • there's no serious need to explicitly pass around interval settings
    anymore; we just use 1 everywhere.
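
As a sketch of what "within a timeframe" looks like in test code (the
helper name and polling interval here are mine, not necessarily what
this PR ends up with):

```go
package usage_test

import (
	"testing"
	"time"
)

// waitFor polls cond until it returns true or the timeframe expires.
// Instead of asserting state at an exact timestamp, the test only
// requires the state to be reached somewhere within the window.
func waitFor(t *testing.T, timeframe time.Duration, cond func() bool) {
	deadline := time.Now().Add(timeframe)
	for time.Now().Before(deadline) {
		if cond() {
			return
		}
		time.Sleep(time.Millisecond)
	}
	t.Fatal("condition not reached within the allotted timeframe")
}
```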

If it turns out that this approach also triggers false positives
(for example due to CircleCI machines being maxed out on CPU and the
reporting being unable to happen within the needed time), then we can
address that as needed and still switch to the simpler approach.
But that seems very unlikely. This should work.

@Dieterbe
Contributor Author

Dieterbe commented Oct 13, 2016

For the record, I just ran `i=1; while echo $i && go test; do i=$((i+1)); sleep 0.${RANDOM}s; done` until 310 and then canceled it. With master it would always fail before reaching 150, often before 20.

@woodsaj
Member

woodsaj commented Oct 13, 2016

I think you are going about testing this the wrong way. I would:

  1. move all of the inline functions to actual functions.
  2. create unit tests for the custom ticker.
  3. split "Report()" into a "Run()" and a "Report()" function (see the sketch below). Run() just runs the ticker, calling "Report()" on every tick until it is stopped.
  4. In your unit tests you then no longer need to worry about timing. You can just create a new "usage" instance, "u", and immediately stop it (ending the "Run" goroutine). Then call u.Add() to add as many values as you want and call u.Report() to push the usage metrics into aggmetrics, where you can then read and validate them.
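
Roughly what that split could look like; this is a sketch with made-up
field names and a simplified Report(), not the actual code in this repo:

```go
package usage

import (
	"sync"
	"time"
)

// Usage is a simplified stand-in that only shows the Run()/Report() split;
// the real aggregation and the push into aggmetrics are elided.
type Usage struct {
	sync.Mutex
	counts map[string]int64
	stop   chan struct{}
}

func New() *Usage {
	return &Usage{
		counts: make(map[string]int64),
		stop:   make(chan struct{}),
	}
}

// Add records a data point; tests can call this without any ticker running.
func (u *Usage) Add(key string) {
	u.Lock()
	u.counts[key]++
	u.Unlock()
}

// Report flushes the accumulated counts. In the real code this is where
// the usage metrics would be pushed into aggmetrics; here we just return them.
func (u *Usage) Report() map[string]int64 {
	u.Lock()
	defer u.Unlock()
	out := u.counts
	u.counts = make(map[string]int64)
	return out
}

// Run calls Report on every tick until Stop is called. Unit tests can stop
// it immediately (or never start it) and drive Report() by hand.
func (u *Usage) Run(interval time.Duration) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			u.Report()
		case <-u.stop:
			return
		}
	}
}

// Stop terminates Run.
func (u *Usage) Stop() {
	close(u.stop)
}
```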

@woodsaj woodsaj assigned Dieterbe and unassigned woodsaj Oct 14, 2016
Dieterbe added a commit that referenced this pull request Jan 2, 2017
Dieterbe added a commit that referenced this pull request Jan 2, 2017
Dieterbe added a commit that referenced this pull request Jan 3, 2017
Dieterbe added a commit that referenced this pull request Jan 3, 2017
@Dieterbe
Contributor Author

#666

@Dieterbe Dieterbe closed this Sep 30, 2017
@Dieterbe Dieterbe deleted the fix-usage-tests branch January 2, 2018 16:01
@imiric imiric mentioned this pull request Jul 7, 2020