Add more diagnostics for compiler performance analysis #5760

retronym · 2017-03-07T03:19:05Z

-Yprofile will output the difference between snapshots of GC and CPU
snapshot data, sourced from platform MBeans
This output can be sent to a file with -Yprofile-destination <filename>
By default, output is to console
Use -Yprofile-external-tool will generate a call to the static method
before and after around each of the the given phases. This can be
used to communicate with an external profiler, such as YourKit, to generate a
profile for a subset of the compiler.
-Yprofile-run-gc runs the GC after each phase to help more accurately
attribute retained heap to a given phase.

- `-Yprofile` will output the difference between snapshots of GC and CPU snapshot data, sourced from platform MBeans - This output can be sent to a file with `-Yprofile-destination <filename>` By default, output is to console - Use `-Yprofile-external-tool` will generate a call to the static method `before` and `after` around each of the the given phases. This can be used to communicate with an external profiler, such as YourKit, to generate a profile for a subset of the compiler. - `-Yprofile-run-gc` runs the GC after each phase to help more accurately attribute retained heap to a given phase. Co-Authored by: Jason Zaugg <[email protected]>

This is handy when collecting samples in YourKit. The actual result of this class is just an approximation, we rely on JMH in scala/compiler-benchmark for more rigourous statistics. ``` ./build/quick/bin/scala -J-Dscala.benchmark.iterations=2000 scala.tools.nsc.MainBench sandbox/test.scala ```

retronym · 2017-03-07T03:23:43Z

Rebase of #5758

lrytz

I started reviewing this today and got sucked into getting per-phase profiling with JFR. This is my WIP branch: lrytz@9d16bb3

Run the compiler with -Yprofile-enabled -J-XX:+UnlockCommercialFeatures
In JMC, for the compiler process, open the "MBean Server" console and go to the "Triggers" tab
Import the config https://gist.github.com/lrytz/aa0e72e00a3f1fadee92d62f9708dfbf, adjust hard coded paths
enable the triggers, recording starts and stops automatically during typer

I haven't figured out how to do that from the command line, unfortunately..

lrytz · 2017-03-10T13:12:18Z

src/compiler/scala/tools/nsc/settings/ScalaSettings.scala

+    withPostSetHook( _ => YprofileEnabled.value = true )
+  val YprofileExternalTool = PhasesSetting("-Yprofile-external-tool", "Enable profiling for a phase using an external tool hook. Generally only useful for a single phase", "typer").
+    withPostSetHook( _ => YprofileEnabled.value = true )
+  val YprofileRunGcBetweenPhases = PhasesSetting("-Yprofile-run-gc", "Run a GC between phases - this allows heap size to be accurate at the expense of more time. Specify a list of phases, or *", "_").


both _ and * are not valid, should use all instead

lrytz · 2017-03-10T13:24:24Z

src/compiler/scala/tools/nsc/profile/Profiler.scala

+    s2 - s1
+  }
+  private def doGC(): Unit = {
+    System.gc()


Javadoc says: "When control returns from the method call, the Java Virtual Machine has made a best effort to reclaim space from all discarded objects", I guess that's still true for concurrent GC? Anyway, we just have to be aware that System.gc is probably not the most reliable tool.

the GC before/after was intended to provide some indication of the ratio of allocation vs retained sizes. Generally the information that this tool provide is indicative, and not 100 % reproducible, but with sufficient iteration and post processing of the data can provide a high confidence that a particular PR affected a certain metric

for the record - it also has inaccuracies related to finalization and object graphs that require multiple GC/finalization cycles to reclaim, and also with soft references

SethTisue · 2017-03-21T22:00:26Z

tentatively retargeted for 2.12.3, change it back if that's wrong

mkeskells · 2017-03-30T23:16:02Z

Hi @retronym we have updated the base profiler to support capturing of stats from background threads (needed for #5815), like thread CPU time and allocations

This also includes better output control and formatting in https://github.com/rorygraves/scalac_perf/tree/2.12.x_profile2

is it best to take these additonal commits to a new PR or adjust the base of this one?

retronym · 2017-07-03T23:29:19Z

I merged the original PR instead. I'll salvage any useful changes in this PR in a new one.

scala-jenkins added this to the 2.12.2 milestone Mar 7, 2017

mkeskells and others added 3 commits March 7, 2017 13:19

Update SBT parser for new command line options

c858493

retronym force-pushed the topic/perf branch from c2e156c to 48ce9cd Compare March 7, 2017 03:21

retronym mentioned this pull request Mar 7, 2017

Report per-phase JVM statistics (e.g cpu time, user time, allocated bytes) #5758

Closed

lrytz reviewed Mar 10, 2017

View reviewed changes

SethTisue modified the milestones: 2.12.3, 2.12.2 Mar 21, 2017

adriaanm mentioned this pull request Mar 27, 2017

Compiler performance scala/scala-dev#322

Closed

7 tasks

This was referenced Mar 30, 2017

partial parallelisation of genbcode, and code that it touches #5815

Closed

per run immutable settings #5825

Closed

mkeskells mentioned this pull request Apr 12, 2017

optimise completeSilentlyAndCheckErroneous #5832

Merged

adriaanm added the performance the need for speed. usually compiler performance, sometimes runtime performance. label May 25, 2017

retronym closed this Jul 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add more diagnostics for compiler performance analysis #5760

Add more diagnostics for compiler performance analysis #5760

Uh oh!

retronym commented Mar 7, 2017 •

edited

Loading

Uh oh!

retronym commented Mar 7, 2017

Uh oh!

lrytz left a comment

Uh oh!

lrytz Mar 10, 2017

Uh oh!

lrytz Mar 10, 2017

Uh oh!

mkeskells Mar 30, 2017

Uh oh!

mkeskells Mar 30, 2017

Uh oh!

SethTisue commented Mar 21, 2017

Uh oh!

mkeskells commented Mar 30, 2017

Uh oh!

retronym commented Jul 3, 2017

Uh oh!

Uh oh!

Add more diagnostics for compiler performance analysis #5760

Add more diagnostics for compiler performance analysis #5760

Uh oh!

Conversation

retronym commented Mar 7, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

retronym commented Mar 7, 2017

Uh oh!

lrytz left a comment

Choose a reason for hiding this comment

Uh oh!

lrytz Mar 10, 2017

Choose a reason for hiding this comment

Uh oh!

lrytz Mar 10, 2017

Choose a reason for hiding this comment

Uh oh!

mkeskells Mar 30, 2017

Choose a reason for hiding this comment

Uh oh!

mkeskells Mar 30, 2017

Choose a reason for hiding this comment

Uh oh!

SethTisue commented Mar 21, 2017

Uh oh!

mkeskells commented Mar 30, 2017

Uh oh!

retronym commented Jul 3, 2017

Uh oh!

Uh oh!

retronym commented Mar 7, 2017 •

edited

Loading