Quick-Reference Guide to Optimization with Intel Compilers version 12

For IA-32 processors and Intel 64 processors


Application Performance
A Step-by-Step Approach to Application Tuning with Intel Compilers

Before you begin performance tuning, you may want to check the correctness of your application by building it without optimization, using /Od (-O0). In this compiler version, all optimization levels assume support for the SSE2 instruction set by default. To run on older IA-32 processors such as the Intel Pentium III processor, the option /arch:IA32 (Windows*) or -mia32 (Linux*) must be added. A command-line sketch of the complete sequence follows the numbered steps.

1. Use the general optimization options (Windows /O1, /O2 or /O3; Linux and Mac OS* X -O1, -O2 or -O3) and determine which one works best for your application by measuring performance with each. Most users should start at /O2 (-O2), the default, before trying more advanced optimizations. Next, try /O3 (-O3) for loop-intensive applications. These options are available for both Intel and non-Intel microprocessors, but they may perform more optimizations for Intel microprocessors than they perform for non-Intel microprocessors.

2. Fine-tune performance to target IA-32 and Intel 64-based systems with processor-specific options. Examples are /QxSSE4.2 (-xsse4.2) for the Intel Core processor family, e.g. the Intel Core i7 processor, and /arch:SSE3 (-msse3) for compatible, non-Intel processors that support at least the SSE3 instruction set. Alternatively, you can use /QxHOST (-xhost), which selects the most advanced instruction set available on the processor on which you compile. This option is available for both Intel and non-Intel microprocessors, but it may perform more optimizations for Intel microprocessors than it performs for non-Intel microprocessors. For a more extensive list of options that optimize for specific processors or instruction sets, see the table "Recommended Processor-Specific Optimization Options".

3. Add interprocedural optimization (IPO), /Qipo (-ipo), and/or profile-guided optimization (PGO), /Qprof-gen and /Qprof-use (-prof-gen and -prof-use), then measure performance again to determine whether your application benefits from one or both of them.

4. Optimize your application for vector and parallel execution on multi-threaded, multi-core and multi-processor systems using: advice from the new Guided Auto-Parallelism (GAP) feature, /Qguide (-guide); the Intel Cilk Plus language extensions for C/C++; the parallel performance options /Qparallel (-parallel) or /Qopenmp (-openmp); or the Intel Performance Libraries included with the product. These optimization steps are applicable to both Intel and non-Intel microprocessors, but may result in a greater performance gain on Intel microprocessors than on non-Intel microprocessors.

5. Use Intel VTune Amplifier XE to help you identify serial and parallel performance "hotspots", so that you know which specific parts of your application could benefit from further tuning. Use Intel Inspector XE to reduce the time to market for threaded applications by diagnosing memory and threading errors and speeding up the development process. These products cannot be used on non-Intel microprocessors.
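As an illustration only, the listing below sketches steps 1 through 4 on a Linux command line for a hypothetical single-file application; the file name myapp.c, the icc driver invocation and the particular options chosen are assumptions for the example, not recommendations from this guide.

    /* Tuning workflow sketch (Linux*), assuming one source file, myapp.c:
       0. Correctness check, no optimization:  icc -O0 -o myapp myapp.c
       1. Compare general optimization levels: icc -O2 -o myapp myapp.c
                                               icc -O3 -o myapp myapp.c
       2. Add a processor-specific option:     icc -O3 -xSSE4.2 -o myapp myapp.c
       3. Try IPO and/or PGO:                  icc -O3 -ipo -o myapp myapp.c
       4. Ask GAP for parallelization advice:  icc -O3 -parallel -guide -c myapp.c
       Measure performance after each change and keep only what helps.         */

    #include <stdio.h>

    int main(void) {
        double sum = 0.0;       /* placeholder workload standing in for the real application */
        int i;
        for (i = 1; i <= 1000000; i++)
            sum += 1.0 / i;
        printf("checksum = %f\n", sum);
        return 0;
    }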

Please consult the main product documentation for more details.


General Optimization Options


These options are available for both Intel and non-Intel microprocessors, but they may result in more optimizations for Intel microprocessors than for non-Intel microprocessors.

Windows*                     Linux* / Mac OS* X
/Od                          -O0
    No optimization. Used during the early stages of application development and debugging. Use a higher setting when the application is working correctly.
/O1                          -O1
    Optimize for size. Omits optimizations that tend to increase object size. Creates the smallest optimized code in most cases. This option is useful in many large server/database applications where memory paging due to larger code size is an issue.
/O2                          -O2
    Maximize speed. Default setting. Enables many optimizations, including vectorization. Creates faster code than /O1 (-O1) in most cases.
/O3                          -O3
    Enables /O2 (-O2) optimizations plus more aggressive loop and memory-access optimizations, such as scalar replacement, loop unrolling, code replication to eliminate branches, loop blocking to allow more efficient use of cache, and additional data prefetching. The /O3 (-O3) option is particularly recommended for applications that have loops that do many floating-point calculations or process large data sets. These aggressive optimizations may occasionally slow down other types of applications compared to /O2 (-O2).
/Qopt-report[:n]             -opt-report [n]
    Generates an optimization report directed to stderr. n specifies the level of detail, from 0 (no report) to 3 (maximum detail). Default is 2.
/Qopt-report-phase:name      -opt-report-phase=name
    Optimization reports are generated for phase name. The option can be used multiple times in the same compilation to get output from multiple phases. Some commonly used name arguments:
        all       All possible optimization reports for all phases (default)
        ipo_inl   Inlining report from the Interprocedural Optimizer
        hlo       High-Level Optimizer (includes loop and memory optimizations)
        hpo       High-Performance Optimizer (includes vectorizer and parallelizer)
        pgo       Profile-Guided Optimizer
/Qopt-report-help            -opt-report-help
    Displays all possible values of name for /Qopt-report-phase (-opt-report-phase) above. No compilation is performed.
/Qopt-report-routine:string  -opt-report-routine=string
    Generates reports only for functions or subroutines whose names contain string. By default, reports are generated for all functions and subroutines.
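For instance, a report from the High Performance Optimizer phase might be requested as in the sketch below; the file name loops.c and the choice of phase and detail level are assumptions for illustration.

    /* Request a detailed vectorizer/parallelizer (hpo) report at -O3:
         Linux/Mac OS X:  icc -O3 -opt-report 3 -opt-report-phase=hpo -c loops.c
         Windows:         icl /O3 /Qopt-report:3 /Qopt-report-phase:hpo /c loops.c
       The report is written to stderr.                                          */

    void scale(float *a, const float *b, float s, int n) {
        int i;
        for (i = 0; i < n; i++)        /* a loop the hpo report will comment on */
            a[i] = s * b[i];
    }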

Parallel Performance
Options that use OpenMP* or auto-parallelization are available for both Intel and non-Intel microprocessors, but these options may result in additional optimizations on Intel microprocessors that do not occur on non-Intel microprocessors.

Windows*                     Linux* / Mac OS* X
/Qopenmp                     -openmp
    Causes multi-threaded code to be generated when OpenMP directives are present. May require an increased stack size.
/Qparallel                   -parallel
    The auto-parallelizer detects simply structured loops that may be safely executed in parallel, including loops implied by Intel Cilk Plus array notation, and automatically generates multi-threaded code for these loops.
/Qpar-report[:n]             -par-report[n]
    Controls the auto-parallelizer's diagnostic level. n specifies the level of detail, from 0 (no report) to 3 (maximum detail). Default is 0.
/Qpar-threshold[:n]          -par-threshold[n]
    Sets a threshold for the auto-parallelization of loops based on the likelihood of a performance benefit. n = 0 to 100; default 100.
        0     Parallelize loops regardless of computation work volume.
        100   Parallelize loops only if a performance benefit is highly likely.
    Must be used in conjunction with /Qparallel (-parallel).
/Qguide[:n]                  -guide[=n]
    Guided Auto-Parallelization. Causes the compiler to suggest ways to help loops to vectorize or auto-parallelize, without producing any objects or executables. Auto-parallelization advice is given only if the option -parallel (Linux or Mac OS X) or /Qparallel (Windows) is also specified. n is an optional value from 1 to 4 specifying increasing levels of guidance to be provided, level 4 being the most advanced and aggressive. If n is omitted, the default is 4.
/Qopt-matmul[-]              -[no-]opt-matmul
    Enables [disables] a compiler-generated Matrix Multiply (matmul) library call by identifying matrix multiplication loop nests, if any, and replacing them with a matmul library call for improved performance. This option is enabled by default if options /O3 (-O3) and /Qparallel (-parallel) are specified. It has no effect unless option /O2 (-O2) or higher is set.
/Qcilk-serialize             -cilk-serialize
    Causes serialization of code containing Intel Cilk Plus language extensions, meaning that the compiler runs such code as a serial C/C++ program. This option forces inclusion of a special header file (cilk_stubs.h) containing preprocessor macros that make the Intel Cilk Plus keywords invisible to the compiler. Serialization and all Intel Cilk Plus keywords are fully described in the "Using Intel Cilk Plus" section of the user and reference guide.
/Qcoarray:shared             -coarray=shared
    Enables coarrays from the Fortran 2008 standard on shared-memory systems (Fortran only). See the compiler reference guide for more options and detail. This option is available for both Intel and non-Intel microprocessors but it may result in more optimizations for Intel microprocessors than for non-Intel microprocessors.
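As a minimal sketch of the first two options above, the loop below can be threaded either through its OpenMP directive or by the auto-parallelizer; the function name, file name and exact command lines are assumptions for illustration.

    /* OpenMP build:          icc -openmp -c saxpy.c
                              (Windows: icl /Qopenmp /c saxpy.c)
       Auto-parallelization:  icc -parallel -par-report2 -c saxpy.c
                              (Windows: icl /Qparallel /Qpar-report:2 /c saxpy.c)   */

    void saxpy(float *y, const float *x, float a, int n) {
        int i;
        #pragma omp parallel for        /* honored when the OpenMP option is given */
        for (i = 0; i < n; i++)
            y[i] = a * x[i] + y[i];
    }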

Recommended Processor-Specific Optimization Options


Windows*                     Linux* / Mac OS* X
/Qxtarget                    -xtarget
    Generates specialized code for any Intel processor that supports the instruction set specified by target. The executable will not run on non-Intel processors or on Intel processors that support only lower instruction sets. Possible values of target, from highest to lowest instruction set: AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2. Note: On Mac OS X, options SSE3 and SSE2 are not supported. This option enables additional optimizations that are not enabled by the /arch or -m options.
/arch:target                 -mtarget
    Generates specialized code for any Intel processor or compatible, non-Intel processor that supports the instruction set specified by target. Running the executable on an Intel processor or compatible, non-Intel processor that does not support the specified instruction set may result in a run-time error. Possible values of target: SSE4.1, SSSE3, SSE3, SSE2, IA32. Note: Option IA32 generates non-specialized, generic x86/x87 code. It is supported on IA-32 architecture only. On Mac OS X, options SSE3, SSE2 and IA32 are not supported.
/QxHOST                      -xhost
    Generates instruction sets up to the highest that is supported by the compilation host. On Intel processors, this corresponds to the most suitable /Qx (-x) option; on compatible, non-Intel processors, this corresponds to the most suitable of the /arch (-m) options IA32, SSE2 or SSE3. This option may result in additional optimizations for Intel microprocessors that are not performed for non-Intel microprocessors.
/Qaxtarget                   -axtarget
    May generate specialized code for any Intel processor that supports the instruction set specified by target, while also generating a default code path. Possible values of target: AVX, SSE4.2, SSE4.1, SSSE3, SSE3, SSE2. Multiple values, separated by commas, may be used to tune for additional Intel processors in the same executable, e.g. /QaxSSE4.2,SSE3. The default code path will run on any Intel or compatible, non-Intel processor that supports at least SSE2, but may be modified by also using a /Qx (-x) or /arch (-m) switch. For example, to generate a specialized code path optimized for the Intel Core processor family and a default code path optimized for Intel processors or compatible, non-Intel processors that support at least SSE3, use /QaxSSE4.2 /arch:SSE3 (-axsse4.2 -msse3 on Linux), as in the sketch below. At run time, the application automatically detects whether it is running on an Intel processor and, if so, selects the most appropriate code path. If an Intel processor is not detected, the default code path is selected. Note: On Mac OS X, options SSE3 and SSE2 are not supported. This option may result in additional optimizations for Intel microprocessors that are not performed for non-Intel microprocessors.

Please see the online article "Intel compiler options for SSE generation and processor-specific optimizations" for the latest recommendations on processor-specific optimization options. These options are described in greater detail in the Intel Compiler User and Reference Guides.
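The /QaxSSE4.2 /arch:SSE3 example above might be used as in the following sketch; the file and function names are assumptions for illustration.

    /* Build one binary with a specialized and a default code path:
         Linux:    icc -O2 -axSSE4.2 -msse3 -c mpath.c
         Windows:  icl /O2 /QaxSSE4.2 /arch:SSE3 /c mpath.c
       The SSE4.2-specialized path is selected at run time on Intel processors
       that support it; other processors use the default SSE3 path.             */

    float dot(const float *a, const float *b, int n) {
        float s = 0.0f;
        int i;
        for (i = 0; i < n; i++)   /* may be vectorized differently in each code path */
            s += a[i] * b[i];
        return s;
    }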

Interprocedural Optimization (IPO) and Profile-Guided Optimization (PGO) Options


Windows*                     Linux* / Mac OS* X
/Qip                         -ip
    Single-file interprocedural optimizations, including selective inlining, within the current source file.
/Qipo[n]                     -ipo[n]
    Permits inlining and other interprocedural optimizations among multiple source files. The optional argument n controls the maximum number of link-time compilations (or number of object files) spawned. Default for n is 0 (the compiler chooses). Caution: this option can in some cases significantly increase compile time and code size.
/Qipo-jobs[n]                -ipo-jobs[n]
    Specifies the number of commands (jobs) to be executed simultaneously during the link phase of interprocedural optimization (IPO). The default is 1 job.
/Ob2                         -finline-functions, -finline-level=2
    Enables function inlining within the current source file at the compiler's discretion. This option is enabled by default at /O2 and /O3 (-O2 and -O3). Caution: for large files, this option may sometimes significantly increase compile time and code size. It can be disabled by /Ob0 (-fno-inline-functions on Linux and Mac OS X).
/Qinline-factor:n            -finline-factor=n
    Scales the total and maximum sizes of functions that can be inlined. The default value of n is 100, i.e. 100%, or a scale factor of one.
/Qprof-gen                   -prof-gen
    Instruments a program for profile generation.
/Qprof-use                   -prof-use
    Enables the use of profiling information during optimization.
/Qprof-dir dir               -prof-dir dir
    Specifies a directory for the profiling output files, *.dyn and *.dpi.
/Qprofile-functions          -profile-functions
    Instruments functions so that a profile of the execution time spent in each function may be generated.
/Qprofile-loops              -profile-loops
    Instruments functions to generate a profile of each loop or loop nest. See "Profile Function or Loop Execution Time" in the main compiler documentation for additional detail and for how to view profiles.
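A typical PGO cycle uses /Qprof-gen (-prof-gen), one or more training runs, and then /Qprof-use (-prof-use), roughly as in the sketch below; the file name, profile directory and training input are assumptions for illustration.

    /* Three-step PGO cycle:
       1. Instrumented build:  icc -prof-gen -prof-dir ./prof -O2 -o app app.c
                               (Windows: icl /Qprof-gen /Qprof-dir prof /O2 app.c)
       2. Training run(s):     ./app 500000        -> writes *.dyn files into ./prof
       3. Feedback build:      icc -prof-use -prof-dir ./prof -O2 -o app app.c
                               (Windows: icl /Qprof-use /Qprof-dir prof /O2 app.c)    */

    #include <stdio.h>
    #include <stdlib.h>

    int main(int argc, char **argv) {
        long n = (argc > 1) ? atol(argv[1]) : 1000;   /* training input size */
        long hits = 0, i;
        for (i = 0; i < n; i++)
            if (i % 7 != 0)               /* branch whose bias the profile records */
                hits++;
        printf("%ld\n", hits);
        return 0;
    }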

Floating-Point Arithmetic Options


Windows*                     Linux* / Mac OS* X
/fp:name                     -fp-model name
    May enhance the consistency of floating-point results by restricting certain optimizations. Possible values of name:
        fast=[1|2]              Allows more aggressive optimizations at a slight cost in accuracy or consistency (fast=1 is the default). This may include some additional optimizations that are performed on Intel microprocessors but not on non-Intel microprocessors.
        precise                 Allows only value-safe optimizations on floating-point code.
        double/extended/source  Intermediate results are computed in double, extended or source precision. Implies precise unless overridden. The double and extended options are not available for the Intel Fortran compiler.
        except                  Enforces floating-point exception semantics.
        strict                  Enables both the precise and except options and does not assume the default floating-point environment.
    Recommendation: /fp:precise /fp:source (-fp-model precise -fp-model source) is the recommended form for the majority of situations where enhanced floating-point consistency and reproducibility are needed.
/Qftz[-]                     -ftz[-]
    When the main program or dll main is compiled with this option, denormals resulting from SSE instructions at run time are flushed to zero for the whole program (dll). The default is on, except at /Od (-O0).
/Qimf-precision:name         -fimf-precision=name
    Defines the accuracy for math library functions. The default is off (the compiler uses default heuristics). Possible values of name are high, medium and low. Reduced precision may lead to increased performance, and vice versa. Many routines in the math library are more highly optimized for Intel microprocessors than for non-Intel microprocessors.
/Qimf-arch-consistency:true  -fimf-arch-consistency=true
    Ensures that math library functions produce consistent results across different Intel or compatible, non-Intel processors of the same architecture. May decrease run-time performance. The default is false (off).
/Qprec-div[-]                -[no-]prec-div
    Improves [reduces] the precision of floating-point divides. This may slightly degrade [improve] performance.
/Qprec-sqrt[-]               -[no-]prec-sqrt
    Improves [reduces] the precision of square-root computations. This may slightly degrade [improve] performance.
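One common case where the recommended /fp:precise /fp:source (-fp-model precise -fp-model source) settings matter is summation: under the default fast=1 model the compiler may reorder the additions (for example, to vectorize), which can change the low-order bits of the result. The sketch below is illustrative only; the file and function names are assumptions.

    /* Reproducible summation:  icc -fp-model precise -fp-model source -c sum.c
       (Windows:                icl /fp:precise /fp:source /c sum.c)
       The default fast=1 model may reassociate the loop below and so change
       the rounding of the final result.                                        */

    double sum_array(const double *x, int n) {
        double s = 0.0;
        int i;
        for (i = 0; i < n; i++)
            s += x[i];          /* order of additions affects the last bits */
        return s;
    }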

Fine-Tuning (All Processors)


Windows*                     Linux* / Mac OS* X
/Qunroll[n]                  -unroll[n]
    Sets the maximum number of times to unroll loops. /Qunroll0 (-unroll0) disables loop unrolling. The default is /Qunroll (-unroll), which uses default heuristics.
/Qopt-prefetch:n             -opt-prefetch=n
    Controls the level of software prefetching. n is an optional value between 0 (no prefetching) and 4 (aggressive prefetching), with a default value of 2 when high-level optimization is enabled. Warning: excessive prefetching may result in resource conflicts that degrade performance.
/Qopt-block-factor:n         -opt-block-factor=n
    Specifies the preferred loop blocking factor n, the number of loop iterations in a block, overriding default heuristics. Loop blocking is enabled at /O3 (-O3) and is designed to increase the reuse of data in cache.
/Qopt-streaming-stores:mode  -opt-streaming-stores mode
    Specifies whether streaming stores may be generated. Values for mode:
        always   Encourages the compiler to generate streaming stores that bypass cache, assuming the application is memory bound with little data reuse.
        never    Disables generation of streaming stores.
        auto     Default compiler heuristics for streaming-store generation.
/Qrestrict[-]                -[no]restrict
    Enables [disables] pointer disambiguation with the restrict keyword. Off by default. (C/C++ only)
/Oa                          -fno-alias
    Assumes no aliasing in the program. Off by default.
/Ow                          -fno-fnalias
    Assumes no aliasing within functions. Off by default.
/Qalias-args[-]              -fargument-[no]alias
    Implies that function arguments may be aliased [are not aliased]. On by default. (C/C++ only) -fargument-noalias often helps the compiler to vectorize loops involving function array arguments.
/Qopt-class-analysis[-]      -[no-]opt-class-analysis
    C++ class hierarchy information is used to analyze and resolve C++ virtual function calls at compile time. If a C++ application contains non-standard C++ constructs, such as pointer down-casting, this may result in different behavior. Default is off, but it is turned on by default with the /Qipo (Windows) or -ipo (Linux and Mac OS X) compiler option, enabling improved C++ optimization. (C++ only)
                             -f[no-]exceptions
    -fexceptions, the default for C++, enables exception-handling table generation. -fno-exceptions, the default for C or Fortran, may result in smaller code. For C++, it causes exception specifications to be parsed but ignored; any use of exception-handling constructs (such as try blocks and throw statements) will produce an error if any function in the call chain has been compiled with -fno-exceptions.
/Qvec-threshold:n            -vec-threshold n
    Sets a threshold n for the vectorization of loops based on the probability of a performance gain. n may range from 0 to 100; the default is n=100.
        0     Vectorize loops regardless of the amount of computational work.
        100   Vectorize loops only if a performance benefit is almost certain.
/Qvec-report:n               -vec-report n
    Controls the vectorizer's diagnostic level. n specifies the level of detail, from 0 (no report) to 3 (maximum detail). Default is 0.
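To illustrate the aliasing options, the C99 restrict qualifier (accepted with /Qrestrict, -restrict) asserts that the pointer arguments below do not overlap, which often allows the loop to vectorize without run-time overlap tests; the file and function names are assumptions for illustration.

    /* Compile:  icc -O2 -restrict -vec-report2 -c triad.c
       (Windows: icl /O2 /Qrestrict /Qvec-report:2 /c triad.c) */

    void triad(float * restrict a, const float * restrict b,
               const float * restrict c, float s, int n) {
        int i;
        for (i = 0; i < n; i++)        /* no overlap assumed between a, b and c */
            a[i] = b[i] + s * c[i];
    }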

Debug Options
Windows*                     Linux* / Mac OS* X
/Zi                          -g
    Generates debug information for use with any of the common platform debuggers. Turns off /O2 (-O2) and makes /Od (-O0) the default unless /O2 (-O2) (or another O option) is specified.
/debug[:keyword]             -debug [keyword]
    Values of keyword:
        none          No debugging information is generated (default).
        full (or all) Produces debugging information for full symbolic debugging of unoptimized code. Same as -g (/Zi), or as -debug (/debug) with no keyword.
        extended      Produces additional information for improved symbolic debugging of optimized code (Linux and Mac OS X only). Debug symbols will generally increase the size of object modules and may slightly degrade the performance of optimized code. Also implies -debug full.
        parallel      Generates additional symbols and instrumentation for debugging threaded code. Does not imply -debug full.
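As an illustrative sketch (the file name is assumed), typical debug builds combine these options as follows:

    /* Unoptimized build with full symbols:   icc -O0 -g -o app app.c
                                              (Windows: icl /Od /Zi app.c)
       Optimized build with extended symbols: icc -O2 -debug extended -o app app.c
       Note that -g (/Zi) on its own switches the default optimization level
       to -O0 (/Od).                                                              */

    int answer(void) { return 42; }   /* placeholder so the listing is a complete translation unit */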

Optimization Notice

Intel Compiler includes compiler options that optimize for instruction sets that are available in both Intel and non-Intel microprocessors (for example, SIMD instruction sets), but do not optimize equally for non-Intel microprocessors. In addition, certain compiler options for Intel Compiler are reserved for Intel microprocessors. For a detailed description of these compiler options, including the instruction sets they implicate, please refer to "Intel Compiler User and Reference Guides > Compiler Options." Many library routines that are part of Intel Compiler are more highly optimized for Intel microprocessors than for other microprocessors. While the compilers and libraries in Intel Compiler offer optimizations for both Intel and Intel-compatible microprocessors, depending on the options you select, your code and other factors, you likely will get extra performance on Intel microprocessors.

While the paragraph above describes the basic optimization approach for Intel Compiler, with respect to Intel's compilers and associated libraries as a whole, Intel Compiler may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include Intel Streaming SIMD Extensions 2 (Intel SSE2), Intel Streaming SIMD Extensions 3 (Intel SSE3), and Supplemental Streaming SIMD Extensions 3 (Intel SSSE3) instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors.

Intel recommends that you evaluate other compilers to determine which best meet your requirements.

For product and purchase information, visit the Intel Software Development Products site at: www.intel.com/software/products/compilers

Intel, the Intel logo, Pentium, Intel VTune, Intel Core and Intel Cilk are trademarks of Intel Corporation in the U.S. and other countries. * Other names and brands may be claimed as the property of others. © 2010, Intel Corporation. All rights reserved.
