PADD
    using GNU
PARALLEL
   Auto-parallelization: Enabling, Options, Directives, and Environment Variables
   Combined Parallel/Worksharing Constructs
   COPYIN Clause
   Debugging Shared Variables
   DEFAULT Clause
   OpenMP Directives and Clauses Summary
   Parallel Processing Thread Model
   Parallel Region Directives
   PRIVATE, FIRSTPRIVATE, and LASTPRIVATE Clauses
   Programming with Auto-parallelization
   Programming with OpenMP
   REDUCTION Clause
   SHARED Clause
parallel construct
    begin
    end
PARALLEL directive
   Auto-parallelization: Enabling, Options, Directives, and Environment Variables
   Parallel Region Directives
   Synchronization Constructs
PARALLEL DO
    use
PARALLEL DO directive
   Programming with Auto-parallelization
   Specifying Schedule Type and Chunk Size
parallel invocations with makefile
   Basic PGO Options
   Creating a Multifile IPO Executable with Makefile
PARALLEL PRIVATE
parallel processing
    directive groups
    thread model
        pseudo code
parallel program development
parallel regions
    debugging
    directives
    entry
PARALLEL SECTIONS
    use
PARALLEL SECTIONS/END PARALLEL SECTIONS
parallel/worksharing
   Combined Parallel/Worksharing Constructs
   Programming with OpenMP
parallelism
parallelization
    loops
    overview
       Compiling with OpenMP, Directive Format, and Diagnostics
       Default Compiler Optimizations
       Parallel Processing Thread Model
       Parallelism: an Overview
       Programming with Auto-parallelization
    relieves
parsing
    I/O
part
    mutually-exclusive
pathname
pcolor
Pentium® 4 processors
Pentium® III processors
Pentium® M processors
performance-critical
   Code-coverage Tool
   OpenMP Support Libraries
performance-related options
performance analysis
performance analyzer
   Debugging Multithread Programs Overview
   Using Intel Performance Analysis Tools
performing
    data flow
       Auto-parallelization Overview
       Parallelism: an Overview
    I/O
PGO
    environment variables
    methodology
    PGO API
    phases
    usage model
PGO API support
    dumping and resetting profile information
    dumping profile information
    interval profile dumping
    overview
    resetting the dynamic profile counters
    resetting the profile information
pgopti.dpi file
    compiler produces
    existing
    remove
pgopti.spi
   Code-coverage Tool
   Profile-guided Optimizations Methodology and Usage Model
   Test Prioritization Tool
PGOPTI_Prof_Dump
   Dumping Profile Information
   Merging the .dyn Files
PGOPTI_Prof_Dump_And_Reset
PGOPTI_Prof_Reset
   Dumping Profile Information
   Resetting the Dynamic Profile Counters
PGOPTI_Set_Interval_Prof_Dump
pgouser.h
phase1
phase2
pipelining
    Itanium®-based applications
    optimization
placing
    PREFETCH
pointer aliasing
pointers
   IPO Overview
   OpenMP Directives and Clauses Summary
   Optimizations and Debugging
   Prefetching
   Stacks: Automatic Allocation and Checking
   Using Arrays Efficiently
   Vectorization Key Programming Guidelines
position-independent code
POSIX
preemption
    preemptable
    preempted
       Criteria for Inline Function Expansion
       Symbol Visibility Attribute Options
PREFETCH
    placing
prefetching
    optimizations
    option
    support
preparing
    code
preventing
    CRAY pointers
    inlining
PRINT
PRINT statement
prioritization
PRIVATE
    use
PRIVATE clause
   PRIVATE, FIRSTPRIVATE, and LASTPRIVATE Clauses
   REDUCTION Clause
private scoping
    variable
procedure names
process
    overview
process_data
processor
    processor-based
    processor-instruction
    targeting
       Optimizing for Specific Processors Overview
       Targeting a Processor, -tppn
produced
    IL
    multithreaded
       Parallelization with OpenMP* Overview
       Programming with OpenMP
    profile-optimized
prof_dpi file
prof_dpi Test1.dpi
prof_dpi Test2.dpi
prof_dpi Test3.dpi
PROF_DUMP_INTERVAL
   PGO API Support Overview
   PGO Environment Variables
PROF_NO_CLOBBER
profile-guided optimizations (see also PGO)
    instrumented program
    methodology
    overview
    phases
    utilities
profile-optimized
    executable
    generating
    produce
profile data
    dumping
profile IGS
    describe
    environment variable
    functions
    variable
profile information
    dumping
    generation support
profiling summary
    specifying
profmerge
    tool
       Merging the .dyn Files
       Test Prioritization Tool
    use
    utility
program
    affected aspect
program loops
    dataflow
programming
    high performance
project makefile
PROTECTED
providing
    superset
pseudo code
    parallel processing model
pushl
   Debugging Multiple Threads
   Debugging Parallel Regions



READ
    READ DATA
READ/WRITE statements
REAL
    REAL DATA
real object files
REAL*16
REAL*4
REAL*8
reassociation
   Floating-point Arithmetic Precision for Itanium®-based Systems
   Improving/Restricting FP Arithmetic Precision
   REDUCTION Clause
rec8byte keyword
RECL
    value
recnbyte keyword
recommendations
    coding
    controlling alignment
record buffers
    efficient use of
RECORD statement
redeclaring
redirected standard
REDUCTION
    clause
    completed
    end
    use
    variables
       Debugging Shared Variables
       REDUCTION Clause
reduction/induction variable
ref_dpi_file
    respect
relieving
    I/O
relocating source files
removing
    pgopti.dpi
reordering
    transformations
repeating
    instrumentation
replicated code
report
    availability
    generation
    optimizer
    stderr
resetting
    dynamic profile counters
    profile information
restricting
    FP arithmetic precision
    optimizations
RESULT
results
    IPO
RETURN
    double-precision
    return values
REVERSE
rm PROF_DIR
round-to-nearest
rounding
    control
    significand
routines
    selecting
    timing
RTL
run
    differential coverage
    multithreaded
    test prioritization
run-time
    call
    checks for IA-32 systems
    library routines
    peeling
    performance
    processor-specific checks
    scheduling



SAVE statement
scalar
    -scalar_rep
    clean-up iterations
    replacement
    scalar_integer_expression
    scalar_logical_expression
SCHEDULE
    clause
    specifying
    use
scoping
SCRATCH
   COPYIN Clause
   THREADPRIVATE Directive
screenshot
SECNDS
SECTION
   OpenMP Directives and Clauses Summary
   Programming with OpenMP
   Worksharing Construct Directives
SECTION directive
   Combined Parallel/Worksharing Constructs
   PRIVATE, FIRSTPRIVATE, and LASTPRIVATE Clauses
   Worksharing Construct Directives
SECTIONS
    directive
       Combined Parallel/Worksharing Constructs
       Worksharing Construct Directives
    use
selecting
    routines
SEQUENCE
    omit
    specify
    statement
       Alignment Options
       Setting Data Type and Alignment
    use
setenv
setting
    arguments
    coloring scheme
    conditional parallel region execution
    DAZ
    email
    errno
    F_UFMTENDIAN variable
    FTZ
    html files
    integer and floating-point data
    optimization level
    units
SHARED
    clause
    debugging
    shared scoping
    shared variables
    updating
    use
significand
    round
SIMD
   Coding Guidelines for Intel Architectures
   Parallelism: an Overview
   Strip-mining and Cleanup
   Types of Loop Vectorized
   Vectorization Key Programming Guidelines
   Vectorization Overview
SIMD-encodings
    enabling
SIMD SSE2
    streaming
simple difference operator
SIN
   Statements in the Loop Body
   Types of Loop Vectorized
SINGLE
    directive
       Synchronization Constructs
       Worksharing Construct Directives
    encounters
    executing
    use
single-instruction
single-precision
   Floating-point Arithmetic Precision Overview
   Improving Runtime Efficiency
single-statement loops
single-threaded
small logical data items
small_bar
SMP
   Auto-parallelization Overview
   Coding Guidelines for Intel Architectures
   Parallelization with OpenMP* Overview
software pipelining
   Loop Count and Loop Distribution
   Parallelism: an Overview
   Pipelining for Itanium®-based Applications
source
    code
    coding guidelines
    files relocation
    input
       Auto-parallelization: Enabling, Options, Directives, and Environment Variables
       Compiling with OpenMP, Directive Format, and Diagnostics
    listing
       Debugging Multiple Threads
       Debugging Parallel Regions
    view
specialized code
   Automatic Processor-specific Optimization (IA-32 only)
   Parallelism: an Overview
   Processor-specific Optimization (IA-32 only)
   Vectorizer Options
specific
    optimizing
specifying
    8-byte data
    DEFAULT
    directory
    END DO
    KIND
    ORDERED
    profiling summary
    RECL
    schedule
    SEQUENCE
    symbol visibility explicitly
    vectorizer
    visibility without symbol file
spi
    file
       Code-coverage Tool
       Test Prioritization Tool
    option
    pgopti.spi
       Intel® Compilers Code-coverage Tool
       Test Prioritization Tool
SQRT
SSE
   Coding Guidelines for Intel Architectures
   Floating-point Arithmetic Precision Overview
   Types of Loop Vectorized
   Vectorization Overview
SSE2
   Coding Guidelines for Intel Architectures
   Vectorization Overview
stacks
    size
standard
    OpenMP clauses
    OpenMP directives
    OpenMP environment variables
statements
    accessing
    BLOCKSIZE
    BUFFERCOUNT
    BUFFERED
    functions
STATIC
STATUS
stderr
    report
Stream_LF
streaming
    SIMD SSE2
Streaming SIMD Extensions
    single-precision
stride-1
    example
strings
strip-mining
STRUCTURE statements
   Alignment Options
   Setting Data Type and Alignment
SUBDOMAIN
subl
   Debugging Multiple Threads
   Debugging Parallel Regions
subobjects
suboption
subroutine
    machine code listing
    PADD
        entry
        source listing
    PARALLEL
    PGOPTI_PROF_DUMP
    VEC_COPY
    WORK
subscripts
    array
    loop
    varying
substring
    containing
superset
support
    DAZ
    FTZ
    loop unrolling
    MMX
    OpenMP Libraries
       Auto-parallelization Overview
       OpenMP Support Libraries
    prefetching
    symbolic debugging
    vectorization
    worksharing
SWP directive
symbol
    file
    preemption
    visibility attribute options
symbolic debugging
synchronization
    constructs
    identify
    worksharing construct directives
syntax
   Auto-parallelization: Enabling, Options, Directives, and Environment Variables
   Compiling with OpenMP, Directive Format, and Diagnostics
SYSTEM_CLOCK
systems

