Cypress CSC-1200T User's Guide Page 99

  • Download
  • Add to my manuals
  • Print
  • Page
    / 124
  • Table of contents
  • BOOKMARKS
  • Rated. / 5. Based on customer reviews
Page view 98
Chapter 9. Programming tools 99
performance information and instruction counts.
PAT is able to analyze programs written in Fortran 90, C, C++ and HPF.
The executable only needs to relinked, no recompiling is necessary. The
linker option -l pat along with the PAT specific cld file pat.cld are
required.
As an example, suppose that a Fortran 90 program in the file prog.f90
is to be analyzed. The following commands can be used:
t3e% f90 -c prog.f90
t3e% f90 prog.o -o prog -l pat pat.cld
C, C++ and HPF programs are linked similarly. A log file of the type
pdf.1234 is produced during the execution.
Timing information is then displayed with the command
t3e% pat -T prog pdf.1234
Sample output for a four PE run looks like
Elapsed Time 4.229 sec 4 PEs
User Time (ave) 3.441 sec 81%
System Time (ave) 0.023 sec 1%
The PAT option -m produces performance counter statistics:
t3e% pat -m prog pdf.1234
Performance counters for FpOps
Values given are in MILLIONS.
PE cycles operations ops/sec dcache misses/sec
misses
0 425.34 152.45 134.39 5.48 4.83
1 1574.81 152.40 36.29 5.61 1.34
2 1574.87 152.40 36.28 5.62 1.34
3 1575.15 152.40 36.28 5.62 1.34
The column ops/sec contains the floating point performance given in
Mflop/s for each PE. A high cache miss rate can be caused by less than
optimal program design. Instead of floating point operations it is pos-
sible to measure integer performance by setting
t3e% setenv PAT_SEL INTOPS
Memory load and store operations can also be monitored.
Profile information can be obtained with the option -p. Normally only
subroutine-level profile is available. To produce a line-level profile the
Page view 98
1 2 ... 94 95 96 97 98 99 100 101 102 103 104 ... 123 124

Comments to this Manuals

No comments