Re: Collecting Historical Data (Perf Toolbox ?) Revisited

From: Damir Delija (damir.delija@PBZ.HR)
Date: Fri Sep 19 2003 - 09:49:26 EDT


There are some usefull things in the "Managing Aix farms"

but scripts are scripts ...

I'm actually using PDT, sar, sargraph.sh, nmon wlmstat to get data
and than scripts are doing reports, filtering out etc and
data is presented trough NCP ....

Since we have some akward reporting system (weekly based) scripts do html pages
in format appropriate for upper people (actually I've never seen anyone reading
this reports ...)

and we get something like this

>

------------------------------------------------------------------------

Performance report AIX server XXXXXX

Measurement time from: 06/09/03 to: 06/09/03, report generated Mon Jun 16 07:47:51 DFT 2003

Tools used: sar pdt nmon

Conclusion
System load is XXX. Swap que and run que sizes are xx.

PDT tool report
 Performance Diagnostic Facility 1.0

 Report printed: Fri Jun 13 10:00:00 2003

 Host name: XXXXXX
 Range of analysis includes measurements
 from: Hour 9 on Thursday, May 22nd, 2003
 to: Hour 9 on Friday, June 13th, 2003

------------------------ Alerts ---------------------

  I/O CONFIGURATION
   - Note: volume hdiskpower199 has 69760 MB available for allocation
      while volume hdisk3 has 0 MB available

  PAGING CONFIGURATION
   - Paging space hd6 on volume group rootvg is fragmented

  I/O BALANCE
      volume hdiskpower160, mean util. = 4.61 %
      volume hdisk432, mean util. = 2.62 %
      volume hdisk0, mean util. = 2.60 %
      volume hdisk219, mean util. = 2.54 %
      volume hdisk2, mean util. = 1.98 %
      volume hdisk3, mean util. = 0.52 %
      volume hdisk1, mean util. = 0.27 %
      volume hdiskpower210, mean util. = 0.05 %
      volume hdiskpower188, mean util. = 0.04 %
      volume hdiskpower148, mean util. = 0.04 %
      volume hdiskpower143, mean util. = 0.04 %
      volume hdiskpower161, mean util. = 0.03 %
      volume hdiskpower142, mean util. = 0.03 %
      volume hdisk433, mean util. = 0.03 %
      volume hdisk299, mean util. = 0.03 %
      volume hdisk220, mean util. = 0.03 %
      volume hdiskpower149, mean util. = 0.02 %
      volume hdisk409, mean util. = 0.02 %
      volume hdisk408, mean util. = 0.02 %
      volume hdisk208, mean util. = 0.02 %
      volume hdiskpower131, mean util. = 0.01 %
      volume hdiskpower130, mean util. = 0.01 %
      volume hdisk420, mean util. = 0.01 %
      volume hdisk399, mean util. = 0.01 %
      volume hdisk398, mean util. = 0.01 %
      volume hdisk375, mean util. = 0.01 %
      volume hdisk374, mean util. = 0.01 %
      volume hdisk263, mean util. = 0.01 %
      volume hdisk207, mean util. = 0.01 %
      volume hdisk185, mean util. = 0.01 %
      volume hdisk184, mean util. = 0.01 %
   - Phys. vol. hdiskpower160 is significantly busier than others

  FILE SYSTEMS
   - File system lv00 (/test) is nearly full at 100 %

  VIRTUAL MEMORY
   - There is evidence of memory contention, yet Memory Load Control is disabled

---------------------- Upward Trends ----------------

  FILES
   - File (or directory) /usr/adm/wtmp SIZE is increasing
      now, 1020 KB and increasing an avg. of 25982 bytes/day

  FILE SYSTEMS
   - File system prod206 (/u06/oradata/prod2) is growing
      now, 50.00 % full, and growing an avg. of 1.57 %/day
      At this rate, prod206 will be full in about 28 days

  PAGE SPACE
   - Page space paging00 USE is growing
      now, 327.68 MB and growing an avg. of 16.03 MB/day
   - Page space hd6 USE is growing
      now, 327.68 MB and growing an avg. of 16.26 MB/day

---------------------- Downward Trends --------------

  FILE SYSTEMS
   - File system hd4 (/) is shrinking
      now, 59.00 % full, and declining an avg. of 0.83 %/day

----------------------- System Health ---------------

  SYSTEM HEALTH
   - Current process state breakdown:
      4.00 [ 1.4 %] : in state _
      283.80 [ 98.5 %] : active
      0.40 [ 0.1 %] : zombie
      288.20 = TOTAL

-------------------- Summary -------------------------
  This is a severity level 3 report
  No further details available at severity levels > 3
CPU usage
 Limits are: iow% > 25%, usr% + sys% > 80%, idle% < 10%
    DATE Msys Asys Mwio Awio Musr Ausr Midl Aidl
06/09/03 3 1 14 1 12 2 100 97
06/10/03 4 1 17 1 16 2 100 96
06/11/03 4 1 14 1 12 2 100 96
06/12/03 6 1 16 1 13 2 100 96
06/13/03 5 1 33 1 14 2 99 96
06/14/03 2 0 7 0 6 1 100 99
06/15/03 2 0 4 0 4 1 100 99
MAXIMUM 6 1 33 1 16 2 100 99
Run que and swap que size
There is 14 processors, limits are on 70 values: 5 x 14

    DATE Mrqs Arqs Msqs Asqs
06/09/03 3.3 1.3 3.0 1.3
06/10/03 4.0 1.3 3.0 1.3
06/11/03 3.0 1.3 4.0 1.3
06/12/03 3.0 1.3 3.0 1.3
06/13/03 4.0 1.3 5.2 1.3
06/14/03 3.0 1.2 2.0 1.1
06/15/03 4.0 1.1 4.0 1.1
MAXIMUM 4 1 5 1

Daily sar graphs

-------------------------------------------------------------------



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 22:17:13 EDT