Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The up.time The Uptime Infrastructure Monitor 4 Tru64 agent collect the following performance metrics from the systems on which it is installed:

...

Each set of performance metrics is averaged between the interval at which the up.time Uptime Infrastructure Monitor monitoring station polls the agent (e.g. every 10 minutes).

CPU

The up.time Uptime Infrastructure Monitor agent uses the sar utility (with the -u and -f options) to collect the metrics listed below from a Tru64 agent. The statistics returned by the agent are averaged for all CPUs on the system.

MetricExplanation
% UsrThe amount of time that the CPU spends in user mode.
% SysThe amount of time that the kernel spends processing system calls.
% WIOThe amount of waiting time that a runnable runable process for a device takes to perform an I/O operation.
Multi CPU UsageWhether or not a system wity with multiple CPUs is effectively balancing tasks between CPUs, or if processes are being forced off CPUs in certain circumstances.
Run Queue LengthThe percentage of time that one or more services or processes are waiting to be served by the CPU.
Run Queue OccupancyThe percentage of time that one or more services or processes are waiting to be served by the CPU.

Multi-CPU

The up.time Uptime Infrastructure Monitor agent uses the psrinfo utility on a Tru64 system to collect information about CPUs, then uses the sar and mpstat utilities to collect the metrics listed below from systems with multiple CPUs. The CPU statistics output by the agent are an average of all the CPUs on the server.

MetricExplanation
User %The percentage of CPU user processes that are in use.
System %The percentage of CPU kernel processes that are in use.
Wait I/O %The percentage of time that a process which can be run must wait for a device to perform an I/O operation.
SMTXThe number of read or write locks that a thread was not able to acquire on the first attempt, as reported by the mpstat command.
XCALThe number of interprocess cross-calls. In a multi-processor environment, one processor sends cross-calls to another processor to get that processor to do work. Cross-calls can also be used to ensure consistency in virtual memory. Heavy file system activity such as NFS can result in a high number of cross-calls.
InterruptsThe number of CPU interrupts.
Total %The total amount of User %, System %, and Wait I/O%.

Memory

The up.time Uptime Infrastructure Monitor agent uses the following commands to collect memory metrics from a Tru64 system:

...

MetricExplanation
Free MemoryThe amount of physical memory available to the operating system, system library files, and applications.
Cache Hit RateHow often the system accesses the CPU cache.
Page-outs/sThe rate at which pages were written to disk.
Page-ins/sThe rate at which pages were read from or written to the disk.
Page Frees/sThe number of pages that are freed from memory each second.
Attaches/sThe number of pages that get attached to memory each second.
Page-out Reqests/sThe number of requests to perform a write operation that occur each second.
Page-in reqs/sThe number of requests to perform a read operation that occur each second.
PageScans/sThe number of pages that are scanned each second.
PageFaults/sThe number of page faults that occur each second.
Software Locks/sThe number of software locks that are issued each second.

Disk

The up.time Uptime Infrastructure Monitor agent uses the following commands to collect disk metrics from a Tru64 system:

...

MetricExplanation
Disk (Spindle) NameThe names of each disk on the system.
Usage (% Busy)The percentage of time during which the disk drive is handling read or write requests.
Blocks per secondThe number of read and write operations on the disk that occur each second.
Transfers/sThe average number of bytes that have been transferred to or from the disk during write or read operations.
Average Queued RequestsThe number of threads that are waiting for processor time.
Average Service TimeThe average amount of time, in milliseconds, that is required for a request to be carried out.
Average Wait TimeThe average time, in milliseconds, that a transaction is waiting in a queue. The wait time is directly proportional to the length of the queue.

Network

The up.time Uptime Infrastructure Monitor agent uses the following commands to collect network metrics from a Tru64 system:

...

MetricExplanation
Receive RateThe rate, in kilobytes per seconds, at which data is received over a specific network adapter.
Send RateThe rate, in kilobytes per seconds, at which data is sent over a specific network adapter.
Packets Inbound ErrorsThe number of inbound packets that contained errors, which preventing those packets from being delivered to a higher-layer protocol.
Packets Outbound ErrorsThe number of outbound packets that could not be transmitted because of errors.
CollisionsThe number of signals from two separate nodes on the network that have collided.
TCP RetransmitsThe number of packets that have been re-sent over a network interface.

Process

The up.time Uptime Infrastructure Monitor agent uses the following commands to collect process metrics from a Tru64 system:

...

MetricExplanation
Number of ProcessesThe number of processes that are currently running on a system.
Process Creation RateThis metric determines whether or not there are runaway processes on a system or if a forking-based process (like a Web server) is spawning too many processes over a specified period of time.
Processes RunningThe number of processes that are currently running.
Processes BlockedThe number of processes that are currently being blocked from running.
Processes WaitingThe number of processes that are currently waiting to runnrun.
Workload - UserThe demand that network and local services are putting on the system, based on the IDs of the users who are logged into a system.
Workload - GroupThe demand that network and local services are putting on the system, based on the IDs of the user groups that are logged into a system.
Workload - Process NameThe demand that network and local services are putting on a system, based on the processes that are running.
Workload Top 10 - UserThe 10 network and local services that are are putting the most load on the system, based on the IDs of the users who are logged into a system.
Workload Top 10 - GroupThe 10 network and local services that are are putting the most load on the system, based on the IDs of the user groups who are logged into a system.
Workload Top 10 - Process NameThe 10 network and local services that are are putting the most load on the system, based on the processes that are running.

User

The up.time Uptime Infrastructure Monitor agent uses the following commands to collect user statistics from a Tru64 system:

...