PCPINTRO(1) | General Commands Manual | PCPINTRO(1) |
PCPIntro - introduction to the Performance Co-Pilot (PCP)
The Performance Co-Pilot (PCP) is a toolkit designed for monitoring and managing system-level performance. These services are distributed and scalable to accommodate the most complex system configurations and performance problems.
PCP supports many different platforms, including (but not limited to) Linux, MacOSX, Solaris and Windows. From a high-level PCP can be considered to contain two classes of software utility:
This manual entry describes the high-level features and options common to most PCP utilities available on all platforms.
The PCP architecture is distributed in the sense that any PCP tool may be executing remotely. On the host (or hosts) being monitored, each domain of performance metrics, whether the kernel, a service layer, a database management system, a web server, an application, etc. requires a Performance Metrics Domain Agent (PMDA) which is responsible for collecting performance measurements from that domain. All PMDAs are controlled by the Performance Metrics Collector Daemon (pmcd(1)) on the same host.
Client applications (the monitoring tools) connect to pmcd(1), which acts as a router for requests, by forwarding requests to the appropriate PMDA and returning the responses to the clients. Clients may also access performance data from sets of PCP archives (created using pmlogger(1)) for retrospective analysis.
PCP redistributes a wealth of performance information within a host and across its networks. The following security philosophy underlies the setting of several defaults that control how much information is sent and received.
By default, the information exposed by PMCD about a host is approximately of the same level of confidentiality as available to a completely unprivileged user on that host. So, performance data that is available to be read completely freely on a machine may be made available by PMCD to the network.
However, the host running PMCD and its network is not assumed to run only friendly applications. Therefore, write type operations, including from the local host, are not permitted by default.
These defaults may be overridden (expanded or reduced) in several ways, including by specifying network ACLs in pmcd.conf, activating non-default PMDAs, or by using PMCD connections that pass user credentials. For example, some PMDAs automatically provide greater information for particular credentialed users or groups.
The following performance monitoring applications are primarily console based, typically run directly from the command line, and are just a small subset of the tools available as part of the base PCP package.
Each tool or command is documented completely in its own reference page.
If the PCP GUI package is installed then the following additional tools are available.
There is a set of common command line arguments that are used consistently by most PCP tools.
archive is a comma-separated list of names, each of which may be the name of a directory containing one or more archives, the base name common to all of the physical files created by an instance of pmlogger(1), or any one of the physical files, e.g. /path/to/myarchives (directory) or myarchive (base name) or myarchive.meta (the metadata file) or myarchive.index (the temporal index) or myarchive.0 (the first data volume of archive) or myarchive.0.bz2 or myarchive.0.bz (the first data volume compressed with bzip2(1)) or myarchive.0.gz or myarchive.0.Z or myarchive.0.z (the first data volume compressed with gzip(1)), myarchive.1 or myarchive.3.bz2 or myarchive.42.gz etc.
In the absence of a live or archive source of metrics, a heuristic search for archives for the local host can be invoked via the -O (or --origin) option. When using this option without an explicit source of metrics, monitor tools attempt to use archives from a system archive location such as $PCP_LOG_DIR/pmlogger/`hostname`. Refer to the TIME WINDOW SPECIFICATION section below for details on the acceptable syntax for the origin option, but a typical invocation in this mode would be -O today or --origin yesterday.
Most PCP tools operate with periodic sampling or reporting, and the -t (or --interval) and -A (or --align) options may be used to control the duration of the sample interval and the alignment of the sample times.
The interval argument is specified as a sequence of one or more elements of the form
where number is an integer or floating point constant (parsed using strtod(3)) and the optional units is one of: seconds, second, secs, sec, s, minutes, minute, mins, min, m, hours, hour, h, days, day and d. If the unit is empty, second is assumed.
number[units]
In addition, the upper case (or mixed case) version of any of the above is also acceptable.
Spaces anywhere in the interval are ignored, so 4 days 6 hours 30 minutes, 4day6hour30min, 4d6h30m and 4d6.5h are all equivalent.
Multiple specifications are additive, for example ``1hour 15mins 30secs'' is interpreted as 3600+900+30 seconds.
The align argument follows the syntax for an interval argument described above for the -t or --interval option.
Note that alignment occurs by advancing the time as required, and that -A (or --align) acts as a modifier to advance both the start of the time window (see the next section) and the origin time (if the -O or --origin option is specified).
Many PCP tools are designed to operate in some time window of interest, for example to define a termination time for real-time monitoring or to define a start and end time within a set of PCP archives.
In the absence of the -O (or --origin) and -A (or --align) options to specify an initial sample time origin and time alignment (see above), the PCP application will retrieve the first sample at the start of the time window.
The following options may be used to specify a time window of interest.
The starttime parameter may be given in one of three forms (interval is the same as for the -t or --interval option as described above, datetime is described below):
The endtime parameter may be given in one of three forms (interval is the same as for the -t or --interval option as described above, datetime is described below):
The origin argument accepted by -O (or --origin) conforms to the same syntax and semantics as the starttime argument for the -T (or --finish) option.
For example --origin -0 specifies that the initial position should be at the end of the time window; this is most useful when wishing to replay ``backwards'' within the time window.
The datetime argument for the -O (or
--origin), -S (or --start) and -T (or
--finish) options consists of:
date time zone day relative
A date can be one of: YY-MM-DD, MM/DD/YY, DD Month YYYY, or Month DD YYYY. A
time can be one of: HH:MM:SS, HH:MM. HH:MM can use either the 12 hour (via
an am or pm suffix) or 24 hour convention. A day of the week can be a
spelled out day of the week, optionally preceded by an ordinal number such
as second Tuesday. A zone is a time zone value as specified by the
tzselect(8) command. A relative time can be a time unit that is:
preceded by a cardinal number such as 1 year or 2 months, preceded by one of
the time words this or last, or succeeded by the time word ago. A relative
time can also be one of the time words: yesterday, today, tomorrow, now.
Examples of datetime strings are: 1996-03-04 13:07:47 EST Mon,
1996-03-05 14:07:47 EST -1hour, Mon Mar 4 13:07:47 1996,
Mar 4 1996, Mar 4, Mar, 13:07:50 or
13:08.
For any missing low order fields, the default value of 0 is assumed for hours, minutes and seconds, 1 for day of the month and Jan for months. Hence, the following are equivalent: --start '@ Mar 1996' and --start '@ Mar 1 00:00:00 1996'.
If any high order fields are missing, they are filled in by starting with the year, month and day from the current time (real-time mode) or the time at the beginning of the set of PCP archives (archive mode) and advancing the time until it matches the fields that are specified. So, for example if the time window starts by default at ``Mon Mar 4 13:07:47 1996'', then --start @13:10 corresponds to 13:10:00 on Mon Mar 4, 1996, while --start @10:00 corresponds to 10:00:00 on Tue Mar 5, 1996 (note this is the following day).
For greater precision than afforded by datetime(3), the seconds component may be a floating point number.
If a timezone is not included in a datetime then there ares several interpretations available depending on the other command line options used. The default is to use the local timezone on the system where the PCP tool is being run. A -Z or --timezone option specifies an explicit timezone, else a -z or --hostzone option changes the timezone to the local timezone at the host that is the source of the performance metrics.
The number of performance metric names supported by PCP on most platforms ranges from many hundreds to several thousand. The PCP libraries and applications use an internal identification scheme that unambiguously associates a single integer with each known performance metric. This integer is known as the Performance Metric Identifier, or PMID. Although not a requirement, PMIDs tend to have global consistency across all systems, so a particular performance metric usually has the same PMID.
For all users and most applications, direct use of the PMIDs would be inappropriate (this would limit the range of accessible metrics, make the code hard to maintain, force the user interface to be particularly baroque, and so on). Hence a Performance Metrics Name Space (PMNS) is used to provide external names and a hierarchic classification for performance metrics. A PMNS is represented as a tree, with each node having a label, a pointer to either a PMID (for leaf nodes) or a set of descendent nodes in the PMNS (for non-leaf nodes).
A node label must begin with an alphabetic character, followed by zero or more characters drawn from the alphabetics, the digits and character ``_'' (underscore). For alphabetic characters in a node label, upper and lower case are distinguished.
By convention, the name of a performance metric is constructed by concatenation of the node labels on a path through the PMNS from the root node to a leaf node, with a ``.'' as a separator. The root node in the PMNS is unlabeled, so all names begin with the label associated with one of the descendent nodes below the root node of the PMNS, for example kernel.percpu.syscall. Typically (although this is not a requirement) there would be at most one name for each PMID in a PMNS. For example kernel.all.cpu.idle and disk.dev.read are the unique names for two distinct performance metrics, each with a unique PMID.
Groups of related PMIDs may be named by naming a non-leaf node in the PMNS tree, for example disk.
The default local PMNS used by pmcd is located at $PCP_VAR_DIR/pmns/root however the environment variable PMNS_DEFAULT may be set to the full pathname of a different PMNS which will then be used as the default local PMNS.
Most applications do not use the local PMNS directly, but rather import parts of the PMNS as required from the same place that performance metrics are fetched, i.e. from pmcd(1) for live monitoring or from a set of PCP archives for retrospective monitoring.
To explore the PMNS use pminfo(1), or if the PCP GUI package is installed the New Chart and Metric Search windows within pmchart(1).
Some performance metrics have a singular value. For example, the available memory or number of context switches have one value per performance metric source, that is, one value per host. The metric descriptor (metadata) for each metric makes this fact known to applications that process values for these single-valued metrics.
Some performance metrics have a set of values or instances in each implementing performance metric domain. For example, one value for each disk, one value for each process, one value for each CPU, or one value for each activation of a given application. When a metric has multiple instances, the PMNS does not represent this in metric names; rather, a single metric may have an associated set of values. Multiple values are associated with the members of an instance domain, such that each instance has a unique instance identifier within the associated instance domain. For example, the ''per CPU´´ instance domain may use the instance identifiers 0, 1, 2, 3, and so on to identify the configured processors in the system. Internally, instance identifiers are encoded as binary values, but each performance metric domain also supports corresponding strings as external names for the instance identifiers, and these names are used at the user interface to the PCP utilities.
Multiple performance metrics may be associated with a single instance domain. For example, per-process metrics under proc all share the same instance domain.
PCP arranges for information describing instance domains to be exported from the performance metric domains to the applications that require this information. Applications may also choose to retrieve values for all instances of a performance metric, or some arbitrary subset of the available instances.
Metric names and the instance domain concept provides two-dimensions for the modelling of performance metrics. This is a clear and simple model, however on some occasions it does not suffice. For example, a metric may wish to represent higher dimensional data such as ``per CPU'' counters for each running process. In these cases it is common to create a compound instance, where the name is composed of each component with a separator in-between (for example, ``87245::cpu7'' might be used to separate process ID from CPU ID) to create flattened instance names. Additionally, such cases benefit from the use of metric instances labels to explicitly show the separate components (continuing the example from above, labels ``{"pid":87245,"cpu":7}'' might be used).
In configuration files and (to a lesser extent) command line options, metric specifications adhere to the following syntax rules by most tools. See the tool specific manual pages for the exact syntax supported.
If the source of performance metrics is real-time from
pmcd(1) then the accepted syntax is
host:metric[instance1,instance2,...]
If the source of performance metrics is a set of PCP archives then
the accepted syntax is
archive/metric[instance1,instance2,...]
The host:, archive/ and [instance1,instance2,...] components are all optional.
The , delimiter in the list of instance names may be replaced by white space.
Special characters in instance names may be escaped by surrounding the name in double quotes or preceding the character with a backslash.
White space is ignored everywhere except within a quoted instance name.
An empty instance is silently ignored, and in particular ``[]'' is the same as no instance, while ``[one,,,two]'' is parsed as specifying just the two instances ``one'' and ``two''.
As a special case, if the host is the single character ``@'' then this refers to a PM_CONTEXT_LOCAL source, see pmNewContext(3).
Since PCP version 3.6.11, a monitor can explicitly request a secure connection to a collector host running pmcd(1) or pmproxy(1) using the PM_CTXFLAG_SECURE context flag. If the PCP Collector host supports this feature - refer to the pmcd.feature.secure metric for confirmation of this - a TLS/SSL (Transport Layer Security or Secure Sockets Layer) connection can be established which uses public key cryptography and related techniques. These features aim to prevent eavesdropping and data tampering from a malicious third party, as well as providing server-side authentication (confident identification of a server by a client) which can be used to guard against man-in-the-middle attacks.
A secure pmcd connection requires use of certificate-based authentication. The security features offered by pmcd and pmproxy are implemented using the OpenSSL APIs and utilities. The openssl(1) tool can be used to create certificates suitable for establishing trust between PCP monitor and collector hosts.
A complete description is beyond the scope of this document, refer to the PCP ENVIRONMENT, FILES and SEE ALSO sections for detailed information. This includes links to tutorials on the steps involved in setting up the available security features.
In the absence of an explicit hostname specification, most tools will default to the local host in live update mode. In PCP releases since 3.8.4 onward, this results in an efficient local protocol being selected - typically a Unix domain socket. If this option is used (which can also be explicitly requested via the unix: host specification described below), it is important to note that all connections will be automatically authenticated. In other words, the credentials of the user invoking a client tool will automatically be made available to pmcd(1) and all of its PMDAs, on the users behalf, such that results can be customized to the privilege levels of individual users.
Names of remote hosts running the pmcd(1) daemon can of course also be provided to request a remote host be used. The most basic form of pmcd host specification is a simple host name, possibly including the domain name if necessary. However, this can be extended in a number of ways to further refine attributes of the connection made to pmcd.
The pmcd port number and also optional pmproxy(1) hostname and its port number, can be given as part of the host specification, since PCP version 3.0. These supersede (and override) the old-style PMCD_PORT, PMPROXY_HOST and PMPROXY_PORT environment variables.
The following are valid hostname specifications that specify connections to pmcd on host nas1.acme.com with/without a list of ports, with/without a pmproxy(1) connection through a firewall, and with IPv6 and IPv4 addresses as shown.
$ pcp --host nas1.acme.com:44321,4321@firewall.acme.com:44322 $ pcp --host nas1.acme.com:44321@firewall.acme.com:44322 $ pcp --host nas1.acme.com:44321@firewall.acme.com $ pcp --host nas1.acme.com@firewall.acme.com $ pcp --host nas1.acme.com:44321 $ pcp --host [fe80::2ad2:44ff:fe88:e4f1%p2p1] $ pcp --host 192.168.0.103
In addition, ``connection attributes'' can also be specified. These include username, password (can be given interactively and may depend on the authentication mechanism employed), whether to target a specific running container, whether to use secure (encrypted) or native (naked) protocol, and so on. The previous examples all default to native protocol, and use no authentication. This can be altered, as in the following examples.
$ pcp --host pcps://app2.acme.com?container=cae8e6edc0d5 $ pcp --host pcps://nas1.acme.com:44321?username=tanya&method=gssapi $ pcp --host pcps://nas2.acme.com@firewalls.r.us?method=plain $ pcp --host pcp://nas3.acme.com $ pcp --host 192.168.0.103?container=cae8e6edc0d5,method=scram-sha-256 $ pcp --host unix: $ pcp --host local:
The choice of authentication method, and other resulting parameters like username, optionally password, etc, depends on the SASL2 configuration used by each (remote) pmcd. Tutorials are available specifying various aspects of configuring the authentication module(s) used, these fine details are outside the scope of this document.
In all situations, host names can be used interchangeably with IPv4 or IPv6 addressing (directly), as shown above. In the case of an IPv6 address, the full address must be enclosed by square brackets and the scope (interface) must also be specified.
The final local: example above is now the default for most tools. This connection is an automatically authenticated local host connection on all platforms that support Unix domain sockets. No password is required and authentication is automatic. This is also the most efficient (lowest overhead) communication channel.
The difference between unix: and local: is that the former is a strict Unix domain socket specification (connection fails if it cannot connect that way), whereas the latter has a more forgiving fallback to using localhost (i.e. a regular Inet socket connection is used when Unix domain socket connections are unavailable).
In addition, if the PCP product is installed the following files and directories are relevant.
In addition to the PCP run-time environment and configuration variables described in the PCP ENVIRONMENT section below, the following environment variables apply to all installations.
Note that most uses of these environment variables are optimized to check the environment only the first time the variable might be used. As the environment usually is not checked again, the only safe strategy is to ensure all PCP-related environment variables are set before the first call into any of the PCP libraries.
Each file in the resulting list is assumed to contain definitions of derived metrics as per the syntax described in pmLoadDerivedConfig(3), and these are loaded in order.
Derived metrics may be used to extend the available metrics with new (derived) metrics using simple arithmetic expressions.
If PCP_DERIVED_CONFIG is set, the derived metric definitions are processed automatically as each new source of performance metrics is established (i.e. each time a pmNewContext(3) is called) or when requests are made against the PMNS.
Any component in the $PCP_DERIVED_CONFIG list or the expanded list of files that is not a file, or is not a directory or is not accessible (due to permissions or a bad symbolic link) will be silently ignored.
If PCP_IGNORE_MARK_RECORDS is set (but has no value) then all <mark> records will be ignored. Otherwise the value $PCP_IGNORE_MARK_RECORDS follows the syntax for an interval argument described above for the -t option, and <mark> records will be ignored if the time gap between the last record before the <mark> and the first record after the <mark> is not more than interval.
If PCP_STDERR is set to the literal value DISPLAY then all messages will be displayed in a dialog. This is used for any tools launched from a Desktop environment.
If PCP_STDERR is set to any other value, the value is assumed to be a filename, and all messages will be written there.
The check on pmcd's readiness will wait up to PMCD_WAIT_TIMEOUT seconds. If pmcd has a long startup time (such as on a very large system), then PMCD_WAIT_TIMEOUT can be set to provide a maximum wait longer than the default 60 seconds.
The environment variable PCP_COUNTER_WRAP may be set to indicate that all such cases of a decreasing ``counter'' should be treated as a counter overflow, and hence the values are assumed to have wrapped once in the interval between consecutive samples. This ``wrapping'' behavior was the default in earlier PCP versions, but by default has been disabled in PCP release from version 1.3 on.
The following environment variables are relevant to installations in which pmlogger(1), the PCP archiver, is used.
If you have the PCP product installed, then the following environment variables are relevant to the Performance Metrics Domain Agents (PMDAs).
The previous behaviour was that if this variable was set, then a context established with the type of PM_CONTEXT_LOCAL will have access to the ``proc'' PMDA to retrieve performance metrics about individual processes.
The previous behaviour was that if this variable was set, then a context established with the type of PM_CONTEXT_LOCAL will have access to the ``sample'' PMDA if this optional PMDA has been installed locally.
Environment variables with the prefix PCP_ are used to parameterize the file and directory names used by PCP. On each installation, the file /etc/pcp.conf contains the local values for these variables. The $PCP_CONF variable may be used to specify an alternative configuration file, as described in pcp.conf(5).
For environment variables affecting PCP tools, see pmGetOptions(3).
pcp(1), pmcd(1), pmie(1), pmie_daily(1), pminfo(1), pmlc(1), pmlogger(1), pmlogger_daily(1), pmrep(1), pmstat(1), pmval(1), systemctl(1), LOGIMPORT(3), LOGARCHIVE(5), pcp.conf(5), pcp.env(5), PMNS(5) and chkconfig(8).
If the PCP GUI package is installed, then the following entries
are also relevant:
pmchart(1), pmtime(1), and pmdumptext(1).
If the secure sockets extensions have been enabled, then the
following references are also relevant:
https://pcp.io/documentation.html
https://pcp.readthedocs.io/en/latest/QG/EncryptedConnections.html
https://pcp.readthedocs.io/en/latest/QG/AuthenticatedConnections.html
Also refer to the books Performance Co-Pilot User's and Administrator's Guide and Performance Co-Pilot Programmer's Guide which can be found at https://pcp.readthedocs.io/en/latest/.
PCP | Performance Co-Pilot |