Dashboards section

The Dashboard section is in the Grid menu bar.

Together, the RTM dashboards display useful information about the status of your LSF clusters. By changing the icon color, RTM can also alert operators when a host becomes unavailable for some reason.

In its current form, you can view the status of each of your clusters, the status of feature licenses, and a pictorial representation of the hosts on those clusters. If you choose to filter the display, the display is changed to reflect the current filtering.

Reasons Dashboard

The reasons dashboard shows pending reason statistics for the cluster grouped by project or queue.

Statistical Dashboard

The statistical dashboard presents non-time series cluster and host statistics by using graphically rich charts.

Cluster Dashboard

The cluster dashboard shows the following information:
  • Cluster Name: The LSF cluster name.

  • Cluster Status: The status of the cluster.

  • Master Status: The status of the master host in the cluster.

  • PAU: The type of the host currently controlling the cluster. Valid values are as follows:

    • P: Primary master host

    • A: Failover host

    • U: Unknown host type

  • Collect Status: The data collection status for the cluster.

  • CPU %%: The cluster’s overall CPU utilization rate, as a percentage.

  • Slot %%: The entire cluster’s slot utilization, as a percentage.

  • Efic %%: The entire cluster’s CPU efficiency for running jobs. Efficiency is calculated with this formula: cpu_time / (run_time × #_of_cpus).

  • Total CPUs: The total number of CPUs in the cluster.

  • Host Slots: The total number of slots available to run jobs in the cluster.

  • Pend Jobs: The total number of pending jobs in the cluster.

  • Run Jobs: The total number of running jobs in the cluster.

  • Susp Jobs: The total number of suspended jobs in the cluster (including system suspended and user suspended jobs).

  • Hourly Started: The total number of jobs that are started during the last hour.

  • Hourly Done: The total number of jobs that are completed during the last hour.

  • Hourly Exit: The total number of jobs that are cancelled during the last hour (unsuccessful completion).

License Dashboard

The license link on the dashboard takes you to the license plug-in that displays a list of your configured license servers.

Host Dashboard

If you roll your mouse over a host, summary information displays about that host. For example, you can view load averages, numbers of job slots and current slot utilization, administrative notes and status. If you click a host icon, you are directed to the “RUNNING” jobs for that host (on the Job Info > Details page). Color-coding for the host icons is described under the Host Status Legend section.

The host icons can be displayed as either small or large. Click the Settings tab and modify the settings that are found under the Visual subtab to control this behavior.

The Host dashboard can notify you if there are any events triggered on the hosts. You can enable audible and/or visual notifications from the Settings>General tab. When you point to a host icon, you can see the Alarm tab, which lists any triggered alarms or grid alerts for that host. The Host dashboard shows alerts for Grid Alarm, Syslog, Threshold, and batchload. If required, you can stop, resume, or edit the selected alert.

Alerts Dashboard

The alerts dashboard shows grid alerts. If required, you can stop, resume and edit the selected alert.

Benchmark Jobs Dashboard

You can monitor a benchmark job to measure job throughput in your cluster. You can graphically view time values (View Benchmark Jobs) for the job such as communication times between RTM and LSF, how long it took to start a job, over time. You can configure benchmark jobs at Console > Grid Management > Benchmark Jobs
Remember:

If the benchmark Job History Days is more than the Job Data Retention Period, you can see the benchmark job details in the reports but job details are not displayed when JobID is clicked from the Grid > Reports > Benchmark Results page.

License Scheduler Dashboard

The License Scheduler dashboard provides detailed data, including graphs, as collected by the configured License Scheduler Collectors. You can configure License Scheduler Collectors at Console > Grid Management > License Schedulers. After your License Scheduler Collector has polled the cluster and retrieved license usage data, you can access a graphical view of the results by selecting Grid > Dashboards > License Scheduler and clicking the Graph icon in the Actions column, for a particular License Scheduler Collector. Use the filters to show data for particular License Scheduler Collectors.
The tabs show the data in a variety of formats, including custom graphs.
  • Summary: Shows summary information of all License Scheduler features. Select an Action icon to quickly go to the applicable tab for that icon, showing detailed data and any custom graphs for that License Scheduler Collector.
  • Feature: Shows detailed information of a selected feature.
  • Clusters: Shows cross cluster information for tokens.
  • Projects: Shows data in a project view, such as how many licenses the project used or demanded.
  • Distribution: Shows LSF license data, such as LSF jobs that checked out the license or non-LSF jobs that checked out the license.
  • Users: Shows user data, such as who checked out the license.
  • Checkouts: Shows data based on the checkout, such as when the license was checked out.
  • Graphs: Shows the License Scheduler data in graph format. Graph management is found in Console > Management > Graphs. Frequency of graph generation is controlled by administrators in Console > Grid Management > License Schedulers.