Use Troubleshooting to
Investigate a Reported Problem
To troubleshoot
problems with the VPSALES4632 virtual machine, consider evaluating symptoms,
examining time line information and events, and creating metric charts to find
the root cause.
- Locate the object for which the problem was reported. See Search for a Specific Object.
- Review the alerts for the virtual machine to determine if the problem is already identified and recommendations made. See Review Alerts Related to Reported Problems.
If a review of the alerts did
not help you identify the cause of the problem reported for the virtual
machine, use the following tabs:
,
,
and
All
Metrics
to troubleshoot the virtual machine history and current
state.
.
- From the left menu, clickEnvironment>Object Browser, and then clickInventoryand select VPSALES4632 from the tree.The main pane updates to display the objectSummarytab.
- Click theAlertstab, click theSymptomstab, and review the symptoms to determine if one of the symptoms is related to the reported problem.Depending on how your alerts are configured, some symptoms might be triggered but not sufficient to generate an alert.
- Review symptom names to determine if one or more symptoms are related to the reported problem.The Information column provides the triggering condition, trend, and current value. What are the most common symptoms that affect response time? Do you see any symptoms related to CPU or memory use?
- Sort by theCreated Ondate so that you can focus on the time frame in which your customer reported that the problem.
- Click theStatus: Activefilter button to deactivate the filter so that you can review active and inactive symptoms.
It appears the problem is related to CPU or memory use. But you do not know if the problem is with the virtual machine or with the host. - Click thetabs and review the alerts, symptoms, and change events that might help identify common trends that are contributing to the reported problem.
- To determine if other virtual machines had symptoms triggered and alerts generated at the same time as your reported problem, click.Other virtual machine alerts are added to the time line. If you see that multiple virtual machines triggered symptoms in the same time frame, then you can investigate parent objects.
- ClickView Fromand selectHost Systemfrom the Parent list.The alerts and symptoms that are associated with the host on which the virtual machine is deployed are added to the time line. Use the information to determine if a correlation exists between the reported problem and the alerts on the host.
- Click theEvents > Eventstab to view changes in the collected metrics for the problematic virtual machine. Metrics might direct you toward the cause of the reported problem.
- Manipulate theDate Controlsto identify the approximate time when your customer reported the problem.
- Use the Filters to filter on event criticality and status. Select Symptoms if you want to include the filters in your analysis.
- Click anEventto view the details about the event.
- ClickView From, selectHost Systemunder Parents, and repeat the analysis.
Comparing events on the virtual machine and the host, and evaluating those results, indicates that CPU or memory problems are the likely cause of the problem. - If the problem relates to CPU or memory use, clickAll Metricsand create metric charts to identify whether it is CPU, memory, or both.
- If the host is still the focus, begin by working with host metrics.
- In the metric list, double-click theCPU Usage (%)and theMemory Usage (%)metrics to add them to the workspace on the right.
- In the map, click theVPSALES4632object.The metric list now displays the virtual machine metrics.
- In the metric list, double-click theCPU Usage (%)and theMemory Usage (%)metrics to add them to the workspace on the right.
- Review the host and virtual machine charts to see if you can identify a pattern that indicates the cause of the reported problem.
Comparing the four charts shows normal CPU use on both the host and the virtual machine, and normal memory use on the virtual machine. However, memory use on the host is consistently elevated three days before the reported problem on VPSALES4632.
The host memory is
consistently elevated, which impacts virtual machine response time. The number
of running virtual machines is well within the supported number. The cause
might be many intensive process applications on the virtual machines. Move some
of the virtual machines to other hosts, distribute the workload, or power off
idle virtual machines.
- In this example, useVMware Aria Operationsto power off virtual machines on the host so that you can improve performance in the running virtual machines. See Run Actions.
- If you want to use the combination of charts that you created on theAll Metricstab again, clickGenerate Dashboard.