Vmware Logs For Troubleshooting

This document discusses troubleshooting ESXi server 6.x using log files and commands. It outlines 7 important log files to review when troubleshooting, including vmksummary.log for abrupt host reboots and boot.gz for slow boot issues. The document also discusses 7 key ESXi commands and 7 significant configuration files that are important for troubleshooting sessions. The overall agenda is to improve a vSphere administrator's troubleshooting abilities through understanding the ESXi architecture and these important troubleshooting tools.

Uploaded by

Manoj Kumar

SER2965BU

Advanced Troubleshooting of ESXi Server 6.x for vSphere Gurus

VMworld 2017 Content: Not for publication or distribution

Ragavendra P Kumar & Abhilash Kunhappan

#VMworld #SER2965BU
Learner Objectives
• Improve a vSphere admin's troubleshooting steps by

• Understanding the ESXi architecture from a component breakdown perspective

• Learning the “7” most important:
1. Log Files : which log files to review and when
2. ESXi Commands : to isolate and troubleshoot issues
3. Configuration Files : form the backbone of troubleshooting sessions

Every VMworld session is time bound, and we will cover as much as we can.
For all remaining questions, we are always a phone call away.

#SER2965BU CONFIDENTIAL 6
Agenda
1. 7 Important Log Files
2. 7 Key ESXi Commands
3. 7 Significant Configuration Files

ESXi – Architecture
[Figure: ESXi architecture block diagram. Labels recoverable from the diagram: HTML 5 vSphere Web Client, VIM clients, HTML 5 ESXi Host Client, PowerCLI; vCenter Server (vpxd, DRS, HA, FT); user worlds (hostd, vpxa, FDM, esxcli, localcli, shell); VMX process, VMM, guest OS & apps, vCPU, vNIC, vSCSI; sys calls, VMK call, user world support; VMkernel (scheduler, resource, monitor, network stack, storage stack, DVFilter, vmkapi and interrupt handler); server hardware.]

ESXi Log Files

1. vmksummary.log – Abrupt Host Reboots
• Time stamp indicating the ESXi server reboot
• VMkernel always tracks host reboots; the example on this slide (screenshot omitted) shows a host reboot that was not tracked
• Disable the Automated Server Recovery feature from the BIOS to avoid automatic reboots
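
A minimal sketch for pulling reboot-related entries out of vmksummary.log. It assumes the log is at /var/log/vmksummary.log and that boot and shutdown events carry the keyword "bootstop", as commonly seen on ESXi 6.x hosts. `reboot_events` is a hypothetical helper name, not an ESXi command.

```sh
# Hypothetical helper: print boot/reboot events from vmksummary.log.
# Assumption: such events are tagged "bootstop"; adjust the pattern if
# your build logs them differently.
reboot_events() {
    log="${1:-/var/log/vmksummary.log}"
    grep 'bootstop' "$log"
}

# Usage on the host:
#   reboot_events
#   reboot_events /var/log/vmksummary.log | tail -n 5
```

If a known reboot time has no matching entry, that gap is itself the symptom of an abrupt reboot that the slide describes.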

2. boot.gz log - Slow Boot Issues
• Used to troubleshoot both slow boots and boot-process hangs of the ESXi server
• Location : /var/log/boot.gz
• Use DCUI - Alt + F12 to review VMkernel messages during this time
• Enable serial logging using (Shift + o) at the start-up screen for log redirection
– vmkBootVerbose=TRUE vmbLog=TRUE logPort=com1
– no-auto-partition bootUUID=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx debugLogToSerial=1 logPort=com1
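
A short sketch for reading the compressed boot log from the ESXi shell, assuming the /var/log/boot.gz location given above. `boot_log_tail` is a hypothetical helper name, and the default of 40 lines is an arbitrary choice.

```sh
# Hypothetical helper: show the last N lines of the compressed boot log.
# gzip -dc decompresses to stdout and leaves the file untouched.
boot_log_tail() {
    log="${1:-/var/log/boot.gz}"
    lines="${2:-40}"
    gzip -dc "$log" | tail -n "$lines"
}

# Usage on the host:
#   boot_log_tail                        # last 40 lines
#   boot_log_tail /var/log/boot.gz 200   # last 200 lines
```

The last lines written before a hang are usually the quickest pointer to the step where the boot stalled.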

2. boot.gz log - Slow Boot Issues
• ESXi boots slowly when RDM devices are not flagged as perennially reserved
• Validate the esx.conf file with a key word search for “perennialResvd” against the NAA ID of the LUN
• Command to set the perennially reserved flag per LUN
# esxcli storage core device setconfig -d naa.id --perennially-reserved=true
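
When several RDM LUNs need the flag, the command above can be wrapped in a loop. The sketch below is illustrative only: `mark_perennial` and the `DRY_RUN` switch are hypothetical names, and the naa.* values in the usage lines are placeholders. By default it only prints the commands so they can be reviewed first.

```sh
# Hypothetical helper: print (default) or run the setconfig command for
# each NAA ID passed as an argument.
mark_perennial() {
    for dev in "$@"; do
        if [ "${DRY_RUN:-1}" = "1" ]; then
            echo "esxcli storage core device setconfig -d $dev --perennially-reserved=true"
        else
            esxcli storage core device setconfig -d "$dev" --perennially-reserved=true
        fi
    done
}

# Usage on the host (placeholder IDs):
#   mark_perennial naa.aaa naa.bbb              # dry run, prints commands
#   DRY_RUN=0 mark_perennial naa.aaa naa.bbb    # applies the flag
#   esxcli storage core device list -d naa.aaa  # check "Is Perennially Reserved"
```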

3. hostd.log & hostd-probe.log – ESXi Not Responding
• hostd is unresponsive and the host shows as Not Responding in vCenter Server
• A hostd process that starts successfully, or is working normally, should finish reading and populating the “vmInventory.xml” file
• Connecting directly to the ESXi host with the Host Client also validates the hostd status
• Do NOT restart the ESXi server while hostd is in a loop if you need to pursue the cause

3. hostd.log – ESXi Not Responding
• Understand the hostd status and validate that the hostd service restart was successful
• Key word search in hostd.log – “BEGIN SERVICES”
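
The key word search can be done straight from the ESXi shell. This is a minimal sketch that assumes the log is at /var/log/hostd.log; `hostd_service_starts` is a hypothetical helper name.

```sh
# Hypothetical helper: list every "BEGIN SERVICES" marker with its line
# number, so each hostd start can be located in the log.
hostd_service_starts() {
    log="${1:-/var/log/hostd.log}"
    grep -n 'BEGIN SERVICES' "$log"
}

# Usage on the host:
#   hostd_service_starts
#   hostd_service_starts /var/log/hostd.log | tail -n 1   # most recent start
```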

4. vmware.log - Virtual Machine Issues
• VM power-on issues
• Attempt a virtual machine power-on and then review vmware.log
• Identify the cause of the power-on failure

5. vmkernel.log - Storage Issues
• Understanding SCSI sense codes is key for storage issues
• SCSI codes, which are defined by T10.org
http://www.t10.org/lists/2op.htm
• SCSI sense codes
http://www.virten.net/vmware/esxi-scsi-sense-code-decoder/
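
Before decoding individual codes, it helps to see which status and sense combinations occur most often. The sketch below assumes failures are logged in the commonly seen form "H:0x.. D:0x.. P:0x.. Valid sense data: 0x.. 0x.. 0x.." and that the log is at /var/log/vmkernel.log. `sense_summary` is a hypothetical helper name.

```sh
# Hypothetical helper: count distinct host/device/plugin status and sense
# data combinations, most frequent first.
# Assumption: only entries marked "Valid sense data" are matched.
sense_summary() {
    log="${1:-/var/log/vmkernel.log}"
    grep -o 'H:0x[0-9a-f]* D:0x[0-9a-f]* P:0x[0-9a-f]* Valid sense data: 0x[0-9a-f]* 0x[0-9a-f]* 0x[0-9a-f]*' "$log" |
        sort | uniq -c | sort -rn
}

# Usage on the host:
#   sense_summary
```

Each distinct combination in the output can then be looked up with the references listed above.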

6. vobd.log - Network & Storage Issues
• Network messages showing uplink transitions
• Storage disconnect messages
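
A rough filter for both kinds of message. It assumes the log is at /var/log/vobd.log, that uplink transitions contain "linkstate", and that storage events contain "storage.connectivity" or "storage.redundancy"; these patterns are assumptions based on commonly seen event IDs and may need adjusting. `vobd_link_storage_events` is a hypothetical helper name.

```sh
# Hypothetical helper: show uplink transition and storage connectivity
# events from vobd.log.
# Assumption: the event IDs contain the substrings matched below.
vobd_link_storage_events() {
    log="${1:-/var/log/vobd.log}"
    grep -E 'linkstate|storage\.(connectivity|redundancy)' "$log"
}

# Usage on the host:
#   vobd_link_storage_events | tail -n 20
```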

7. fdm.log – High Availability (HA) Issues

To identify:
• the Host ID
• whether the host is a Slave or Master
• the host that was isolated from the cluster

ESXi Commands

1. To Monitor & Configure ESXi Server Settings…
esxcli
01 : san, vvol, vmfs
02 : nic, firewall, vSwitch
03 : boot, module, process
04 : vib, profile, sources
05 : cluster, network, datastore
06 : list, kill, process

# esxcli
• Displaying the ARP and Neighbor Discovery cache
• VMX World ID information
• VMX Cartel ID information
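
The World ID and VMX Cartel ID can be pulled into one line per VM. The sketch below assumes the record layout printed by `esxcli vm process list` (an unindented VM name followed by indented "World ID:" and "VMX Cartel ID:" fields). `vm_ids` is a hypothetical helper name.

```sh
# Hypothetical helper: reduce "esxcli vm process list" output read from
# stdin to tab-separated lines: name, World ID, VMX Cartel ID.
vm_ids() {
    awk -F': ' '
        function emit() {
            if (name != "") print name "\t" world "\t" cartel
            name = world = cartel = ""
        }
        /^[^ \t]/        { emit(); name = $0; next }  # unindented line starts a record
        /World ID:/      { world = $2 }
        /VMX Cartel ID:/ { cartel = $2 }
        END              { emit() }
    '
}

# Usage on the host:
#   esxcli vm process list | vm_ids
#   esxcli network ip neighbor list    # ARP and Neighbor Discovery cache
```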

2. VMkernel Sysinfo Shell Command…
** # vsish
get <opt.1>/<opt.2>/<opt.3>,….

hardware : /bios, /hardwareInfo
net : /pNics/vmnic<>/stats, /portsets/vSwitch<>/stats
storage : /scsifw/devices/.<>/stats, /scsifw/adapters/.<>/stats

** Always ask GSS for assistance when in doubt

# vsish
• Get the VMX Cartel ID
• Get the GroupID

3. To Manage ESXi & VM Configurations…
vim-cmd
01 : getallvms, power.getstate, get.guest
02 : net, storage, firmware
03 : login, connect, registervm
04 : auth, task_list, task_cancel
05 : port_info, service_list, remove_service
06 : vprobes, shutdown, refresh_pnic

# vim-cmd
• Validates that hostd is running by listing the registered VMs
• Lists the tasks running on the ESXi host
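
Listing the registered VMs and then querying each one exercises hostd twice, which makes it a reasonable quick health check. The sketch below assumes `vim-cmd vmsvc/getallvms` prints a header row followed by rows that start with the numeric Vmid, and that `vim-cmd vmsvc/power.getstate` prints the power state on its last line. `vm_power_states` is a hypothetical helper name.

```sh
# Hypothetical helper: print "<vmid> <power state>" for every registered VM.
vm_power_states() {
    vim-cmd vmsvc/getallvms |
        awk 'NR > 1 && $1 ~ /^[0-9]+$/ { print $1 }' |
        while read -r id; do
            # </dev/null keeps the inner command from consuming the ID list
            state=$(vim-cmd vmsvc/power.getstate "$id" </dev/null | tail -n 1)
            echo "$id $state"
        done
}

# Usage on the host:
#   vm_power_states
#   vim-cmd vimsvc/task_list    # tasks currently running on the host
```

If the first command hangs or returns nothing, hostd itself is the likely suspect.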

4. To Manage VMFS Volumes & Virtual Disks…

# vmkfstools <opt.1> name.vmdk <opt.2>

• -e : to check VM snapshot chain consistency, such as the PID & CID in vmdks
• -t10 : to check the inode integrity of a vmdk file at the VMFS layer, not at the GOS level
• -x check, -x repair : to check & correct basic file-level issues of a vmdk file at the VMFS layer
• -v10 : shown on the slide as <opt.2> (verbosity level)

# vmkfstools
• VM snapshot chain consistency check
• Check or repair the virtual disk
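
To run the chain consistency check across a whole VM directory, the descriptor files first need to be separated from the extent files. The sketch below only prints the commands for review. `chain_check_cmds` is a hypothetical helper name, and the list of extent suffixes it skips is an assumption about typical VM directories.

```sh
# Hypothetical helper: print a "vmkfstools -e" command for every descriptor
# .vmdk in a VM directory.
# Assumption: files ending in -flat, -delta, -sesparse or -ctk are extent
# or tracking files and are not checked directly.
chain_check_cmds() {
    dir="$1"
    for f in "$dir"/*.vmdk; do
        [ -e "$f" ] || continue
        case "$f" in
            *-flat.vmdk|*-delta.vmdk|*-sesparse.vmdk|*-ctk.vmdk) continue ;;
        esac
        echo "vmkfstools -e \"$f\""
    done
}

# Usage on the host (the path is a placeholder):
#   chain_check_cmds /vmfs/volumes/<datastore>/<vm>
```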

5. # memstats – to Get Detailed Memory Statistics
• The VM is identified by its VMX cartel ID, found with the command ps | grep -i <vmx name>
• memstats -r vm-stats -s name:<choose the options needed>
• Get the VMX cartel ID
• Virtual machine stats check based on the VMX cartel ID

# memstats
• The VM is identified by its VMX cartel ID, found with the command ps | grep -i <vmx name>
• memstats -r swap-stats -s name:<choose the options needed> -u mb
• Swap statistics on the virtual machine

6. # pktcap-uw - Network Packet Capture Analysis Tool
• pktcap-uw --trace --vmk <vmk port>
• pktcap-uw --trace --uplink <uplink name>
• Output can be redirected to a file as well
• Shows the packet traverse path from each layer and helps identify drops within the ESXi host
• MAC address of the port that is being captured

7. To Monitor Core ESXi Performance…
esxtop
• c : CPU
• m : memory
• n : network (vSwitch, uplinks)
• d : disk (hba level info)
• u : storage (LUN specific)
• v : virtual machine
• p : Power (usage & management)
• x : vSAN

ESXi – 7 Effective Commands for Troubleshooting
1. esxcli : Monitoring or configuring ESXi infra settings
2. vim-cmd : Managing ESXi server & Virtual Machine related configurations
3. vsish : VMkernel Sys Info Shell
4. vmkfstools : for management of VMFS volumes & virtual disks from command line
5. memstats : Memory statistics
6. pktcap-uw : An enhanced Network packet capture analysis tool
7. esxtop : Real time performance monitoring

ESXi Configuration Files

7 Configuration Files
1. /etc/vmware/esx.conf : Holds information about Storage, Network and Hardware
2. /etc/vmware/hostd/vmInventory.xml : Virtual machine inventory list maintained on the ESXi host
3. /etc/vmware/hostd/authorization.xml : vCenter to ESXi host connection authorization
4. /etc/vmware/vpxa/vpxa.cfg : vCenter and ESXi connectivity
5. /etc/vmware/vmkiscsid/iscsid.conf : iSCSI configuration file
6. /etc/vmware/fdm : FDM config file with Clusterconfig, Hostlist and VM metadata
7. /etc/vmware/license.cfg : License configuration file for ESXi host
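
A quick presence check for the seven paths listed above, useful at the start of a session or against an extracted support bundle. The paths are taken verbatim from the list. `check_config_files` and its optional root-prefix argument are hypothetical.

```sh
# Hypothetical helper: report whether each listed configuration path exists.
# An optional first argument is prepended to every path, for example the
# root of an extracted support bundle.
check_config_files() {
    root="${1:-}"
    for f in \
        /etc/vmware/esx.conf \
        /etc/vmware/hostd/vmInventory.xml \
        /etc/vmware/hostd/authorization.xml \
        /etc/vmware/vpxa/vpxa.cfg \
        /etc/vmware/vmkiscsid/iscsid.conf \
        /etc/vmware/fdm \
        /etc/vmware/license.cfg
    do
        if [ -e "$root$f" ]; then
            echo "present $f"
        else
            echo "missing $f"
        fi
    done
}

# Usage on the host:
#   check_config_files
```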
