Symptoms :-
- ovs-agent fails to start upon reboot on T7 servers
- The ovs-agent will be in maintenance state.
- When the service ovm-ovs-config starts, the ilom couldn't communicate with the SP
- From ldmd service logs we can see a problem with ilom interconnect. The connection is restore 1 minute later.
- ilom interconnect seems started after one minute
JUDI-DEV-01# ldm ls-spconfig
The requested operation could not be performed because the communication
channel between the LDoms Manager and the system controller is down.
The ILOM interconnect may be disabled or down (see ilomconfig(1M)).
JUDI-DEV-01#
JUDI-DEV-01# svcs -xv
svc:/ovm/ovs-config:default (Oracle VM Server Infrastructure)
State: maintenance since Mar 17, 2017 10:10:30 AM BST
Reason: Start method exited with $SMF_EXIT_ERR_FATAL.
See: http://support.oracle.com/msg/SMF-8000-KS
See: /var/svc/log/ovm-ovs-config:default.log
Impact: 2 dependent services are not running:
svc:/ovm/ovs-agent:default
svc:/ovm/ovmwatch:default
JUDI-DEV-01#
JUDI-DEV-01# tail /var/svc/log/ovm-ovs-config:default.log
reply = self._do_request('list-%s' % info, args)
File "/usr/lib/python2.6/site-packages/agent/lib/ldoms/ldmxml.py", line 1845, in _do_request
_check_reply(reply)
File "/usr/lib/python2.6/site-packages/agent/lib/ldoms/ldmxml.py", line 2092, in _check_reply
cmd_replies)
agent.lib.ldoms.ldmxml.LDMError: failure
command "list-spconfig": failure (failure: mdstore routine failed to load)
INFO: The Oracle VM Agent configuration is not valid.
ERROR: The server does not have a valid configuration to run the Oracle VM Agent. Run the 'ovs-agent-setup configure' command to configure the server.
[ May 17 11:15:30 Method "start" exited with status 95. ]
JUDI-DEV-01#
Workaround :-
Immediate solution when this occurs is to clear the service
# svcadm clear ovs-config
Intermediate solution while awaiting integration is to modify the utils.py file with a retry entry:
1. Goto
cd /usr/lib/python2.6/site-packages/agent/setup/ldoms
2. Save a backup of utils.py
# cp utils.py utils.py.orig
3. Edit utils.py and replace the fist occurance of the string "spconfig = ldm().list_spconfig()" with the retry entry below: (This should be line 58 of 256 )
#spconfig = ldm().list_spconfig()
spconfig = ldm().list_spconfig(retry=20)
4. Remove the compiled file utils.pyc
rm utils.pyc
5. Reboot the server This will generate a new complied utils.pyc
reboot
6. Verify the ovs-config service is online (there may be a delay during transition to online)
svcs -a | grep ovm
~Judi~
ovs-agent fails
ovs-config fails
ovmwatch fails
The ILOM interconnect may be disabled or down
Oracle Sparc iLOM commands
Configure iLOM, Configure iLOM IP address; Configure user account in iLOM
OVM Backup
ovmm
backup oracle vm
install oracle vm manager
No comments:
Post a Comment