
Jun 22, 2017

ovs-agent fails to start upon reboot on T7 SPARC servers


Symptoms :-

                 - ovs-agent fails to start after a reboot on T7 servers.
                 - The ovs-config service goes into maintenance state, so the dependent ovs-agent service does not start.
                 - When the ovs-config service (svc:/ovm/ovs-config:default) starts, communication with the SP over the ILOM interconnect is failing.
                 - The ldmd service logs show a problem with the ILOM interconnect. The connection is restored about 1 minute later.
                 - The ILOM interconnect appears to come up only after about one minute.

JUDI-DEV-01# ldm ls-spconfig 
The requested operation could not be performed because the communication 
channel between the LDoms Manager and the system controller is down. 
The ILOM interconnect may be disabled or down (see ilomconfig(1M)). 
JUDI-DEV-01#

JUDI-DEV-01# svcs -xv 
svc:/ovm/ovs-config:default (Oracle VM Server Infrastructure) 
State: maintenance since Mar 17, 2017 10:10:30 AM BST 
Reason: Start method exited with $SMF_EXIT_ERR_FATAL. 
See: http://support.oracle.com/msg/SMF-8000-KS 
See: /var/svc/log/ovm-ovs-config:default.log 
Impact: 2 dependent services are not running: 
svc:/ovm/ovs-agent:default 
svc:/ovm/ovmwatch:default 
JUDI-DEV-01# 

JUDI-DEV-01# tail /var/svc/log/ovm-ovs-config:default.log 
reply = self._do_request('list-%s' % info, args) 
File "/usr/lib/python2.6/site-packages/agent/lib/ldoms/ldmxml.py", line 1845, in _do_request 
_check_reply(reply) 
File "/usr/lib/python2.6/site-packages/agent/lib/ldoms/ldmxml.py", line 2092, in _check_reply 
cmd_replies) 
agent.lib.ldoms.ldmxml.LDMError: failure 
command "list-spconfig": failure (failure: mdstore routine failed to load) 
INFO: The Oracle VM Agent configuration is not valid. 
ERROR: The server does not have a valid configuration to run the Oracle VM Agent. Run the 'ovs-agent-setup configure' command to configure the server. 
[ May 17 11:15:30 Method "start" exited with status 95. ] 
JUDI-DEV-01# 

Workaround :-
The immediate fix when this occurs is to clear the service:
               # svcadm clear ovs-config 
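
Clearing the service only helps once the ILOM interconnect is back up. The following is a minimal sketch of that idea, not a supported tool: it polls `ldm ls-spconfig` until it succeeds and then runs `svcadm clear ovs-config`. The function name `wait_then_clear`, the attempt count, and the delay are hypothetical, and the sketch assumes that `ldm ls-spconfig` exits non-zero while the interconnect is down.

```python
import subprocess
import time


def wait_then_clear(run=subprocess.call, sleep=time.sleep, attempts=20, delay=10):
    """Poll 'ldm ls-spconfig' until it succeeds, then clear ovs-config.

    Returns True if the service was cleared, False if the SP never answered
    or the clear command failed. 'attempts' and 'delay' are illustrative
    values, not tuned recommendations.
    """
    for attempt in range(attempts):
        # Assumption: ldm exits non-zero while the ILOM interconnect is down.
        if run(["ldm", "ls-spconfig"]) == 0:
            # The SP is reachable again, so the start method should now work.
            return run(["svcadm", "clear", "ovs-config"]) == 0
        if attempt < attempts - 1:
            sleep(delay)  # wait before asking the SP again
    return False
```

Run as root, `wait_then_clear()` would wait up to roughly `attempts * delay` seconds. The `run` and `sleep` parameters exist only so the logic can be exercised without touching a real server.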

An interim solution, until a fix is integrated, is to modify the utils.py file so that the list_spconfig() call retries:

1.      Go to the agent setup directory
               # cd /usr/lib/python2.6/site-packages/agent/setup/ldoms 

2.      Save a backup of utils.py
               # cp utils.py utils.py.orig 

3.      Edit utils.py and replace the first occurrence of the string "spconfig = ldm().list_spconfig()" with the retry entry below (this should be line 58 of 256):
               #spconfig = ldm().list_spconfig() 
               spconfig = ldm().list_spconfig(retry=20) 

4.      Remove the compiled file utils.pyc
               # rm utils.pyc

5.      Reboot the server. This will generate a new compiled utils.pyc
               # reboot

6.      Verify the ovs-config service is online (there may be a delay during transition to online)
               # svcs -a | grep ovm
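
Steps 2 to 4 above can also be scripted. The sketch below is one possible way to do it and is not part of the Oracle VM Agent: the function name `patch_utils` is hypothetical, and it replaces the first occurrence of the original line outright instead of leaving a commented-out copy, since the backup in utils.py.orig already preserves the original. It does nothing if the retry entry is already present.

```python
import os
import shutil

OLD = "spconfig = ldm().list_spconfig()"
NEW = "spconfig = ldm().list_spconfig(retry=20)"


def patch_utils(directory):
    """Apply steps 2-4 to utils.py in 'directory'. Returns True if patched."""
    path = os.path.join(directory, "utils.py")
    with open(path) as f:
        text = f.read()
    # Skip files that are already patched or that lack the expected line
    # (for example, a different agent version).
    if NEW in text or OLD not in text:
        return False
    shutil.copy2(path, path + ".orig")      # step 2: keep a backup
    with open(path, "w") as f:
        f.write(text.replace(OLD, NEW, 1))  # step 3: first occurrence only
    pyc = path + "c"
    if os.path.exists(pyc):
        os.remove(pyc)                      # step 4: drop the stale bytecode
    return True
```

On the server this would be called as `patch_utils("/usr/lib/python2.6/site-packages/agent/setup/ldoms")`, followed by the reboot in step 5.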



~Judi~

ovs-agent fails
ovs-config fails
ovmwatch fails
The ILOM interconnect may be disabled or down
