/usr/lib/python3.6/site-packages/paramiko/client.py:711: UserWarning: Unknown ssh-ed25519 host key for 192.168.122.158: b'c4dc2b25c8a8efad9953efd8358c778d'
  key.get_fingerprint())))
2018-10-06 17:01:57,066 main.py 106 INFO doctor test starting.......
2018-10-06 17:01:57,066 apex.py 28 INFO Setup Apex installer start......
2018-10-06 17:01:57,066 base.py 80 INFO Get SSH keys from apex installer......
2018-10-06 17:01:57,439 apex.py 46 INFO Get controller ips from Apex installer......
2018-10-06 17:01:57,439 base.py 98 INFO Run command=source stackrc; nova list | grep ' overcloud-controller-[0-9] ' | sed -e 's/^.*ctlplane=//' |awk '{print $1}' in apex installer......
2018-10-06 17:02:01,960 base.py 107 INFO Output=['192.30.9.4', '192.30.9.6', '192.30.9.5'] command=source stackrc; nova list | grep ' overcloud-controller-[0-9] ' | sed -e 's/^.*ctlplane=//' |awk '{print $1}' in apex installer
2018-10-06 17:02:01,961 apex.py 53 INFO Get controller_ips:['192.30.9.4', '192.30.9.6', '192.30.9.5'] from Apex installer
2018-10-06 17:02:02,615 apex.py 66 INFO Set apply patches start......
/usr/lib/python3.6/site-packages/paramiko/client.py:711: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.4: b'3141a7e0ae95384ea3b655190c95593d'
  key.get_fingerprint())))
/usr/lib/python3.6/site-packages/paramiko/client.py:711: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.6: b'c13da191076e35ee29149c03a03fbdb1'
  key.get_fingerprint())))
/usr/lib/python3.6/site-packages/paramiko/client.py:711: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.5: b'916b51b945e0f7a7e19b912c9bb4f271'
  key.get_fingerprint())))
2018-10-06 17:02:17,949 base.py 60 INFO Setup ssh stunnel in apex installer......
2018-10-06 17:02:17,963 image.py 48 INFO image create start......
2018-10-06 17:02:25,791 image.py 68 INFO image create end......
2018-10-06 17:02:25,791 user.py 69 INFO user create start......
2018-10-06 17:02:25,886 user.py 85 INFO create project......
2018-10-06 17:02:26,010 user.py 95 INFO test project
2018-10-06 17:02:26,124 user.py 103 INFO create user......
2018-10-06 17:02:26,477 user.py 113 INFO test user
2018-10-06 17:02:26,565 user.py 127 INFO role _member_ already created......
2018-10-06 17:02:26,565 user.py 128 INFO test role
2018-10-06 17:02:27,270 user.py 77 INFO user create end......
2018-10-06 17:02:27,271 main.py 53 INFO doctor fault management test starting.......
2018-10-06 17:02:27,601 fault_management.py 65 INFO fault management setup......
2018-10-06 17:02:27,602 user.py 181 INFO user quota update start......
2018-10-06 17:02:28,440 user.py 197 INFO user quota update end......
2018-10-06 17:02:28,440 network.py 41 INFO network create start.......
2018-10-06 17:02:29,607 network.py 47 INFO network create end.......
2018-10-06 17:02:29,607 network.py 49 INFO subnet create start.......
2018-10-06 17:02:30,609 network.py 58 INFO subnet create end.......
2018-10-06 17:02:30,609 instance.py 51 INFO instance create start......
2018-10-06 17:02:33,491 instance.py 73 INFO instance create end......
2018-10-06 17:02:33,491 instance.py 92 INFO wait for vm launch start......
2018-10-06 17:02:43,420 instance.py 110 INFO wait for vm launch end......
2018-10-06 17:02:43,420 alarm.py 45 INFO alarm create start......
2018-10-06 17:02:45,873 alarm.py 81 INFO alarm create end......
2018-10-06 17:02:45,873 sample.py 72 INFO sample inspector start......
2018-10-06 17:02:47,194 sample.py 26 INFO sample consumer start......
 * Running on http://0.0.0.0:12345/ (Press CTRL+C to quit)
 * Running on http://0.0.0.0:12346/ (Press CTRL+C to quit)
2018-10-06 17:02:48,351 apex.py 58 INFO Get host ip by hostname=overcloud-novacompute-1.opnfvlf.org from Apex installer......
2018-10-06 17:02:48,352 base.py 98 INFO Run command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer......
2018-10-06 17:02:52,800 base.py 107 INFO Output=['192.30.9.7'] command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer
2018-10-06 17:02:52,800 fault_management.py 118 INFO Get host info(name:overcloud-novacompute-1.opnfvlf.org, ip:192.30.9.7) which vm(doctor_vm0) launched at
2018-10-06 17:02:52,801 sample.py 30 INFO sample monitor start......
2018-10-06 17:02:52,801 sample.py 85 INFO Starting Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.7)
2018-10-06 17:04:52,848 fault_management.py 89 INFO fault management start......
2018-10-06 17:04:52,850 base.py 80 INFO Get SSH keys from apex installer......
2018-10-06 17:04:52,851 base.py 84 INFO Already have SSH keys from apex installer......
/usr/lib/python3.6/site-packages/paramiko/client.py:711: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.7: b'1f5871fd8880bdb1a0e96aa5465e0956'
  key.get_fingerprint())))
2018-10-06 17:04:52,915 utils.py 91 INFO Copy /src/doctor-tests/doctor_tests/disable_network.sh -> disable_network.sh
2018-10-06 17:04:53,081 utils.py 72 INFO Executing: bash disable_network.sh > disable_network.log 2>&1 &
2018-10-06 17:04:53,147 utils.py 86 INFO *** SUCCESSFULLY run command bash disable_network.sh > disable_network.log 2>&1 &
2018-10-06 17:04:53,148 fault_management.py 91 INFO fault management end......
2018-10-06 17:04:54,332 sample.py 98 INFO doctor monitor detected at 1538845494.332166
2018-10-06 17:04:54,333 sample.py 41 INFO sample monitor report error......
2018-10-06 17:04:54,341 sample.py 168 INFO event posted in sample inspector at 1538845494.3414428
2018-10-06 17:04:54,341 sample.py 169 INFO sample inspector =
2018-10-06 17:04:54,342 sample.py 171 INFO sample inspector received data = b'[{"time": "2018-10-06T17:04:54.333282", "type": "compute.host.down", "details": {"hostname": "overcloud-novacompute-1.opnfvlf.org", "status": "down", "monitor": "monitor_sample", "monitor_event_id": "monitor_sample_event1"}}]'
2018-10-06 17:04:54,401 sample.py 115 INFO doctor mark host(overcloud-novacompute-1.opnfvlf.org) down at 1538845494.401144
2018-10-06 17:04:54,488 sample.py 126 INFO doctor mark vm() error at 1538845494.4889076
127.0.0.1 - - [06/Oct/2018 17:04:54] "PUT /events HTTP/1.1" 200 -
2018-10-06 17:04:54,492 sample.py 101 INFO ping timeout, quit monitoring...
2018-10-06 17:04:54,631 sample.py 58 INFO doctor consumer notified at 1538845494.6317067
2018-10-06 17:04:54,632 sample.py 61 INFO sample consumer received data = {'severity': 'moderate', 'alarm_name': 'doctor_alarm0', 'current': 'alarm', 'alarm_id': 'c15d7e7c-e25e-42c9-87e7-84937f2d6db6', 'reason': 'Event hits the query .', 'reason_data': {'type': 'event', 'event': {'event_type': 'compute.instance.update', 'traits': [['resource_id', 1, '76e2be1e-0a42-4b54-ac32-f6a682eb6df8'], ['ephemeral_gb', 2, 0], ['instance_type_id', 2, 207], ['user_id', 1, '450c9ed268d14d2b8dfb3a7e5cfcb6d9'], ['service', 1, 'compute'], ['state', 1, 'error'], ['old_state', 1, 'active'], ['project_id', 1, '650bb87463094be9a972fbc37f63fa53'], ['launched_at', 4, '2018-10-06T17:02:34'], ['disk_gb', 2, 1], ['instance_id', 1, '76e2be1e-0a42-4b54-ac32-f6a682eb6df8'], ['host', 1, 'overcloud-controller-2.opnfvlf.org'], ['root_gb', 2, 1], ['tenant_id', 1, '650bb87463094be9a972fbc37f63fa53'], ['memory_mb', 2, 512], ['instance_type', 1, 'm1.tiny'], ['vcpus', 2, 1], ['request_id', 1, 'req-943ecb24-13f6-41de-85d7-e323914c654a']], 'message_signature': 'f93684e4374ad13b5e3f23c69e096c8cbd01ea93dca84b6cab1327a0d1fa793b', 'raw': {}, 'generated': '2018-10-06T17:04:54.473990', 'message_id': 'bda7a80e-f324-45d0-87d5-45237a6f27c1'}}, 'previous': 'insufficient data'}
127.0.0.1 - - [06/Oct/2018 17:04:54] "POST /failure HTTP/1.1" 200 -
2018-10-06 17:05:03,215 fault_management.py 185 INFO doctor fault management test successfully,notification_time=0.29954075813293457
2018-10-06 17:05:03,215 fault_management.py 191 INFO run doctor fault management profile.......
2018-10-06 17:05:03,216 base.py 80 INFO Get SSH keys from apex installer......
2018-10-06 17:05:03,216 base.py 84 INFO Already have SSH keys from apex installer......
2018-10-06 17:05:06,287 utils.py 91 INFO Copy disable_network.log -> /src/doctor-tests/doctor_tests/disable_network.log
2018-10-06 17:05:06,444 fault_management.py 155 INFO Get the disable_netork.log fromdown_host(host_name:overcloud-novacompute-1.opnfvlf.org, host_ip:192.30.9.7)
2018-10-06 17:05:06,447 profiler_poc.py 97 INFO Total time cost: 4601(ms)
==============================================================================>
       |Monitor|Inspector           |Controller|Notifier|Evaluator           |
       |4301   |69                  |?         |?       |?                   |
       |       |      |      |      |          |        |      |      |      |
link down:0    |      |      |      |          |        |      |      |      |
     raw failure:4301 |      |      |          |        |      |      |      |
         found affected:?    |      |          |        |      |      |      |
                  set VM error:4458 |          |        |      |      |      |
                  marked host down:4370        |        |      |      |      |
                             notified VM error:?        |      |      |      |
                                        transformed event:?    |      |      |
                                                 evaluated event:?    |      |
                                                            fired alarm:?    |
                                                                received alarm:4601
2018-10-06 17:05:06,447 fault_management.py 94 INFO fault management cleanup......
2018-10-06 17:05:06,447 fault_management.py 136 INFO Already get the disable_netork.log from down_host......
2018-10-06 17:05:08,567 sample.py 78 INFO sample inspector stop......
2018-10-06 17:05:08,611 sample.py 178 INFO shutdown inspector app server at 1538845508.6119525
127.0.0.1 - - [06/Oct/2018 17:05:08] "POST /events/shutdown HTTP/1.1" 200 -
2018-10-06 17:05:08,614 sample.py 35 INFO sample monitor stop......
2018-10-06 17:05:08,614 sample.py 108 INFO Stopping Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.7)
2018-10-06 17:05:08,615 sample.py 31 INFO sample consumer stop......
2018-10-06 17:05:08,619 sample.py 66 INFO shutdown consumer app server at 1538845508.619906
127.0.0.1 - - [06/Oct/2018 17:05:08] "POST /shutdown HTTP/1.1" 200 -
2018-10-06 17:05:08,622 alarm.py 84 INFO alarm delete start.......
2018-10-06 17:05:09,295 alarm.py 93 INFO alarm delete end.......
2018-10-06 17:05:09,295 instance.py 76 INFO instance delete start.......
2018-10-06 17:06:08,106 instance.py 89 INFO instance delete end.......
2018-10-06 17:06:08,106 network.py 61 INFO subnet delete start.......
2018-10-06 17:06:08,892 network.py 64 INFO subnet delete end.......
2018-10-06 17:06:08,892 network.py 66 INFO network delete start.......
2018-10-06 17:06:09,787 network.py 69 INFO network delete end.......
2018-10-06 17:06:09,787 apex.py 80 INFO restore apply patches start......
2018-10-06 17:06:12,941 image.py 71 INFO image delete start.......
2018-10-06 17:06:14,034 image.py 76 INFO image delete end.......
2018-10-06 17:06:14,034 user.py 156 INFO user delete start......
2018-10-06 17:06:14,563 user.py 178 INFO user delete end......
Total time cost: 4601(ms)
==============================================================================>
       |Monitor|Inspector           |Controller|Notifier|Evaluator           |
       |4301   |69                  |?         |?       |?                   |
       |       |      |      |      |          |        |      |      |      |
link down:0    |      |      |      |          |        |      |      |      |
     raw failure:4301 |      |      |          |        |      |      |      |
         found affected:?    |      |          |        |      |      |      |
                  set VM error:4458 |          |        |      |      |      |
                  marked host down:4370        |        |      |      |      |
                             notified VM error:?        |      |      |      |
                                        transformed event:?    |      |      |
                                                 evaluated event:?    |      |
                                                            fired alarm:?    |
                                                                received alarm:4601