/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.168.122.184: b'b02be0414e3f0691a22370fde1be0721' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-28 11:58:34,044 main.py 130 INFO doctor test starting....... 2019-03-28 11:58:34,045 apex.py 40 INFO Setup Apex installer start...... 2019-03-28 11:58:34,045 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-28 11:58:34,490 apex.py 57 INFO Get overcloud config details from Apex installer...... 2019-03-28 11:58:34,490 base.py 174 INFO Run command=source stackrc; nova list | grep ' overcloud-' in apex installer...... 2019-03-28 11:58:39,021 base.py 183 INFO Output=['| 22032324-77bc-468e-a06f-2a44a9e55e37 | overcloud-controller-0 | ACTIVE | - | Running | ctlplane=192.30.9.5 |', '| a376f2a6-0e4b-49d1-84dc-2dcb3333b75f | overcloud-novacompute-0 | ACTIVE | - | Running | ctlplane=192.30.9.8 |', '| 59411214-c28f-4049-8352-32319fea190e | overcloud-novacompute-1 | ACTIVE | - | Running | ctlplane=192.30.9.9 |'] command=source stackrc; nova list | grep ' overcloud-' in apex installer 2019-03-28 11:58:39,022 base.py 188 INFO Check command=grep docker /home/stack/deploy_command return in apex installer...... 2019-03-28 11:58:39,084 base.py 191 INFO return 0 2019-03-28 11:58:39,085 apex.py 70 INFO controller_ips:['192.30.9.5'] 2019-03-28 11:58:39,085 apex.py 71 INFO compute_ips:['192.30.9.8', '192.30.9.9'] 2019-03-28 11:58:39,085 apex.py 72 INFO use_containers:True 2019-03-28 11:58:39,760 apex.py 92 INFO Set apply patches start...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.5: b'6defbef50e60dc8ea0c3f1849d71707a' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-28 11:58:45,294 apex.py 127 INFO Set apply patches start...... 2019-03-28 11:58:45,295 base.py 63 INFO Setup ssh stunnel in apex installer...... 2019-03-28 11:58:45,295 base.py 76 INFO tunnel for port 12346 2019-03-28 11:58:45,300 image.py 48 INFO image create start...... 2019-03-28 11:58:50,169 image.py 68 INFO image create end...... 2019-03-28 11:58:50,170 user.py 70 INFO user create start...... 2019-03-28 11:58:50,390 user.py 86 INFO create project...... 2019-03-28 11:58:50,527 user.py 95 INFO test project 2019-03-28 11:58:50,659 user.py 103 INFO create user...... 2019-03-28 11:58:51,128 user.py 113 INFO test user 2019-03-28 11:58:51,339 user.py 127 INFO role _member_ already created...... 2019-03-28 11:58:51,339 user.py 128 INFO test role 2019-03-28 11:58:52,515 user.py 78 INFO user create end...... 2019-03-28 11:58:52,516 main.py 55 INFO doctor fault management test starting....... 2019-03-28 11:58:53,341 fault_management.py 65 INFO fault management setup...... 2019-03-28 11:58:53,341 user.py 190 INFO quota update start...... 2019-03-28 11:58:53,341 user.py 206 INFO default quota update start...... 2019-03-28 11:58:54,153 user.py 217 INFO user quota update start...... 2019-03-28 11:58:54,465 user.py 230 INFO quota update end...... 2019-03-28 11:58:54,466 network.py 41 INFO network create start....... 2019-03-28 11:58:55,912 network.py 47 INFO network create end....... 2019-03-28 11:58:55,913 network.py 49 INFO subnet create start....... 2019-03-28 11:58:56,520 network.py 58 INFO subnet create end....... 2019-03-28 11:58:56,521 instance.py 51 INFO instance create start...... 2019-03-28 11:58:59,994 instance.py 73 INFO instance create end...... 2019-03-28 11:58:59,995 instance.py 92 INFO wait for vm launch start...... 2019-03-28 11:59:09,171 instance.py 110 INFO wait for vm launch end...... 2019-03-28 11:59:09,171 alarm.py 45 INFO alarm create start...... 2019-03-28 11:59:12,093 alarm.py 81 INFO alarm create end...... 2019-03-28 11:59:12,093 sample.py 85 INFO sample inspector start...... 2019-03-28 11:59:13,319 sample.py 26 INFO sample consumer start...... * Serving Flask app "inspector" (lazy loading) * Environment: production * Serving Flask app "consumer" (lazy loading) WARNING: Do not use the development server in a production environment. * Environment: production Use a production WSGI server instead. WARNING: Do not use the development server in a production environment. * Debug mode: off Use a production WSGI server instead. * Debug mode: off * Running on http://0.0.0.0:12345/ (Press CTRL+C to quit) * Running on http://0.0.0.0:12346/ (Press CTRL+C to quit) 2019-03-28 11:59:14,530 apex.py 76 INFO Get host ip by hostname=overcloud-novacompute-0.opnfvlf.org from Apex installer...... 2019-03-28 11:59:14,530 base.py 174 INFO Run command=source stackrc; nova show overcloud-novacompute-0 | awk '/ ctlplane network /{print $5}' in apex installer...... 2019-03-28 11:59:17,443 base.py 183 INFO Output=['192.30.9.8'] command=source stackrc; nova show overcloud-novacompute-0 | awk '/ ctlplane network /{print $5}' in apex installer 2019-03-28 11:59:17,444 fault_management.py 118 INFO Get host info(name:overcloud-novacompute-0.opnfvlf.org, ip:192.30.9.8) which vm(doctor_vm0) launched at 2019-03-28 11:59:17,444 sample.py 30 INFO sample monitor start...... 2019-03-28 11:59:17,444 sample.py 85 INFO Starting Pinger host_name(overcloud-novacompute-0.opnfvlf.org), host_ip(192.30.9.8) 2019-03-28 12:01:17,545 fault_management.py 89 INFO fault management start...... 2019-03-28 12:01:17,546 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-28 12:01:17,546 base.py 117 INFO Already have SSH keys from apex installer...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.8: b'7d527a680d2ac7359b64397db2376543' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-28 12:01:17,617 utils.py 91 INFO Copy /src/doctor-tests/doctor_tests/disable_network.sh -> disable_network.sh 2019-03-28 12:01:17,838 utils.py 72 INFO Executing: bash disable_network.sh > disable_network.log 2>&1 & 2019-03-28 12:01:17,891 utils.py 86 INFO *** SUCCESSFULLY run command bash disable_network.sh > disable_network.log 2>&1 & 2019-03-28 12:01:17,892 fault_management.py 91 INFO fault management end...... 2019-03-28 12:01:19,043 sample.py 98 INFO doctor monitor detected at 1553774479.0437381 2019-03-28 12:01:19,044 sample.py 41 INFO sample monitor report error...... 2019-03-28 12:01:19,052 sample.py 238 INFO event posted in sample inspector at 1553774479.0523002 2019-03-28 12:01:19,052 sample.py 239 INFO sample inspector = 2019-03-28 12:01:19,052 sample.py 241 INFO sample inspector received data = b'[{"time": "2019-03-28T12:01:19.044286", "type": "compute.host.down", "details": {"hostname": "overcloud-novacompute-0.opnfvlf.org", "status": "down", "monitor": "monitor_sample", "monitor_event_id": "monitor_sample_event1"}}]' 2019-03-28 12:01:19,097 sample.py 196 INFO doctor compute.instance.update vm() error 1553774479.0973053 2019-03-28 12:01:19,100 sample.py 165 INFO doctor mark host(overcloud-novacompute-0.opnfvlf.org) down at 1553774479.1005652 2019-03-28 12:01:19,180 sample.py 58 INFO doctor consumer notified at 1553774479.180214 2019-03-28 12:01:19,181 sample.py 61 INFO sample consumer received data = {'severity': 'moderate', 'alarm_name': 'doctor_alarm0', 'current': 'alarm', 'alarm_id': '7ac87ba2-cc0e-44c7-8b15-bd4b32b41baa', 'reason': 'Event hits the query .', 'reason_data': {'type': 'event', 'event': {'event_type': 'compute.instance.update', 'traits': [['resource_id', 1, '61531a88-ac4f-40bc-9923-989b3eeb4ea5'], ['service', 1, 'sample'], ['state', 1, 'error'], ['project_id', 1, 'a6baf052f1854f8cac2d7ed1b7866503'], ['instance_id', 1, '61531a88-ac4f-40bc-9923-989b3eeb4ea5'], ['tenant_id', 1, 'a6baf052f1854f8cac2d7ed1b7866503']], 'message_signature': '813c8ca635326b232fd6d2859ebb86e395991fb77746e062fd15cbb73261f4d0', 'raw': {}, 'generated': '2019-03-28T12:01:19.060055', 'message_id': '83111754-4bc0-41f5-8c10-85ac24c8843e'}}, 'previous': 'insufficient data'} 127.0.0.1 - - [28/Mar/2019 12:01:19] "POST /failure HTTP/1.1" 200 - 2019-03-28 12:01:19,198 sample.py 176 INFO doctor mark vm() error at 1553774479.1989107 127.0.0.1 - - [28/Mar/2019 12:01:19] "PUT /events HTTP/1.1" 200 - 2019-03-28 12:01:19,201 sample.py 101 INFO ping timeout, quit monitoring... 2019-03-28 12:01:47,926 fault_management.py 185 INFO doctor fault management notification_time=0.1364758014678955 2019-03-28 12:01:47,926 fault_management.py 188 INFO doctor fault management test successfully 2019-03-28 12:01:47,927 fault_management.py 198 INFO run doctor fault management profile....... 2019-03-28 12:01:47,927 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-28 12:01:47,927 base.py 117 INFO Already have SSH keys from apex installer...... 2019-03-28 12:01:51,002 utils.py 91 INFO Copy disable_network.log -> /src/doctor-tests/doctor_tests/disable_network.log 2019-03-28 12:01:51,219 fault_management.py 155 INFO Get the disable_netork.log fromdown_host(host_name:overcloud-novacompute-0.opnfvlf.org, host_ip:192.30.9.8) 2019-03-28 12:01:51,222 profiler_poc.py 97 INFO Total time cost: 248(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |112 |57 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:112 | | | | | | | | found affected:? | | | | | | | set VM error:267 | | | | | | marked host down:169 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:248 2019-03-28 12:01:51,222 fault_management.py 94 INFO fault management cleanup...... 2019-03-28 12:01:51,222 fault_management.py 136 INFO Already get the disable_netork.log from down_host...... 2019-03-28 12:01:53,320 sample.py 91 INFO sample inspector stop...... 2019-03-28 12:01:53,361 sample.py 253 INFO shutdown inspector app server at 1553774513.3618302 127.0.0.1 - - [28/Mar/2019 12:01:53] "POST /events/shutdown HTTP/1.1" 200 - 2019-03-28 12:01:53,364 sample.py 35 INFO sample monitor stop...... 2019-03-28 12:01:53,364 sample.py 108 INFO Stopping Pinger host_name(overcloud-novacompute-0.opnfvlf.org), host_ip(192.30.9.8) 2019-03-28 12:01:53,365 sample.py 31 INFO sample consumer stop...... 2019-03-28 12:01:53,370 sample.py 66 INFO shutdown consumer app server at 1553774513.3707018 127.0.0.1 - - [28/Mar/2019 12:01:53] "POST /shutdown HTTP/1.1" 200 - 2019-03-28 12:01:53,373 alarm.py 84 INFO alarm delete start....... 2019-03-28 12:01:54,875 alarm.py 93 INFO alarm delete end....... 2019-03-28 12:01:54,875 instance.py 76 INFO instance delete start....... 2019-03-28 12:02:21,126 instance.py 89 INFO instance delete end....... 2019-03-28 12:02:21,126 network.py 61 INFO subnet delete start....... 2019-03-28 12:02:21,616 network.py 64 INFO subnet delete end....... 2019-03-28 12:02:21,616 network.py 66 INFO network delete start....... 2019-03-28 12:02:22,481 network.py 69 INFO network delete end....... 2019-03-28 12:02:22,481 apex.py 145 INFO restore apply patches start...... 2019-03-28 12:02:22,554 image.py 71 INFO image delete start....... 2019-03-28 12:02:23,085 image.py 76 INFO image delete end....... 2019-03-28 12:02:23,085 user.py 163 INFO user delete start...... 2019-03-28 12:02:24,188 user.py 187 INFO user delete end...... Total time cost: 248(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |112 |57 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:112 | | | | | | | | found affected:? | | | | | | | set VM error:267 | | | | | | marked host down:169 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:248