/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.168.122.198: b'befc486be970830b4c4cb35daf2fba8c' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-04-02 08:47:15,607 main.py 130 INFO doctor test starting....... 2019-04-02 08:47:15,607 apex.py 43 INFO Setup Apex installer start...... 2019-04-02 08:47:15,608 base.py 113 INFO Get SSH keys from apex installer...... 2019-04-02 08:47:15,965 apex.py 67 INFO Get overcloud config details from Apex installer...... 2019-04-02 08:47:15,966 base.py 174 INFO Run command=source stackrc; nova list | grep ' overcloud-' in apex installer...... 2019-04-02 08:47:19,153 base.py 183 INFO Output=['| c55461e2-499e-443d-bd55-445050ba4d72 | overcloud-controller-0 | ACTIVE | - | Running | ctlplane=192.30.9.3 |', '| d32c468c-8976-430e-b1b2-295ccc007316 | overcloud-novacompute-0 | ACTIVE | - | Running | ctlplane=192.30.9.7 |', '| 00e26e72-8574-497d-877e-5b907649e723 | overcloud-novacompute-1 | ACTIVE | - | Running | ctlplane=192.30.9.4 |'] command=source stackrc; nova list | grep ' overcloud-' in apex installer 2019-04-02 08:47:19,154 base.py 188 INFO Check command=grep docker /home/stack/deploy_command return in apex installer...... 2019-04-02 08:47:19,225 base.py 191 INFO return 0 2019-04-02 08:47:19,225 apex.py 80 INFO controller_ips:['192.30.9.3'] 2019-04-02 08:47:19,226 apex.py 81 INFO compute_ips:['192.30.9.7', '192.30.9.4'] 2019-04-02 08:47:19,226 apex.py 82 INFO use_containers:True 2019-04-02 08:47:20,107 apex.py 102 INFO Set apply patches start...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.3: b'917b7f007199526c47dc1b7be2582506' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-04-02 08:47:21,397 base.py 218 INFO Command sudo python set_config.py output ['Add event notifier in ceilometer', 'NOTE: add compute.instance.update to event_definitions.yaml', 'NOTE: add maintenance.scheduled to event_definitions.yaml', 'NOTE: add maintenance.host to event_definitions.yaml'] 2019-04-02 08:47:22,684 base.py 218 INFO Command sudo python restart_aodh.py output [] 2019-04-02 08:47:25,363 apex.py 148 INFO Set apply patches start...... 2019-04-02 08:47:25,363 base.py 63 INFO Setup ssh stunnel in apex installer...... 2019-04-02 08:47:25,363 base.py 76 INFO tunnel for port 12346 2019-04-02 08:47:25,367 image.py 48 INFO image create start...... 2019-04-02 08:48:17,764 image.py 68 INFO image create end...... 2019-04-02 08:48:17,765 user.py 70 INFO user create start...... 2019-04-02 08:48:17,970 user.py 86 INFO create project...... 2019-04-02 08:48:18,191 user.py 95 INFO test project 2019-04-02 08:48:18,414 user.py 103 INFO create user...... 2019-04-02 08:48:18,898 user.py 113 INFO test user 2019-04-02 08:48:19,097 user.py 127 INFO role _member_ already created...... 2019-04-02 08:48:19,097 user.py 128 INFO test role 2019-04-02 08:48:20,249 user.py 78 INFO user create end...... 2019-04-02 08:48:20,249 main.py 55 INFO doctor fault management test starting....... 2019-04-02 08:48:21,042 fault_management.py 65 INFO fault management setup...... 2019-04-02 08:48:21,043 user.py 190 INFO quota update start...... 2019-04-02 08:48:21,043 user.py 206 INFO default quota update start...... 2019-04-02 08:48:21,750 user.py 217 INFO user quota update start...... 2019-04-02 08:48:22,057 user.py 230 INFO quota update end...... 2019-04-02 08:48:22,058 network.py 41 INFO network create start....... 2019-04-02 08:48:23,099 network.py 47 INFO network create end....... 2019-04-02 08:48:23,100 network.py 49 INFO subnet create start....... 2019-04-02 08:48:23,711 network.py 58 INFO subnet create end....... 2019-04-02 08:48:23,711 instance.py 51 INFO instance create start...... 2019-04-02 08:48:27,421 instance.py 73 INFO instance create end...... 2019-04-02 08:48:27,421 instance.py 92 INFO wait for vm launch start...... 2019-04-02 08:48:37,057 instance.py 110 INFO wait for vm launch end...... 2019-04-02 08:48:37,057 alarm.py 45 INFO alarm create start...... 2019-04-02 08:48:39,811 alarm.py 81 INFO alarm create end...... 2019-04-02 08:48:39,812 sample.py 85 INFO sample inspector start...... 2019-04-02 08:48:41,119 sample.py 26 INFO sample consumer start...... * Serving Flask app "inspector" (lazy loading) * Serving Flask app "consumer" (lazy loading) * Environment: production WARNING: Do not use the development server in a production environment. * Environment: production Use a production WSGI server instead. WARNING: Do not use the development server in a production environment. * Debug mode: off Use a production WSGI server instead. * Debug mode: off * Running on http://0.0.0.0:12346/ (Press CTRL+C to quit) * Running on http://0.0.0.0:12345/ (Press CTRL+C to quit) 2019-04-02 08:48:42,407 apex.py 86 INFO Get host ip by hostname=overcloud-novacompute-1.opnfvlf.org from Apex installer...... 2019-04-02 08:48:42,408 base.py 174 INFO Run command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer...... 2019-04-02 08:48:45,495 base.py 183 INFO Output=['192.30.9.4'] command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer 2019-04-02 08:48:45,496 fault_management.py 118 INFO Get host info(name:overcloud-novacompute-1.opnfvlf.org, ip:192.30.9.4) which vm(doctor_vm0) launched at 2019-04-02 08:48:45,496 sample.py 30 INFO sample monitor start...... 2019-04-02 08:48:45,497 sample.py 85 INFO Starting Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.4) 2019-04-02 08:50:45,597 fault_management.py 89 INFO fault management start...... 2019-04-02 08:50:45,599 base.py 113 INFO Get SSH keys from apex installer...... 2019-04-02 08:50:45,599 base.py 117 INFO Already have SSH keys from apex installer...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.4: b'179eb6a0bec719488246ea819684e082' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-04-02 08:50:45,671 utils.py 91 INFO Copy /src/doctor-tests/doctor_tests/disable_network.sh -> disable_network.sh 2019-04-02 08:50:45,891 utils.py 72 INFO Executing: bash disable_network.sh > disable_network.log 2>&1 & 2019-04-02 08:50:45,942 utils.py 86 INFO *** SUCCESSFULLY run command bash disable_network.sh > disable_network.log 2>&1 & 2019-04-02 08:50:45,943 fault_management.py 91 INFO fault management end...... 2019-04-02 08:50:47,158 sample.py 98 INFO doctor monitor detected at 1554195047.158093 2019-04-02 08:50:47,158 sample.py 41 INFO sample monitor report error...... 2019-04-02 08:50:47,167 sample.py 238 INFO event posted in sample inspector at 1554195047.1670334 2019-04-02 08:50:47,167 sample.py 239 INFO sample inspector = 2019-04-02 08:50:47,168 sample.py 241 INFO sample inspector received data = b'[{"time": "2019-04-02T08:50:47.158653", "type": "compute.host.down", "details": {"hostname": "overcloud-novacompute-1.opnfvlf.org", "status": "down", "monitor": "monitor_sample", "monitor_event_id": "monitor_sample_event1"}}]' 2019-04-02 08:50:47,218 sample.py 196 INFO doctor compute.instance.update vm() error 1554195047.2187152 2019-04-02 08:50:47,227 sample.py 165 INFO doctor mark host(overcloud-novacompute-1.opnfvlf.org) down at 1554195047.2270136 2019-04-02 08:50:47,304 sample.py 176 INFO doctor mark vm() error at 1554195047.3045259 127.0.0.1 - - [02/Apr/2019 08:50:47] "PUT /events HTTP/1.1" 200 - 2019-04-02 08:50:47,308 sample.py 101 INFO ping timeout, quit monitoring... 2019-04-02 08:50:47,358 sample.py 58 INFO doctor consumer notified at 1554195047.358169 2019-04-02 08:50:47,358 sample.py 61 INFO sample consumer received data = {'severity': 'moderate', 'alarm_name': 'doctor_alarm0', 'current': 'alarm', 'alarm_id': '8fee2e51-c177-4d3a-916c-92cb587d1076', 'reason': 'Event hits the query .', 'reason_data': {'type': 'event', 'event': {'event_type': 'compute.instance.update', 'traits': [['resource_id', 1, '4021f2a6-d0ab-467f-bbaf-0428888bd645'], ['service', 1, 'sample'], ['state', 1, 'error'], ['project_id', 1, 'b81c4a1c7d2c4f99913074103cdbabf1'], ['instance_id', 1, '4021f2a6-d0ab-467f-bbaf-0428888bd645'], ['tenant_id', 1, 'b81c4a1c7d2c4f99913074103cdbabf1']], 'message_signature': '96cb642c79f652e11f0258622adf17d0622bccc32b97b3efca3870e215b6d0a9', 'raw': {}, 'generated': '2019-04-02T08:50:47.174899', 'message_id': '40dea7f8-632a-4639-b559-128b18a77211'}}, 'previous': 'insufficient data'} 127.0.0.1 - - [02/Apr/2019 08:50:47] "POST /failure HTTP/1.1" 200 - 2019-04-02 08:51:16,013 fault_management.py 185 INFO doctor fault management notification_time=0.20007610321044922 2019-04-02 08:51:16,013 fault_management.py 188 INFO doctor fault management test successfully 2019-04-02 08:51:16,013 fault_management.py 198 INFO run doctor fault management profile....... 2019-04-02 08:51:16,014 base.py 113 INFO Get SSH keys from apex installer...... 2019-04-02 08:51:16,014 base.py 117 INFO Already have SSH keys from apex installer...... 2019-04-02 08:51:19,092 utils.py 91 INFO Copy disable_network.log -> /src/doctor-tests/doctor_tests/disable_network.log 2019-04-02 08:51:19,336 fault_management.py 155 INFO Get the disable_netork.log fromdown_host(host_name:overcloud-novacompute-1.opnfvlf.org, host_ip:192.30.9.4) 2019-04-02 08:51:19,338 profiler_poc.py 97 INFO Total time cost: 383(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |182 |69 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:182 | | | | | | | | found affected:? | | | | | | | set VM error:329 | | | | | | marked host down:251 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:383 2019-04-02 08:51:19,339 fault_management.py 94 INFO fault management cleanup...... 2019-04-02 08:51:19,339 fault_management.py 136 INFO Already get the disable_netork.log from down_host...... 2019-04-02 08:51:21,417 sample.py 91 INFO sample inspector stop...... 2019-04-02 08:51:21,463 sample.py 253 INFO shutdown inspector app server at 1554195081.4629939 127.0.0.1 - - [02/Apr/2019 08:51:21] "POST /events/shutdown HTTP/1.1" 200 - 2019-04-02 08:51:21,465 sample.py 35 INFO sample monitor stop...... 2019-04-02 08:51:21,466 sample.py 108 INFO Stopping Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.4) 2019-04-02 08:51:21,466 sample.py 31 INFO sample consumer stop...... 2019-04-02 08:51:21,471 sample.py 66 INFO shutdown consumer app server at 1554195081.4714696 127.0.0.1 - - [02/Apr/2019 08:51:21] "POST /shutdown HTTP/1.1" 200 - 2019-04-02 08:51:21,474 alarm.py 84 INFO alarm delete start....... 2019-04-02 08:51:22,942 alarm.py 93 INFO alarm delete end....... 2019-04-02 08:51:22,942 instance.py 76 INFO instance delete start....... 2019-04-02 08:51:46,662 instance.py 89 INFO instance delete end....... 2019-04-02 08:51:46,662 network.py 61 INFO subnet delete start....... 2019-04-02 08:51:47,160 network.py 64 INFO subnet delete end....... 2019-04-02 08:51:47,160 network.py 66 INFO network delete start....... 2019-04-02 08:51:48,145 network.py 69 INFO network delete end....... 2019-04-02 08:51:48,145 apex.py 166 INFO restore apply patches start...... 2019-04-02 08:51:48,220 image.py 71 INFO image delete start....... 2019-04-02 08:51:48,541 base.py 218 INFO Command sudo python restore_config.py output ['restore', 'restore: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml', 'Bak_file empty, so removing also: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml'] 2019-04-02 08:51:48,782 image.py 76 INFO image delete end....... 2019-04-02 08:51:48,782 user.py 163 INFO user delete start...... 2019-04-02 08:51:49,570 base.py 218 INFO Command sudo python restore_aodh.py output [] 2019-04-02 08:51:49,863 user.py 187 INFO user delete end...... Total time cost: 383(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |182 |69 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:182 | | | | | | | | found affected:? | | | | | | | set VM error:329 | | | | | | marked host down:251 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:383