/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.168.122.23: b'f4de3964c761b8bd2989a333e7572670' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-30 06:46:19,375 main.py 130 INFO doctor test starting....... 2019-03-30 06:46:19,375 apex.py 43 INFO Setup Apex installer start...... 2019-03-30 06:46:19,375 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-30 06:46:19,819 apex.py 67 INFO Get overcloud config details from Apex installer...... 2019-03-30 06:46:19,820 base.py 174 INFO Run command=source stackrc; nova list | grep ' overcloud-' in apex installer...... 2019-03-30 06:46:23,497 base.py 183 INFO Output=['| 5eacedd5-7ab8-43e7-9b06-930f2d200942 | overcloud-controller-0 | ACTIVE | - | Running | ctlplane=192.30.9.7 |', '| 47e7d236-0996-4d48-b36f-63f3b89f79e8 | overcloud-novacompute-0 | ACTIVE | - | Running | ctlplane=192.30.9.9 |', '| 8f32166d-cffb-496a-96a9-7a53482bfd1c | overcloud-novacompute-1 | ACTIVE | - | Running | ctlplane=192.30.9.4 |'] command=source stackrc; nova list | grep ' overcloud-' in apex installer 2019-03-30 06:46:23,497 base.py 188 INFO Check command=grep docker /home/stack/deploy_command return in apex installer...... 2019-03-30 06:46:23,560 base.py 191 INFO return 0 2019-03-30 06:46:23,561 apex.py 80 INFO controller_ips:['192.30.9.7'] 2019-03-30 06:46:23,561 apex.py 81 INFO compute_ips:['192.30.9.9', '192.30.9.4'] 2019-03-30 06:46:23,562 apex.py 82 INFO use_containers:True 2019-03-30 06:46:24,312 apex.py 102 INFO Set apply patches start...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.7: b'6b36c4ce06f91ad93e47266a1ecd656f' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-30 06:46:25,040 base.py 218 INFO Command sudo python set_config.py output ['Add event notifier in ceilometer', 'NOTE: add compute.instance.update to event_definitions.yaml', 'NOTE: add maintenance.scheduled to event_definitions.yaml', 'NOTE: add maintenance.host to event_definitions.yaml'] 2019-03-30 06:46:26,419 base.py 218 INFO Command sudo python restart_aodh.py output [] 2019-03-30 06:46:29,471 apex.py 148 INFO Set apply patches start...... 2019-03-30 06:46:29,472 base.py 63 INFO Setup ssh stunnel in apex installer...... 2019-03-30 06:46:29,472 base.py 76 INFO tunnel for port 12346 2019-03-30 06:46:29,477 image.py 48 INFO image create start...... 2019-03-30 06:46:38,087 image.py 68 INFO image create end...... 2019-03-30 06:46:38,087 user.py 70 INFO user create start...... 2019-03-30 06:46:38,305 user.py 86 INFO create project...... 2019-03-30 06:46:38,531 user.py 95 INFO test project 2019-03-30 06:46:38,761 user.py 103 INFO create user...... 2019-03-30 06:46:39,243 user.py 113 INFO test user 2019-03-30 06:46:39,360 user.py 127 INFO role _member_ already created...... 2019-03-30 06:46:39,360 user.py 128 INFO test role 2019-03-30 06:46:40,661 user.py 78 INFO user create end...... 2019-03-30 06:46:40,662 main.py 55 INFO doctor fault management test starting....... 2019-03-30 06:46:43,469 fault_management.py 65 INFO fault management setup...... 2019-03-30 06:46:43,469 user.py 190 INFO quota update start...... 2019-03-30 06:46:43,470 user.py 206 INFO default quota update start...... 2019-03-30 06:46:44,175 user.py 217 INFO user quota update start...... 2019-03-30 06:46:44,495 user.py 230 INFO quota update end...... 2019-03-30 06:46:44,496 network.py 41 INFO network create start....... 2019-03-30 06:46:45,538 network.py 47 INFO network create end....... 2019-03-30 06:46:45,538 network.py 49 INFO subnet create start....... 2019-03-30 06:46:46,469 network.py 58 INFO subnet create end....... 2019-03-30 06:46:46,469 instance.py 51 INFO instance create start...... 2019-03-30 06:46:49,928 instance.py 73 INFO instance create end...... 2019-03-30 06:46:49,929 instance.py 92 INFO wait for vm launch start...... 2019-03-30 06:46:59,313 instance.py 110 INFO wait for vm launch end...... 2019-03-30 06:46:59,313 alarm.py 45 INFO alarm create start...... 2019-03-30 06:47:02,282 alarm.py 81 INFO alarm create end...... 2019-03-30 06:47:02,282 sample.py 85 INFO sample inspector start...... 2019-03-30 06:47:03,817 sample.py 26 INFO sample consumer start...... * Serving Flask app "inspector" (lazy loading) * Environment: production WARNING: Do not use the development server in a production environment. Use a production WSGI server instead. * Debug mode: off * Serving Flask app "consumer" (lazy loading) * Environment: production WARNING: Do not use the development server in a production environment. Use a production WSGI server instead. * Debug mode: off * Running on http://0.0.0.0:12345/ (Press CTRL+C to quit) * Running on http://0.0.0.0:12346/ (Press CTRL+C to quit) 2019-03-30 06:47:05,193 apex.py 86 INFO Get host ip by hostname=overcloud-novacompute-1.opnfvlf.org from Apex installer...... 2019-03-30 06:47:05,193 base.py 174 INFO Run command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer...... 2019-03-30 06:47:07,365 base.py 183 INFO Output=['192.30.9.4'] command=source stackrc; nova show overcloud-novacompute-1 | awk '/ ctlplane network /{print $5}' in apex installer 2019-03-30 06:47:07,365 fault_management.py 118 INFO Get host info(name:overcloud-novacompute-1.opnfvlf.org, ip:192.30.9.4) which vm(doctor_vm0) launched at 2019-03-30 06:47:07,366 sample.py 30 INFO sample monitor start...... 2019-03-30 06:47:07,366 sample.py 85 INFO Starting Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.4) 2019-03-30 06:49:07,425 fault_management.py 89 INFO fault management start...... 2019-03-30 06:49:07,426 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-30 06:49:07,427 base.py 117 INFO Already have SSH keys from apex installer...... /usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.4: b'23570b8b3199055438dbdef1b6300092' key.get_name(), hostname, hexlify(key.get_fingerprint()), 2019-03-30 06:49:07,509 utils.py 91 INFO Copy /src/doctor-tests/doctor_tests/disable_network.sh -> disable_network.sh 2019-03-30 06:49:07,759 utils.py 72 INFO Executing: bash disable_network.sh > disable_network.log 2>&1 & 2019-03-30 06:49:07,812 utils.py 86 INFO *** SUCCESSFULLY run command bash disable_network.sh > disable_network.log 2>&1 & 2019-03-30 06:49:07,812 fault_management.py 91 INFO fault management end...... 2019-03-30 06:49:09,042 sample.py 98 INFO doctor monitor detected at 1553928549.0425882 2019-03-30 06:49:09,043 sample.py 41 INFO sample monitor report error...... 2019-03-30 06:49:09,051 sample.py 238 INFO event posted in sample inspector at 1553928549.051486 2019-03-30 06:49:09,051 sample.py 239 INFO sample inspector = 2019-03-30 06:49:09,052 sample.py 241 INFO sample inspector received data = b'[{"time": "2019-03-30T06:49:09.043247", "type": "compute.host.down", "details": {"hostname": "overcloud-novacompute-1.opnfvlf.org", "status": "down", "monitor": "monitor_sample", "monitor_event_id": "monitor_sample_event1"}}]' 2019-03-30 06:49:09,102 sample.py 196 INFO doctor compute.instance.update vm() error 1553928549.1022992 2019-03-30 06:49:09,109 sample.py 165 INFO doctor mark host(overcloud-novacompute-1.opnfvlf.org) down at 1553928549.1091998 2019-03-30 06:49:09,259 sample.py 58 INFO doctor consumer notified at 1553928549.2597175 2019-03-30 06:49:09,260 sample.py 61 INFO sample consumer received data = {'severity': 'moderate', 'alarm_name': 'doctor_alarm0', 'current': 'alarm', 'alarm_id': 'da611e07-95f7-488e-a402-9a81438a08b1', 'reason': 'Event hits the query .', 'reason_data': {'type': 'event', 'event': {'event_type': 'compute.instance.update', 'traits': [['resource_id', 1, '2122d95d-c28d-418b-a136-b0ee26085810'], ['service', 1, 'sample'], ['state', 1, 'error'], ['project_id', 1, 'c542710bbb564caeb3e8d0b2f8d200bf'], ['instance_id', 1, '2122d95d-c28d-418b-a136-b0ee26085810'], ['tenant_id', 1, 'c542710bbb564caeb3e8d0b2f8d200bf']], 'message_signature': 'd0b6be640da455e077ec87ae184c82dfc6e5060987c70013615cccfaef5aad00', 'raw': {}, 'generated': '2019-03-30T06:49:09.065321', 'message_id': 'aed6b43b-c331-472e-84dd-4ba5d9d4852b'}}, 'previous': 'insufficient data'} 127.0.0.1 - - [30/Mar/2019 06:49:09] "POST /failure HTTP/1.1" 200 - 2019-03-30 06:49:09,264 sample.py 176 INFO doctor mark vm() error at 1553928549.2645698 127.0.0.1 - - [30/Mar/2019 06:49:09] "PUT /events HTTP/1.1" 200 - 2019-03-30 06:49:09,267 sample.py 101 INFO ping timeout, quit monitoring... 2019-03-30 06:49:37,851 fault_management.py 185 INFO doctor fault management notification_time=0.21712923049926758 2019-03-30 06:49:37,851 fault_management.py 188 INFO doctor fault management test successfully 2019-03-30 06:49:37,851 fault_management.py 198 INFO run doctor fault management profile....... 2019-03-30 06:49:37,852 base.py 113 INFO Get SSH keys from apex installer...... 2019-03-30 06:49:37,852 base.py 117 INFO Already have SSH keys from apex installer...... 2019-03-30 06:49:40,927 utils.py 91 INFO Copy disable_network.log -> /src/doctor-tests/doctor_tests/disable_network.log 2019-03-30 06:49:41,148 fault_management.py 155 INFO Get the disable_netork.log fromdown_host(host_name:overcloud-novacompute-1.opnfvlf.org, host_ip:192.30.9.4) 2019-03-30 06:49:41,150 profiler_poc.py 97 INFO Total time cost: 454(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |237 |66 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:237 | | | | | | | | found affected:? | | | | | | | set VM error:459 | | | | | | marked host down:303 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:454 2019-03-30 06:49:41,150 fault_management.py 94 INFO fault management cleanup...... 2019-03-30 06:49:41,151 fault_management.py 136 INFO Already get the disable_netork.log from down_host...... 2019-03-30 06:49:43,237 sample.py 91 INFO sample inspector stop...... 2019-03-30 06:49:43,275 sample.py 253 INFO shutdown inspector app server at 1553928583.2755504 127.0.0.1 - - [30/Mar/2019 06:49:43] "POST /events/shutdown HTTP/1.1" 200 - 2019-03-30 06:49:43,277 sample.py 35 INFO sample monitor stop...... 2019-03-30 06:49:43,278 sample.py 108 INFO Stopping Pinger host_name(overcloud-novacompute-1.opnfvlf.org), host_ip(192.30.9.4) 2019-03-30 06:49:43,278 sample.py 31 INFO sample consumer stop...... 2019-03-30 06:49:43,283 sample.py 66 INFO shutdown consumer app server at 1553928583.2838035 127.0.0.1 - - [30/Mar/2019 06:49:43] "POST /shutdown HTTP/1.1" 200 - 2019-03-30 06:49:43,286 alarm.py 84 INFO alarm delete start....... 2019-03-30 06:49:44,909 alarm.py 93 INFO alarm delete end....... 2019-03-30 06:49:44,910 instance.py 76 INFO instance delete start....... 2019-03-30 06:50:12,857 instance.py 89 INFO instance delete end....... 2019-03-30 06:50:12,857 network.py 61 INFO subnet delete start....... 2019-03-30 06:50:13,590 network.py 64 INFO subnet delete end....... 2019-03-30 06:50:13,590 network.py 66 INFO network delete start....... 2019-03-30 06:50:14,241 network.py 69 INFO network delete end....... 2019-03-30 06:50:14,241 apex.py 166 INFO restore apply patches start...... 2019-03-30 06:50:14,314 image.py 71 INFO image delete start....... 2019-03-30 06:50:14,610 image.py 76 INFO image delete end....... 2019-03-30 06:50:14,611 user.py 163 INFO user delete start...... 2019-03-30 06:50:14,636 base.py 218 INFO Command sudo python restore_config.py output ['restore', 'restore: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml', 'Bak_file empty, so removing also: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml'] 2019-03-30 06:50:15,645 base.py 218 INFO Command sudo python restore_aodh.py output [] 2019-03-30 06:50:15,753 user.py 187 INFO user delete end...... Total time cost: 454(ms) ==============================================================================> |Monitor|Inspector |Controller|Notifier|Evaluator | |237 |66 |? |? |? | | | | | | | | | | | link down:0 | | | | | | | | | raw failure:237 | | | | | | | | found affected:? | | | | | | | set VM error:459 | | | | | | marked host down:303 | | | | | notified VM error:? | | | | transformed event:? | | | evaluated event:? | | fired alarm:? | received alarm:454