/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.168.122.66: b'a0e685cc629ca5b00ce2e8f3d57a0580'
  key.get_name(), hostname, hexlify(key.get_fingerprint()),
2019-03-31 15:28:16,900 main.py 130 INFO doctor test starting.......
2019-03-31 15:28:16,900 apex.py 43 INFO Setup Apex installer start......
2019-03-31 15:28:16,900 base.py 113 INFO Get SSH keys from apex installer......
2019-03-31 15:28:17,342 apex.py 67 INFO Get overcloud config details from Apex installer......
2019-03-31 15:28:17,342 base.py 174 INFO Run command=source stackrc; nova list | grep ' overcloud-' in apex installer......
2019-03-31 15:28:21,432 base.py 183 INFO Output=['| 9cbb8097-ace8-4314-bd78-345401380298 | overcloud-controller-0 | ACTIVE | - | Running | ctlplane=192.30.9.4 |', '| 905d12ad-196e-4161-b729-6a872b1d4167 | overcloud-novacompute-0 | ACTIVE | - | Running | ctlplane=192.30.9.6 |', '| f556e8ab-f9cc-4e16-b1ca-16062543eb90 | overcloud-novacompute-1 | ACTIVE | - | Running | ctlplane=192.30.9.8 |'] command=source stackrc; nova list | grep ' overcloud-' in apex installer
2019-03-31 15:28:21,433 base.py 188 INFO Check command=grep docker /home/stack/deploy_command return in apex installer......
2019-03-31 15:28:21,511 base.py 191 INFO return 0
2019-03-31 15:28:21,512 apex.py 80 INFO controller_ips:['192.30.9.4']
2019-03-31 15:28:21,512 apex.py 81 INFO compute_ips:['192.30.9.6', '192.30.9.8']
2019-03-31 15:28:21,513 apex.py 82 INFO use_containers:True
2019-03-31 15:28:22,289 apex.py 102 INFO Set apply patches start......
/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.4: b'f5920817332ef8451513b98cb6677d68'
  key.get_name(), hostname, hexlify(key.get_fingerprint()),
2019-03-31 15:28:22,979 base.py 218 INFO Command sudo python set_config.py output ['Add event notifier in ceilometer', 'NOTE: add compute.instance.update to event_definitions.yaml', 'NOTE: add maintenance.scheduled to event_definitions.yaml', 'NOTE: add maintenance.host to event_definitions.yaml']
2019-03-31 15:28:24,295 base.py 218 INFO Command sudo python restart_aodh.py output []
2019-03-31 15:28:27,450 apex.py 148 INFO Set apply patches start......
2019-03-31 15:28:27,451 base.py 63 INFO Setup ssh stunnel in apex installer......
2019-03-31 15:28:27,451 base.py 76 INFO tunnel for port 12346
2019-03-31 15:28:27,456 image.py 48 INFO image create start......
2019-03-31 15:28:34,714 image.py 68 INFO image create end......
2019-03-31 15:28:34,714 user.py 70 INFO user create start......
2019-03-31 15:28:34,932 user.py 86 INFO create project......
2019-03-31 15:28:35,164 user.py 95 INFO test project
2019-03-31 15:28:35,392 user.py 103 INFO create user......
2019-03-31 15:28:35,942 user.py 113 INFO test user
2019-03-31 15:28:36,147 user.py 127 INFO role _member_ already created......
2019-03-31 15:28:36,147 user.py 128 INFO test role
2019-03-31 15:28:37,147 user.py 78 INFO user create end......
2019-03-31 15:28:37,147 main.py 55 INFO doctor fault management test starting.......
2019-03-31 15:28:38,042 fault_management.py 65 INFO fault management setup......
2019-03-31 15:28:38,043 user.py 190 INFO quota update start......
2019-03-31 15:28:38,043 user.py 206 INFO default quota update start......
2019-03-31 15:28:38,736 user.py 217 INFO user quota update start......
2019-03-31 15:28:39,147 user.py 230 INFO quota update end......
2019-03-31 15:28:39,148 network.py 41 INFO network create start.......
2019-03-31 15:28:40,184 network.py 47 INFO network create end.......
2019-03-31 15:28:40,184 network.py 49 INFO subnet create start.......
2019-03-31 15:28:40,839 network.py 58 INFO subnet create end.......
2019-03-31 15:28:40,839 instance.py 51 INFO instance create start......
2019-03-31 15:28:43,779 instance.py 73 INFO instance create end......
2019-03-31 15:28:43,779 instance.py 92 INFO wait for vm launch start......
2019-03-31 15:28:53,146 instance.py 110 INFO wait for vm launch end......
2019-03-31 15:28:53,147 alarm.py 45 INFO alarm create start......
2019-03-31 15:28:56,442 alarm.py 81 INFO alarm create end......
2019-03-31 15:28:56,442 sample.py 85 INFO sample inspector start......
2019-03-31 15:28:57,486 sample.py 26 INFO sample consumer start......
 * Serving Flask app "consumer" (lazy loading)
 * Environment: production
   WARNING: Do not use the development server in a production environment.
   Use a production WSGI server instead.
 * Debug mode: off
 * Serving Flask app "inspector" (lazy loading)
 * Environment: production
   WARNING: Do not use the development server in a production environment.
   Use a production WSGI server instead.
 * Debug mode: off
 * Running on http://0.0.0.0:12346/ (Press CTRL+C to quit)
 * Running on http://0.0.0.0:12345/ (Press CTRL+C to quit)
2019-03-31 15:28:58,877 apex.py 86 INFO Get host ip by hostname=overcloud-novacompute-0.opnfvlf.org from Apex installer......
2019-03-31 15:28:58,878 base.py 174 INFO Run command=source stackrc; nova show overcloud-novacompute-0 | awk '/ ctlplane network /{print $5}' in apex installer......
2019-03-31 15:29:01,330 base.py 183 INFO Output=['192.30.9.6'] command=source stackrc; nova show overcloud-novacompute-0 | awk '/ ctlplane network /{print $5}' in apex installer
2019-03-31 15:29:01,330 fault_management.py 118 INFO Get host info(name:overcloud-novacompute-0.opnfvlf.org, ip:192.30.9.6) which vm(doctor_vm0) launched at
2019-03-31 15:29:01,330 sample.py 30 INFO sample monitor start......
2019-03-31 15:31:01,331 sample.py 85 INFO Starting Pinger host_name(overcloud-novacompute-0.opnfvlf.org), host_ip(192.30.9.6)
2019-03-31 15:31:01,332 fault_management.py 89 INFO fault management start......
2019-03-31 15:31:01,335 base.py 113 INFO Get SSH keys from apex installer......
2019-03-31 15:31:01,335 base.py 117 INFO Already have SSH keys from apex installer......
/usr/lib/python3.6/site-packages/paramiko/client.py:779: UserWarning: Unknown ssh-ed25519 host key for 192.30.9.6: b'3f3395b778dfe711fde6414af751d033'
  key.get_name(), hostname, hexlify(key.get_fingerprint()),
2019-03-31 15:31:01,408 utils.py 91 INFO Copy /src/doctor-tests/doctor_tests/disable_network.sh -> disable_network.sh
2019-03-31 15:31:01,629 utils.py 72 INFO Executing: bash disable_network.sh > disable_network.log 2>&1 &
2019-03-31 15:31:01,680 utils.py 86 INFO *** SUCCESSFULLY run command bash disable_network.sh > disable_network.log 2>&1 &
2019-03-31 15:31:01,680 fault_management.py 91 INFO fault management end......
2019-03-31 15:31:02,914 sample.py 98 INFO doctor monitor detected at 1554046262.9146457
2019-03-31 15:31:02,915 sample.py 41 INFO sample monitor report error......
2019-03-31 15:31:02,924 sample.py 238 INFO event posted in sample inspector at 1554046262.924192
2019-03-31 15:31:02,924 sample.py 239 INFO sample inspector =
2019-03-31 15:31:02,925 sample.py 241 INFO sample inspector received data = b'[{"time": "2019-03-31T15:31:02.915536", "type": "compute.host.down", "details": {"hostname": "overcloud-novacompute-0.opnfvlf.org", "status": "down", "monitor": "monitor_sample", "monitor_event_id": "monitor_sample_event1"}}]'
2019-03-31 15:31:02,976 sample.py 165 INFO doctor mark host(overcloud-novacompute-0.opnfvlf.org) down at 1554046262.9763155
2019-03-31 15:31:02,977 sample.py 196 INFO doctor compute.instance.update vm() error 1554046262.977892
2019-03-31 15:31:03,081 sample.py 176 INFO doctor mark vm() error at 1554046263.081373
127.0.0.1 - - [31/Mar/2019 15:31:03] "PUT /events HTTP/1.1" 200 -
2019-03-31 15:31:03,084 sample.py 101 INFO ping timeout, quit monitoring...
2019-03-31 15:31:03,101 sample.py 58 INFO doctor consumer notified at 1554046263.1018696
2019-03-31 15:31:03,102 sample.py 61 INFO sample consumer received data = {'severity': 'moderate', 'alarm_name': 'doctor_alarm0', 'current': 'alarm', 'alarm_id': '6c95fc38-7b72-40d4-94e3-429c1849f5be', 'reason': 'Event hits the query .', 'reason_data': {'type': 'event', 'event': {'event_type': 'compute.instance.update', 'traits': [['resource_id', 1, '70d77b70-2c21-4156-8f22-53f840f0135a'], ['service', 1, 'sample'], ['state', 1, 'error'], ['project_id', 1, '652d392ef9c844638d25239c43a926b4'], ['instance_id', 1, '70d77b70-2c21-4156-8f22-53f840f0135a'], ['tenant_id', 1, '652d392ef9c844638d25239c43a926b4']], 'message_signature': '3d0d7cae08e5650a42328961d18c261a30c6cbe2324c9579d5c96f47567602c9', 'raw': {}, 'generated': '2019-03-31T15:31:02.938538', 'message_id': '06c822a7-a65a-4fd2-8e8e-031c74ed92f0'}}, 'previous': 'insufficient data'}
127.0.0.1 - - [31/Mar/2019 15:31:03] "POST /failure HTTP/1.1" 200 -
2019-03-31 15:31:31,918 fault_management.py 185 INFO doctor fault management notification_time=0.1872239112854004
2019-03-31 15:31:31,918 fault_management.py 188 INFO doctor fault management test successfully
2019-03-31 15:31:31,919 fault_management.py 198 INFO run doctor fault management profile.......
2019-03-31 15:31:31,919 base.py 113 INFO Get SSH keys from apex installer......
2019-03-31 15:31:31,919 base.py 117 INFO Already have SSH keys from apex installer......
2019-03-31 15:31:32,989 utils.py 91 INFO Copy disable_network.log -> /src/doctor-tests/doctor_tests/disable_network.log
2019-03-31 15:31:33,202 fault_management.py 155 INFO Get the disable_netork.log fromdown_host(host_name:overcloud-novacompute-0.opnfvlf.org, host_ip:192.30.9.6)
2019-03-31 15:31:33,203 profiler_poc.py 97 INFO Total time cost: 385(ms)
==============================================================================>
       |Monitor|Inspector           |Controller|Notifier|Evaluator           |
       |198    |61                  |?         |?       |?                   |
       |       |      |      |      |          |        |      |      |      |
link down:0    |      |      |      |          |        |      |      |      |
     raw failure:198  |      |      |          |        |      |      |      |
            found affected:? |      |          |        |      |      |      |
                   set VM error:365 |          |        |      |      |      |
                          marked host down:259 |        |      |      |      |
                                    notified VM error:? |      |      |      |
                                           transformed event:? |      |      |
                                                    evaluated event:? |      |
                                                               fired alarm:? |
                                                                  received alarm:385
2019-03-31 15:31:33,204 fault_management.py 94 INFO fault management cleanup......
2019-03-31 15:31:33,204 fault_management.py 136 INFO Already get the disable_netork.log from down_host......
2019-03-31 15:31:35,292 sample.py 91 INFO sample inspector stop......
2019-03-31 15:31:35,332 sample.py 253 INFO shutdown inspector app server at 1554046295.3325722
127.0.0.1 - - [31/Mar/2019 15:31:35] "POST /events/shutdown HTTP/1.1" 200 -
2019-03-31 15:31:35,336 sample.py 35 INFO sample monitor stop......
2019-03-31 15:31:35,336 sample.py 108 INFO Stopping Pinger host_name(overcloud-novacompute-0.opnfvlf.org), host_ip(192.30.9.6)
2019-03-31 15:31:35,336 sample.py 31 INFO sample consumer stop......
2019-03-31 15:31:35,341 sample.py 66 INFO shutdown consumer app server at 1554046295.3414502
127.0.0.1 - - [31/Mar/2019 15:31:35] "POST /shutdown HTTP/1.1" 200 -
2019-03-31 15:31:35,343 alarm.py 84 INFO alarm delete start.......
2019-03-31 15:31:36,865 alarm.py 93 INFO alarm delete end.......
2019-03-31 15:31:36,865 instance.py 76 INFO instance delete start.......
2019-03-31 15:32:03,778 instance.py 89 INFO instance delete end.......
2019-03-31 15:32:03,778 network.py 61 INFO subnet delete start.......
2019-03-31 15:32:04,556 network.py 64 INFO subnet delete end.......
2019-03-31 15:32:04,557 network.py 66 INFO network delete start.......
2019-03-31 15:32:05,495 network.py 69 INFO network delete end.......
2019-03-31 15:32:05,495 apex.py 166 INFO restore apply patches start......
2019-03-31 15:32:05,570 image.py 71 INFO image delete start.......
2019-03-31 15:32:05,877 base.py 218 INFO Command sudo python restore_config.py output ['restore', 'restore: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml', 'Bak_file empty, so removing also: /var/lib/config-data/puppet-generated/ceilometer/etc/ceilometer/event_definitions.yaml']
2019-03-31 15:32:06,114 image.py 76 INFO image delete end.......
2019-03-31 15:32:06,114 user.py 163 INFO user delete start......
2019-03-31 15:32:06,885 base.py 218 INFO Command sudo python restore_aodh.py output []
2019-03-31 15:32:07,279 user.py 187 INFO user delete end......
Total time cost: 385(ms)
==============================================================================>
       |Monitor|Inspector           |Controller|Notifier|Evaluator           |
       |198    |61                  |?         |?       |?                   |
       |       |      |      |      |          |        |      |      |      |
link down:0    |      |      |      |          |        |      |      |      |
     raw failure:198  |      |      |          |        |      |      |      |
            found affected:? |      |          |        |      |      |      |
                   set VM error:365 |          |        |      |      |      |
                          marked host down:259 |        |      |      |      |
                                    notified VM error:? |      |      |      |
                                           transformed event:? |      |      |
                                                    evaluated event:? |      |
                                                               fired alarm:? |
                                                                  received alarm:385