tripleo-ansible/playbooks/step_fail_unmount.yml
Julia Kreger 91d2c1c3ce CORE-1837 Add tgt stop, fix unmounts and service stops
The step to stop tgtd was missing for controllermgmt nodes hence
causing what was observed in CORE-1837.  This commit adds the
stop_tgt.yml file as in include for controllermgmt nodes.

During the course if this investigation, it was found that the
remounting call was missing from a number of server classes,
which has been corrected, although shifted to a pure unmount
as there appear to be image differences that make a remount
less reliable.

Additionally, the unmount and cinder volume cleanup will call
another include to cause open files to be listed out into the
log for easy troubleshooting.  This resulted in a few missing
services being identified and added.

Change-Id: I4ddc475f925875a4c8e20107d900d629c9a799e4
2014-10-14 20:57:28 -07:00

23 lines
960 B
YAML

# Copyright (c) 2014 Hewlett-Packard Development Company, L.P.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
# implied.
# See the License for the specific language governing permissions and
# limitations under the License.
---
- name: "Collect Open File List"
sudo: yes
shell: lsof -n|grep /mnt
- name: "Collect process list"
command: ps auxf
- name: "Fail"
fail: msg="The ephemeral storage of this system failed to be cleaned up properly and processes or files are still in use. The previous ansible play should have information to help troubleshoot this issue."