81 Commits

Author SHA1 Message Date
Clark Boylan
681e24d27d Treat unparsed inventory as a failure
We don't want to run ansible if we don't get a complete inventory from
our clouds. The reason for this is we cannot be sure that the ordering
of git servers, gerrit, and zuul or our serialized updates of afs
servers will work correctly if we have an incomplete inventory.

Instead we just want ansible to fail and try again in the future (we can
then debug why our clouds are not working).

From the ansible docs for any_unparsed_is_failed:

  If 'true', it is a fatal error when any given inventory source
  cannot be successfully parsed by any available inventory plugin;
  otherwise, this situation only attracts a warning.

Additionally we tell openstack inventory plugin to report failures
rather than empty inventory so that the unparsed failures happen.

Change-Id: I9025776af4316fbdd2c910566883eb3a2530852a
2018-09-12 15:09:21 -07:00
James E. Blair
09d10bdbdc Install ansible 2.7.0rc1 on bridge
In order to use handlers with the free strategy, we need this
fix: 5f953d1129

Change-Id: Ibbbb65e2eaa34220f35f3621b4a7fd46c0f6687f
2018-09-07 13:31:09 -07:00
Ian Wienand
bb568ddde8 Up profiled tasks limit
Some of our summaries need to display more than 20 tasks to show
complete information.  Up to 50, which should be enough for anyone.

Change-Id: I3ae3bb714ea7f5fb094f85c33c19ea3c8a81f6c3
2018-09-06 07:28:26 +10:00
Ian Wienand
3657cacfca Add logrotate role and rotate ansible log files
Add a logrotate role that allows basic configuration of a logrotate
configuration for a specific log-file.

Use this role in the ansible-cron and install-ansible roles to ensure
the log output they are generating is rotated.

This role is not intended to manage the logrotate package (mostly to
avoid the overhead of frequently checking package state when this is
expected to be called for multiple configuration files on a server).
We add it as a base package to our servers.

Tests are added for testinfra.

Change-Id: I90f59c3e42c1135d6be120de38e942ece608b761
2018-09-05 09:15:46 +10:00
Paul Belanger
ad0e8bd241 Add profile_tasks and timer callbacks for ansible
These add extra datetime information to tasks, in an effort to help
profile ansible. More info at:

  https://docs.ansible.com/ansible/2.5/plugins/callback/profile_tasks.html
  https://docs.ansible.com/ansible/2.5/plugins/callback/timer.html

Change-Id: Iebd40bfe06484ec1c6f938570eb6cb60e532fb9f
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2018-08-28 18:30:31 -04:00
Zuul
18bcfeb5c8 Merge "Don't create "admin" group" 2018-08-24 19:57:29 +00:00
Ian Wienand
114953cbff Don't create "admin" group
According to the Ubuntu 12.04 release notes, up until Ubuntu 11.10
admin access was granted via the "admin" unix group, but was changed
to the "sudo" group to be more consistent with Debian et al.

Remove the now unnecessary group

Modify the install-ansible role to set some directory ownership to
root:root; there didn't seem to be any reason to use admin here.

This means the "users" role is no longer required in the bridge.yaml,
as it is run from the base playbook anyway.

Change-Id: I6a7fdd460fb472f0d3468eb080aebbb010931e11
2018-08-24 08:50:22 -07:00
Ian Wienand
4402201d56 roles: Add README.rst and lint
These role docs aren't exactly War and Peace, but I think longer term
as we fiddle about making things generic or not and moving them
around, we'll be better off having kept ourselves to writing
*something*.

Add terse README.rst files for all existing roles, and add simple
linter check to ensure new roles get them too.

Change-Id: Ibc836310fb8a45e12c2e31f112d92509ac350413
2018-08-23 21:34:42 +10:00
Zuul
45930d6cb4 Merge "Have ansible retry ssh connections" 2018-08-20 20:41:03 +00:00
Monty Taylor
f6533076f7
Have ansible retry ssh connections
In zuul's ansible config we add retries=3 to deal with transient issues.
Do the same thing for our production runs.

Change-Id: Ide53bae34e5e622de1fd4741706752e8728da20e
2018-08-20 13:18:21 -05:00
Monty Taylor
03f52d0bc8
Move ansible log into directory
file: state=touch returns changed every time. Instead, put the log files
into a /var/log/ansible directory.

Change-Id: I086d803f0e532b9da41cb01d4e7d2ed66245dfc1
2018-08-17 12:57:20 -05:00
Monty Taylor
03d09b777e
Use openstack and groups inventory from system-config
Rather than copying these out of system-config inside of
install-ansible, just point the ansible.cfg to them in the system-config
location. This way as changes come in that have group updates we don't
have to first apply them to the system.

Change-Id: I1cefd7848b7f3f1adc8fbfa080eb9831124a297b
2018-08-17 11:47:13 -05:00
Monty Taylor
1a8c2f66da
Move /opt/system-config/production to /opt/system-config
The production directory is a relic from the puppet environment concept,
which we do not use. Remove it.

The puppet apply tests run puppet locally, where the production
environment is still needed, so don't update the paths in the
tools/prep-apply.sh.

Depends-On: https://review.openstack.org/592946
Change-Id: I82572cc616e3c994eab38b0de8c3c72cb5ec5413
2018-08-17 09:41:02 -05:00
Monty Taylor
c4ae268492
Turn on fact caching
We do this for zuul jobs already, so let's do it for our production
runs.

Shift the inventory cache location down a directory so that launch-node
can invalidate the inventory cache.

Change-Id: I52b1c48d091c07e4205c1a7233448925ca26d8d3
2018-08-17 05:44:37 -05:00
Zuul
f3036203c3 Merge "Remove base.yaml things from openstack_project::server" 2018-08-17 10:43:53 +00:00
Monty Taylor
bab6fcad3c
Remove base.yaml things from openstack_project::server
Now that we've got base server stuff rewritten in ansible, remove the
old puppet versions.

Depends-On: https://review.openstack.org/588326
Change-Id: I5c82fe6fd25b9ddaa77747db377ffa7e8bf23c7b
2018-08-16 17:25:10 -05:00
Zuul
1463a78fc0 Merge "Make a firehose group with firehose01 in it" 2018-08-16 21:20:46 +00:00
Monty Taylor
f78f871afe
Make a firehose group with firehose01 in it
We want to configure firehose logically as the firehose service, but the
host that is in the group is called firehose01.openstack.org. Make a
group and put the config variables for firehose into it.

Change-Id: I17c8e8a72f41c5e2730af81f70cef81dd3ed7bca
2018-08-16 15:11:20 -05:00
Zuul
40fec4c885 Merge "Use jinja match filter instead of regex_match" 2018-08-16 20:10:02 +00:00
Monty Taylor
3e139891be
Use jinja match filter instead of regex_match
regex_match seems to either not work or not exist or something. match,
otoh, works. Additionally, we get this:

  [DEPRECATION WARNING]: Using tests as filters is deprecated. Instead
   of using `result|match` use `result is match`. This feature will
   be removed in version 2.9.

when using the | syntax, so obey the warning and switch to is.

Change-Id: Ie201241a11c08b9fed58c0e1790e8187ee4cf474
2018-08-16 14:26:17 -05:00
Zuul
3b0dc9df7c Merge "Stop running puppet from puppetmaster" 2018-08-16 16:40:29 +00:00
Zuul
6c6275a4af Merge "Add exim config for firehose and storyboard" 2018-08-16 16:40:27 +00:00
Zuul
c5559d3636 Merge "Add ssh pipelining config for ansible" 2018-08-15 22:58:32 +00:00
Zuul
80f0fcc785 Merge "Remove bridge from disabled and add puppet group" 2018-08-15 22:58:31 +00:00
Monty Taylor
33c0756ff8
Stop running puppet from puppetmaster
Bridge can run puppet on the remote hosts. Stop running on puppetmaster
so that we can run from bridge. Put it in the disabled group so that we
don't try to run puppet on it from bridge.

Change-Id: Ibcfa7e902c07c55e3a84f8232a11792c5f7d80e9
2018-08-15 15:11:48 -05:00
Monty Taylor
0d1f235fce
Add exim config for firehose and storyboard
In order to get puppet out of the business of mucking with exim and
fighting ansible, finish moving the config to ansible.

This introduces a storyboard group that we can use to apply the exim
config across both servers. It also splits the base playbook so that we
can avoid running exim on the backup servers. And we set
purge_apt_sources the same as was set in puppet. We should probably
remove it though, since none of us have any clue why it's here.

Change-Id: I43ee891a9c1beead7f97808208829b01a0a7ced6
2018-08-15 15:11:48 -05:00
Monty Taylor
4c400cfa7d
Add ssh pipelining config for ansible
Now that we're running more than just "puppet apply", reconnecting
starts to add up. Turn on pipelining.

Change-Id: If629485a0e602f1a906fef0cabd73154243d7e3d
2018-08-15 15:11:43 -05:00
Monty Taylor
245609bc95
Remove bridge from disabled and add puppet group
Instead of just having bridge be disabled, make a puppet group that it's
not a part of and switch the remote_puppet_else playbook to use that.

Change-Id: Ifb96ce483fc5675d095723bda70242a425bdc619
2018-08-15 08:43:23 -05:00
Monty Taylor
102369320f Add system-config to roles path
This is a setup for the next patch, to allow us to roll the change out.

Update the roles path to point to the system-config roles dir.

Change-Id: I6bcf36beba8e65c9dd8ddf9f4a99d0308f42c565
2018-08-14 19:38:07 +00:00
Monty Taylor
41989b59cf
Fix some little ansible issues
The list of allowed hosts is comma separated, not colon separated.

Set exclusive: yes to ensure this is the *only* authorized key.

The zuul-executor group is the group for ze hosts. It's not a second
zuul-scheduler group.

Change-Id: I214482ce8931e697ada497048fcf12fa492b98b7
2018-08-03 09:05:13 -05:00
Monty Taylor
60fecd508d
Install and configure ansible on bridge
There is a shared caching infrastructure in ansible now for inventory
and fact plugins. It needs to be configured so that our inventory access
isn't slow as dirt.

Unfortunately the copy of openstack.py in 2.6 is busted WRT to caching
because the internal API changed ... and we didn't have any test jobs
set up for it. This also includes a fixed copy of the plugin and
installs it into the a plugin dir.

Change-Id: Ie92e5d7eac4b7e4060a4e07cb29c5a6f2a16ae18
2018-08-03 09:05:07 -05:00