A few things have changed and we need to fix them in one go.
Use mirror for installing docker for buildset-registry
While we need to make this more systematic, that's hanging off of the
mirror rework. For now, since we know all of these jobs are Debian
based, just set the mirror location.
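As a rough illustration only (the job and variable names below are assumptions, not the exact ones the role uses), the change amounts to setting a mirror variable on the Debian-based jobs:
  # Hypothetical job vars sketch; the docker install role may use a
  # different variable name for the mirror location.
  - job:
      name: example-buildset-registry-job
      vars:
        docker_mirror: "https://mirror.dfw.rax.opendev.org"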
Replace use of zuul cloner with git clones
You can never be a prophet in your own hometown. This is now broken
because of the git cache rework, so just replace it.
Update libjemalloc library
python:slim is based on buster now, which has libjemalloc2 not
libjemalloc1.
Remove gerrit repo remote for submodules
A recent change to the base jobs to use prepare-workspace-git
broke the gerrit image builds by actually having the origin
remote be /dev/null as intended. This breaks submodules because,
for a few of them where we don't have matching stable branches,
the submodule relative path behavior is actually exactly what
we want.
Since we don't care about the remote otherwise, remove the
origin remote before doing the submodule update --init so that
the submodule will clone the refs from the zuul prepared repo.
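The fix itself is just a couple of git commands run before the submodule init, shown here as a hedged Ansible-style sketch (the real change lives in the gerrit image build, and the checkout path is an assumption):
  - name: Re-init submodules from the Zuul-prepared repo
    # Drop the /dev/null origin first so submodule URLs no longer resolve
    # against it; the chdir path below is hypothetical.
    shell: |
      git remote rm origin
      git submodule update --init
    args:
      chdir: /src/gerrit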
Change-Id: Ieb5b6bc8711fe971ed3445c7c267306ac4616464
Use latest bazel
It seems 0.27 is now too old. This is what happens when I go on vacation
apparently.
Add in a hack to override the bazelversion. We'll remove this once
https://gerrit-review.googlesource.com/c/gerrit/+/237495 lands and
has been merged up.
Change-Id: Ib7a6d33ce8bf8498fd5cd09b25087dc09acb8df4
We had some extra bazel options that don't seem to be necessary
anymore now that we are using upstream bazel options appropriately.
Retry the build a couple of times if it goes south, inside of the
build image. This should allow re-use of the cache the second time,
and if there is a temporary error, it should pick up and move
forward.
Change-Id: I5f304acb21fd3a4d40701fc0414ae0c424c838e5
This introduces two new roles for managing the backup-server and hosts
that we wish to back up.
Firstly the "backup" role runs on hosts we wish to back up. This
generates and configures a separate ssh key for running bup and
installs the appropriate cron job to run the backup daily.
The "backup-server" job runs on the backup server (or, indeed
servers). It creates users for each backup host, accepts the remote
keys mentioned above and initalises bup. It is then ready to receive
backups from the remote hosts.
This eliminates a fairly long-standing requirement for manual setup of
the backup server users and keys; this section is removed from the
documentation.
Testinfra coverage is added.
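For a sense of what the "backup" role installs, here is a hedged sketch (the key path, backup-server hostname and bup invocation are illustrative assumptions):
  - name: Generate a dedicated ssh key for bup
    command: ssh-keygen -t ed25519 -N '' -f /root/.ssh/id_bup_backup
    args:
      creates: /root/.ssh/id_bup_backup
  - name: Install daily bup backup cron job
    cron:
      name: "bup backup"
      special_time: daily
      # the backup server hostname here is a placeholder
      job: "bup index / && bup save -r backup@backup01.example.opendev.org: -n root /"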
Change-Id: I9bf74df351e056791ed817180436617048224d2c
Our goal is upgrading to 3.0. To do that we need to upgrade to 2.15, then
to 2.16, then to 3.0. Build all of the images so that we can do that.
2.16 and 3.0 also use bazel, so just use one copy of the Dockerfile for
all three and let zuul check out the repos to the right versions.
Depends-On: https://review.opendev.org/673147
Depends-On: https://review.opendev.org/672320
Change-Id: I35bd278e0c70c871fa44d005c60a987d1d8e3cdc
Add new IP addresses to inventory for the rebuild, but don't
reactivate it in the haproxy pools yet.
Note this switches the gitea testing to use a host called gitea99 so
that it doesn't conflict with our changes of the production hosts.
Change-Id: I9779e16cca423bcf514dd3a8d9f14e91d43f1ca3
This takes a similar approach to the extant ansible_cron_install_cron
variable to disable the cron job for the cloud launcher when running
under CI.
If your CI job happens to run when the cron job decides to fire,
you end up with a harmless but confusing failed run of the cloud
launcher (that has tried to contact real clouds) in the ARA results.
Use the "disbaled" flag to ensure the cron job doesn't run. Using
"disabled" means we can still check that the job was installed via
testinfra however.
Convert ansible_cron_install_cron to a similar method using the disabled flag,
document the variable in the README and add a test for the run_all.sh
script in crontab too.
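For reference, the pattern looks roughly like this hedged sketch (schedule, script path and variable name are illustrative assumptions):
  - name: Install cloud launcher cron job
    cron:
      name: "cloud launcher"
      hour: "*/1"
      job: "/usr/local/bin/run_cloud_launcher.sh"
      # In CI the entry is installed but disabled, so testinfra can still
      # assert it exists without it ever contacting real clouds.
      disabled: "{{ cloud_launcher_disable_job | default(false) }}"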
Change-Id: If4911a5fa4116130c39b5a9717d610867ada7eb1
Zuul now includes an ansible_python_interpreter hostvar for every
host in its inventory. It defaults to python2. The write-inventory
role, which takes the Zuul inventory and makes an inventory for
the fake bridge server in the gate, passes that through. Because it's
in /etc/ansible/inventory.yaml, it overrides any settings which may
arrive via group vars, but this is the way we set the interpreter
for all the hosts on bridge (we do not do so in the actual inventory
file).
To correct this, tell write-inventory to strip the
ansible_python_interpreter variable when it writes out the new
inventory. This restores the behavior to match what happens on
the real bridge host. One instance of setting the interpreter
for the fake "trusty" host used in base platform tests is moved to
a hostvars file to match the rest of the real hosts.
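Conceptually the call ends up something like this hedged sketch (the parameter name is an assumption about the role's interface):
  - name: Write inventory for the fake bridge
    include_role:
      name: write-inventory
    vars:
      # strip the interpreter setting Zuul injects for every host so the
      # test inventory matches what the real bridge inventory contains
      write_inventory_exclude_hostvars:
        - ansible_python_interpreter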
Change-Id: I60f0acb64e7b90ed8af266f21f2114fd598f4a3c
This adds a periodic job to copy logs to a mirror volume, and export
it via the usual mirror http.
I have precreated the log volume, just as an R/W volume because this is
expected to be very low volume access.
Change-Id: I67870f6d439af2d2a63a5048ef52cecff3e75275
Keytabs are slightly longer than what is being tested; up to 100 bytes
or so. This means the encoded data breaks over lines, which means you
need to be more careful about quoting.
Update the testing to a longer keytab (100 bytes of random data) and
fix up the quoting. Also enable no_logging to avoid putting key
material into the logs.
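The shape of the task under test is roughly as follows (the variable name and destination path are illustrative):
  - name: Install service keytab
    # keep the long base64 blob quoted as a single YAML scalar, and use
    # no_log so the key material never appears in the output
    no_log: true
    copy:
      content: "{{ service_keytab | b64decode }}"
      dest: /etc/service.keytab
      mode: "0400"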
Change-Id: I73c391a2ebd2c962dc9a422f9d44265160210852
This move was prompted by wishing to expose the mirror update logs for
the rsync updates so that debugging problems does not require a root
user (note: not actually done in this change; will be a follow-on).
Rather than start hacking at puppet, the rsync mirror scripts make a
nice delineation point for starting an Ansible-first/Bionic update.
Most magic is included in the scripts, so there is not much more to do
than copy them. The host uses the existing kerberos and openafs roles
and copies the key material into place (to be added before merge).
Note the scripts are removed from the extant puppet so we don't have
two updates happening simultaneously. This will also require a manual
clean to remove the cron jobs as a once-off when merging.
The other part of mirror-update is the reprepro based scripts for the
various debuntu repositories. They are left as future work for now.
Testing is added to ensure dependencies and scripts are all in place.
Change-Id: I525ac18b55f0e11b0a541b51fa97ee5d6512bf70
Donnyd has kindly offered us access to fortnebula's test cloud. This
adds clouds.yaml entries to bridge and nodepool so that we can take
advantage of these resources.
Change-Id: I4ebc261c6f548aca0b3f37dc9b60ffac08029e67
This is an intermediate step to having both kafs and openafs testing
in the gate; this just makes it clear which host is which.
Change-Id: I8cd006227ed47ad5f2c5eec664083477dd7ba397
In a follow-on change (I9bf74df351e056791ed817180436617048224d2c) I
want to use #noqa to ignore an ansible-lint rule on a task; however
empirical testing shows that it doesn't work with 3.5.1, while with
4.1.0 it seems whatever was wrong has been fixed. This change,
therefore, upgrades ansible-lint to 4.1.0.
I've been through the errors ... the comments inline I think justify
what has been turned off. The two legitimate variable space issues I
have rolled into this change; all other hits were false positives as
described.
Change-Id: I7752648aa2d1728749390cf4f38459c1032c0877
In order to have nodepool build images and upload them to control
plane clouds, add them to the clouds.yaml on the nodepool-builder
hosts. Keep them out of the launcher configs by splitting the config
templates. So that we can keep our copies of things to a minimum,
create a group called "control-plane-clouds" and put bridge and nb0*
in it.
There are mentions of clouds in here that we no longer use; a followup
patch will clean those up.
NOTE: Requires shifting the clouds config dict from
host_vars/bridge.openstack.org.yaml to group_vars/control-plane-clouds.yaml
in the secrets on bridge.
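The grouping is conceptually along these lines (hedged sketch; host names and matching syntax are illustrative):
  # groups.yaml style sketch
  groups:
    control-plane-clouds:
      - bridge.openstack.org
      - nb0*.openstack.org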
Needed-By: https://review.opendev.org/640044
Change-Id: Id1161bca8f23129202599dba299c288a6aa29212
This implements mirrors that live in the opendev.org namespace. The
implementation is Ansible native for deployment on a Bionic node.
The hostname prefix remains the same (mirrorXX.region.provider.) but
the groups.yaml splits the opendev.org mirrors into a separate group.
The matches in the puppet group are also updated so as not to run puppet
on the hosts.
The kerberos and openafs client parts do not need any updating and
work on the Bionic host.
The hosts are set up to provision certificates for themselves from
letsencrypt. Note we've added a new handler for mirror nodes to use
that restarts apache on certificate issue/renewal.
The new "mirror" role is a port of the existing puppet mirror.pp. It
installs apache, sets up some modules, makes some symlinks, sets up a
cleanup cron job and installs the apache vhost configuration.
The vhost configuration is also ported from the extant puppet. It is
simplified somewhat; but the biggest change is that we have extracted
the main port 80 configuration into a macro which is applied to both
port 80 and 443; i.e. the host will have SSL support. The other ports
are left alone for now, but can be updated in due course.
Thus we should be able to CNAME the existing mirrors to new nodes, and
any existing http access can continue. We can update our mirror setup
scripts to point to https resources as appropriate.
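The new handler is conceptually just an apache restart keyed to the certificate name, along these lines (handler and certificate names are illustrative):
  # mirror role handlers sketch
  - name: letsencrypt updated mirror01-main-cert
    service:
      name: apache2
      state: restarted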
Change-Id: Iec576d631dd5b02f6b9fb445ee600be060f9cf1e
This is a first step toward making smaller playbooks which can be
run by Zuul in CD.
Zuul should be able to handle missing projects now, so move that
handling out of the puppet_git playbook and into puppet.
Make the base playbook be merely the base roles.
Make service playbooks for each service.
Remove the run-docker job because it's covered by service jobs.
Stop testing that puppet is installed in testinfra. It only works
accidentally, because the non-puppeted hosts happen to be bionic
nodes and we don't install puppet on bionic. Instead, we can now
rely on actually *running* puppet when it's important, such as in the
eavesdrop job. Also remove the installation of puppet on the nodes in
the base job, since it only tests that a synthetic installation of
puppet on nodes we don't use works.
Don't run remote_puppet_git on gitea for now - it's too slow. A
followup patch will rework gitea project creation to not take hours.
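Each service playbook then reduces to something like this hedged sketch (group and role names are illustrative):
  # playbooks/service-gitea.yaml style sketch
  - hosts: gitea
    roles:
      - install-docker
      - gitea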
Change-Id: Ibb78341c2c6be28005cea73542e829d8f7cfab08
This change proposes calling a handler each time a certificate is
created/updated. The handler name is based on the name of the
certificate given in the letsencrypt_certs variable, as described in
the role documentation.
Because Ansible considers calling a handler with no listeners an
error, each letsencrypt user will need to provide a handler.
One simple option illustrated here is just to produce a stamp file.
This can facilitate cross-playbook and even cross-orchestration-tool
communication. For example, puppet or other ansible playbooks can
detect this stamp file and schedule their reloads, etc. then remove
the stamp file. It is conceivable more complex listeners could be
setup via other roles, etc. should the need arise.
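A consumer-side handler might therefore look like this hedged sketch (the handler name must match the certificate name given in letsencrypt_certs; the stamp path is an assumption):
  - name: letsencrypt updated graphite01-main-cert
    file:
      path: /var/lib/letsencrypt-stamps/graphite01-main-cert
      state: touch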
A test is added to make sure the stamp file is created for the
letsencrypt test hosts, which are always generating a new certificate
in the gate test.
Change-Id: I4e0609c4751643d6e0c8d9eaa38f184e0ce5452e
Note, this does not have complete tests yet (we will need to update
the job to start a swift for that).
Change-Id: I2ee7a9e4fb503a3431366c16c380cf09327f6050
We currently only have letsencrypt_test_only as a single flag that
sets tests to use the letsencrypt staging environment and also
generates a self-signed certificate.
However, for initial testing we actually want to fully generate
certificates on hosts, but using the staging environment (i.e. *not*
generate self-signed certs). Thus we need to split this option into
two, so the gate tests still use staging+self-signed, but in-progress
production hosts can just use the staging flag.
These variables are split, and graphite01.opendev.org is made to
create staging certificates.
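For such a host the two flags end up looking roughly like this (variable names are approximations):
  # host_vars/graphite01.opendev.org.yaml style sketch
  letsencrypt_use_staging: true        # talk to the staging API endpoint
  letsencrypt_self_sign_only: false    # but actually issue the certificate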
Also remove some debugging that is no longer necessary.
Change-Id: I08959ba904f821c9408d8f363542502cd76a30a4
We don't have python2 on bridge.o.o, so force python3.
Change-Id: Ie8eb68007c0854329cf3757e577ebcbfd40ed8aa
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
This change contains the roles and testing for deploying certificates
on hosts using letsencrypt with domain authentication.
From a top level, the process is implemented in the roles as follows:
1) letsencrypt-acme-sh-install
This role installs the acme.sh tool on hosts in the letsencrypt
group, along with a small custom driver script to help parse output
that is used by later roles.
2) letsencrypt-request-certs
This role runs on each host and reads a host variable describing
the certificates required (a sketch of such a variable follows
this list). It uses the acme.sh tool (via the driver) to request
the certificates from letsencrypt. It populates a global Ansible
variable with the authentication TXT records required.
If the certificate exists on the host and is not within the renewal
period, it should do nothing.
3) letsencrypt-install-txt-record
This role runs on the adns server. It installs the TXT records
generated in step 2 to the acme.opendev.org domain and then
refreshes the server. Hosts wanting certificates will have
pre-provisioned CNAME records for _acme-challenge.host.opendev.org
pointing to acme.opendev.org.
4) letsencrypt-create-certs
This role runs on each host, reading the same variable as in step
2. However this time the acme.sh tool is run to authenticate and
create the certificates, which should now work correctly via the
TXT records from step 3. After this, the host will have the
full certificate material.
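The per-host input read in steps 2 and 4 is, roughly, a map of certificate names to the domains they cover; a hedged sketch (host and certificate names are illustrative):
  # host_vars/someservice01.opendev.org.yaml style sketch
  letsencrypt_certs:
    someservice01-opendev-org-main:
      - someservice01.opendev.org
      - someservice.opendev.org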
Testing is added via testinfra. For testing purposes requests are
made to the staging letsencrypt servers and a self-signed certificate
is provisioned in step 4 (as the authentication is not available
during CI). We test that the DNS TXT records are created locally on
the CI adns server, however.
Related-Spec: https://review.openstack.org/587283
Change-Id: I1f66da614751a29cc565b37cdc9ff34d70fdfd3f
Change I754637115f8c7469efbc1856e88bbcb6fb83b4ce moved a bunch of log
collection to use "stage-output". This uses "fetch-output" which
automatically puts these logs in hostname subdirectories; but it does
not have an option to put it in hosts/hostname as we were doing with
the other logs.
Although we could add such support, it probably doesn't make sense as
most other multinode jobs will have the same layout with the host logs
at the top level. Remove the intermediate "/hosts/" directory on
system-config jobs so all logs remain at the top level, and we don't
have this confusing split as to where logs are for each host.
Change-Id: I56bd67c659ffb26a460d9406f6f090d431c8aa79
This adds the concept of an unmanaged domain; for unmanaged domains we
will write out the zone file only if it doesn't already exist.
acme.opendev.org is added as an unmanaged domain. It will be managed
by other ansible roles which add TXT records for ACME authentication.
The initial template comes from the dependent change, and this ensures
the bind configuration is always valid.
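As a hedged sketch of the configuration shape (the exact variable and key names are assumptions):
  # dns group_vars style sketch
  dns_zones:
    - name: opendev.org
    - name: acme.opendev.org
      # only write the zone file if it does not already exist; after that
      # the letsencrypt roles own its TXT record contents
      unmanaged: true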
For flexibility and testing purposes, we allow passing an extra
refspec and version to the git checkout. This is one way to pull in
changes for speculative CI runs (I looked into having the hosts under
test check out from Zuul, but by the time we're three ansible calls
deep on the DNS hosts-under-test it's a real pain. For the number of
times we update this, it's easier to just allow a speculative change
that can take a gerrit URL; for an example see [1]).
[1] https://review.openstack.org/#/c/641155/10/playbooks/group_vars/dns.yaml
Testing is enhanced to check for zone files and correct configuration
stanzas.
Depends-On: https://review.openstack.org/641154
Depends-On: https://review.openstack.org/641168
Change-Id: I9ef5cfc850c3458c63aff46cfaa0d49a5d194e87
This allows the zones to load, which is useful in follow-on changes
where we can query them on the host from testinfra to make sure it's
all working.
Change-Id: I9d22c07ce2d1ebad67b0f1ca222c1b457779ce47
We call the bridge playbook from run-base.yaml to bootstrap bridge,
so that's really where we need to disable the cron installation.
Change-Id: I5f3d604feaca5c1d577636c2d1130eec82a35961
The run_all cron running in test jobs is unawesome because it can
cause the inventory overrides we put in for the testing to get
overwritten with the real inventory. We don't want test jobs
attempting to run against real hosts.
Change-Id: I733f66ff24b329d193799e6063953e88dd6a35b1
Add an option to run a playbook (in the fake bridge context) after
running the base playbook. Use this to run a new playbook which
exercises gitea project creation after bootstrapping the gitea
service.
Disable ansible-lint 304 because it erroneously thinks shell and
command are the same thing.
Change-Id: I0394b614771bc62b9fe23d811defd7767b3d10db
We want to trigger ansible runs on bridge.o.o from zuul jobs. First
iteration of this tried to log in as root, but this is not allowed by our
ssh config. That config seems reasonable, so we add a zuul user instead
which we can ssh in as and then run things as root from zuul jobs. This
makes use of our existing user management system.
Change-Id: I257ebb6ffbade4eb645a08d3602a7024069e60b3
This runs an haproxy which is strikingly similar to the one we
currently run for git.openstack.org, but it is run in a docker
container.
Change-Id: I647ae8c02eb2cd4f3db2b203d61a181f7eb632d2
When setting up hosts for testing in CI, configure the docker
mirrors before running the base playbook.
Change-Id: I172ae87156238fa6a07414c74e1ca17df1a30257
Add the gitea k8s cluster to root's .kube/config file on bridge.
The default context does not exist in order to force us to explicitly
specify a context for all commands (so that we do not inadvertently
deploy something on the wrong k8s cluster).
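The resulting .kube/config is conceptually like the following hedged sketch (cluster, user and server names are illustrative); note the deliberately missing current-context:
  apiVersion: v1
  kind: Config
  clusters:
    - name: gitea-k8s
      cluster:
        server: https://gitea-k8s.example.opendev.org:6443
  contexts:
    - name: gitea
      context:
        cluster: gitea-k8s
        user: gitea-admin
  users:
    - name: gitea-admin
      user: {}
  # current-context intentionally unset: every invocation must name one,
  # e.g. kubectl --context gitea get pods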
Change-Id: I53368c76e6f5b3ab45b1982e9a977f9ce9f08581