configpolicy

dustin

Author	SHA1	Message	Date
Dustin	fdc59fe73b	pyrocufflink-dns: Drop group The internal DNS server for the pyrocufflink.blue et al. domains runs on the firewall now, and is thus no longer managed by Ansible. Dropping the group variables so the file encrypted with Ansible Vault can go away.	2024-02-22 10:23:19 -06:00
Dustin	19d833cc76	websites/d&t.com: drop obsolete formsubmit config The dustinandtabitha.com website no longer uses formsubmit (the time for RSVP has long passed). Removing the configuration so the file encrypted with Ansible Vault can go away.	2024-02-22 10:23:19 -06:00
Dustin	f9f8d5aa29	Remove grafana, metricspi groups With the Metrics Pi decommissioned and Victoria Metrics and Grafana running in Kubernetes now, these groups are no longer needed.	2024-02-22 10:23:19 -06:00
Dustin	f83cea50e9	r/ssu-user-ca: Configure sshd TrustedUserCAKeys The `TrustedUserCAKeys` setting for sshd(8) tells the server to accept any certificates signed by keys listed in the specified file. The authenticating username has to match one of the principals listed in the certificate, of course. This role is applied to all machines, via the `base.yml` playbook. Certificates issued by the user CA managed by SSHCA will therefore be trusted everywhere. This brings us one step closer to eliminating the dependency on Active Directory/Samba.	2024-02-01 18:46:40 -06:00
Dustin	0d30e54fd5	r/fileserver: Restrict non-administrators to SFTP Normal users do not need shell access to the file server, and certainly should not be allowed to e.g. forward ports through it. Using a `Match` block, we can apply restrictions to users who do not need administrative functionality. In this case, we restrict everyone who is not a member of the Server Admins group in the PYROCUFFLINK AD domain.	2024-02-01 10:29:32 -06:00
Dustin	4b8b5fa90b	pyrocufflink: Enable pam_ssh_agent_auth for sudo By default, `sudo` requires users to authenticate with their passwords before granting them elevated privileges. It can be configured to allow (some) users access to (some) privileged commands without prompting for a password (i.e. `NOPASSWD`), however this has a real security implication. Disabling the password requirement would effectively grant any program root privileges. Prompting for a password prevents malicious software from running privileged commands without the user knowing. Unfortunately, handling `sudo` authentication for Ansible is quite cumbersome. For interactive use, the `--ask-become-pass`/`-K` argument is useful, though entering the password for each invocation of `ansible-playbook` while iterating on configuration policy development is a bit tedious. For non-interactive use, though, the password of course needs to be stored somewhere. Encrypting it with Ansible Vault is one way to protect it, but it still ends up stored on disk somewhere and needs to be handled carefully. pam_ssh_agent_auth provides an acceptable solution to both issues. It is better than disabling `sudo` authentication entirely, but a lot more convenient than dealing with passwords. It uses the calling user's SSH agent to assert that the user has access to a private key corresponding to one of the authorized public keys. Using SSH agent forwarding, that private key can even exist on a remote machine. If the user does not have a corresponding private key, `sudo` will fall back to normal password-based authentication. The security of this solution is highly dependent on the client to store keys appropriately. FIDO2 keys are supported, though when used with Ansible, it is quite annoying to have to touch the token for _every task_ on _every machine_. Thus, I have created new FIDO2 keys for both my laptop and my desktop that have the `no-touch-required` option enabled. 
This means that in order to use `sudo` remotely, I still need to have my token plugged in to my computer, but I do not have to tap it every time it's used. For Jenkins, a hardware token is obviously impossible, but using a dedicated key stored as a Jenkins credential is probably sufficient.	2024-01-28 12:16:35 -06:00
Dustin	7b54bc4400	nut-monitor: Require both UPS to be online Unfortunately, the automatic transfer switch does not seem to work correctly. When the standby source is a UPS running on battery, it does not switch sources if the primary fails. In other words, when the power is out and both UPS are running on battery, when the first one dies, it will NOT switch to the second one. It has no trouble switching when the second source is mains power, though, which is very strange. I have tried messing with all the settings including nominal input voltage, sensitivity, and frequency tolerance, but none seem to have any effect. Since it is more important for the machines to shut down safely than it is to have an extra 10-15 minutes of runtime during an outage, the best solution for now is to configure the hosts to shut down as soon as the first UPS battery gets low. This is largely a waste of the second UPS, but at least it will help prevent data loss.	2024-01-25 21:22:04 -06:00
Dustin	236e6dced6	r/web/hlc: Add formsubmit config for summer signup And of course, Tabitha lost her SSH key so she had to get another one.	2024-01-23 22:04:29 -06:00
Dustin	07f84e7fdc	vm-hosts: Increase VM start delay after K8s Increasing the delay after starting the Kubernetes cluster to hopefully allow things to "settle down" enough that starting services on follow up VMs doesn't time out.	2024-01-22 08:35:40 -06:00
Dustin	6f4fb70baa	vm-hosts: Clean up vm-autostart list Start Kubernetes earlier. Start Synapse later (it takes a long time to start up and often times out when the VM hosts are under heavy load). Start SMTP relay later as it's not really needed.	2024-01-21 18:42:28 -06:00
Dustin	b4fcbb8095	unifi: Deploy unifi_exporter `unifi_exporter` provides Prometheus metrics for UniFi controller.	2024-01-21 16:12:29 -06:00
Dustin	6f5b400f4a	vm-hosts: Fix test network device name The network device for the test/pyrocufflink.red network is named `br1`. This needs to match in the systemd-networkd configuration or libvirt will not be able to attach virtual machines to the bridge.	2024-01-21 15:55:37 -06:00
Dustin	fb445224a0	vm-hosts: Add k8s-amd64-n3 to autostart list	2024-01-21 15:55:23 -06:00
Dustin	525f2b2a04	nut-monitor: Configure upsmon `upsmon` is the component of [NUT] that monitors (local or remote) UPS devices and reacts to changes in their state. Notably, it is responsible for powering down the system when there is insufficient power to the system.	2024-01-19 20:50:03 -06:00
Dustin	ab30fa13ca	file-servers: Set Apache ServerName Since file0.pyrocufflink.blue now hosts a couple of VirtualHosts, accessing its HTTP server by the files.pyrocufflink.blue alias no longer works, as Apache routes unknown hostnames to the first VirtualHost, rather than the global configuration. To resolve this, we must set `ServerName` to the alias.	2023-12-29 10:46:13 -06:00
Dustin	dfd828af08	r/ssh-host-certs: Manage SSH host certificates The ssh-host-certs role, which is now applied as part of the `base.yml` playbook and therefore applies to all managed nodes, is responsible for installing the sshca-cli package and using it to request signed SSH host certificates. The sshca-cli-systemd sub-package includes systemd units that automate the process of requesting and renewing host certificates. These units need to be enabled and provided the URL of the SSHCA service. Additionally, the SSH daemon needs to be configured to load the host certificates.	2023-11-07 21:27:02 -06:00
Dustin	c6f0ea9720	r/repohost: Configure Yum package repo host So it turns out Gitea's RPM package repository feature is less than stellar. Since each organization/user can only have a single repository, separating packages by OS would be extremely cumbersome. Presumably, the feature was designed for projects that only build a single RPM for each version, but most of my packages need multiple builds, as they tend to link to system libraries. Further, only the repository owner can publish to user-scoped repositories, so e.g. Jenkins cannot publish anything to a repository under my dustin account. This means I would ultimately have to create an Organization for every OS/version I need to support, and make Jenkins a member of it. That sounds tedious and annoying, so I decided against using that feature for internal packages. Instead, I decided to return to the old ways, publishing packages with `rsync` and serving them with Apache. It's fairly straightforward to set this up: just need a directory with the appropriate permissions for users to upload packages, and configure Apache to serve from it. One advantage Gitea's feature had over a plain directory is its automatic management of repository metadata. Publishers only have to upload the RPMs they want to serve, and Gitea handles generating the index, database, etc. files necessary to make the packages available to Yum/dnf. With a plain file host, the publisher would need to use `createrepo` to generate the repository metadata and upload that as well. For repositories with multiple packages, the publisher would need a copy of every RPM file locally in order for them to be included in the repository metadata. This, too, seems like it would be too much trouble to be tenable, so I created a simple automatic metadata manager for the file-based repo host. Using `inotifywatch`, the `repohost-createrepo` script watches for file modifications in the repository base directory. 
Whenever a file is added or changed, the directory containing it is added to a queue. Every thirty seconds, the queue is processed; for each unique directory in the queue, repository metadata are generated. This implementation combines the flexibility of a plain file host, supporting an effectively unlimited number of repositories with fully-configurable permissions, and the ease of publishing of a simple file upload.	2023-11-07 20:51:10 -06:00
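The queue-and-flush behavior described in the commit above can be sketched roughly as follows. This is a minimal Python illustration, not the actual `repohost-createrepo` script: the `DirectoryQueue` class, its method names, and the example paths are all hypothetical, and the step that would regenerate repository metadata is replaced by a plain callback.

```python
import os
from typing import Callable, List


class DirectoryQueue:
    """Collect directories of changed files; process each one once per flush."""

    def __init__(self, process: Callable[[str], None]) -> None:
        # `process` stands in for whatever regenerates the repository metadata.
        self._process = process
        self._pending: List[str] = []

    def file_changed(self, path: str) -> None:
        # Queue the directory that contains the changed file, not the file itself.
        self._pending.append(os.path.dirname(path))

    def flush(self) -> List[str]:
        # Deduplicate while preserving first-seen order, then empty the queue.
        unique = list(dict.fromkeys(self._pending))
        self._pending.clear()
        for directory in unique:
            self._process(directory)
        return unique


# Hypothetical example: three uploads touching two repository directories.
processed: List[str] = []
queue = DirectoryQueue(processed.append)
queue.file_changed("/srv/repo/f38/x86_64/foo-1.0.rpm")
queue.file_changed("/srv/repo/f38/x86_64/bar-2.0.rpm")
queue.file_changed("/srv/repo/f39/x86_64/foo-1.0.rpm")
queue.flush()
```

In a real deployment, `flush()` would be driven by a timer (every thirty seconds, per the commit message), so a burst of uploads to one directory triggers a single metadata rebuild.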
Dustin	6955c4e7ad	hosts: Decommission dc-4k6s8e.p.b Replaced by dc-nrtxms.pyrocufflink.blue	2023-10-28 16:07:56 -05:00
Dustin	420764d795	hosts: Add dc-nrtxms.p.b New Fedora 38 Active Directory Domain Controller	2023-10-28 16:07:39 -05:00
Dustin	a8c184d68c	hosts: Decommission dc-ag62kz.p.b Replaced by dc-qi85ia.pyrocufflink.blue	2023-10-28 16:07:08 -05:00
Dustin	686817571e	smtp-relay: Switch to Fastmail AWS is going to begin charging extra for routable IPv4 addresses soon. There's really no point in having a relay in the cloud anymore anyway, since a) all outbound messages are sent via the local relay and b) no messages are sent to anyone except me.	2023-10-24 17:27:21 -05:00
Dustin	1b9543b88f	metricspi: alerts: Increase Frigate disk threshold We want the Frigate recording volume to be basically full at all times, to ensure we are keeping as much recording as possible.	2023-10-15 09:52:12 -05:00
Dustin	2f554dda72	metricspi: Scrape k8s-aarch64-n1 I've added a new Kubernetes worker node, k8s-aarch64-n1.pyrocufflink.blue. This machine is a Raspberry Pi CM4 mounted on a Waveshare CM4-IO-Base A and clipped onto the DIN rail. It's got 8 GB of RAM and 32 GB of eMMC storage. I intend to use it to build container images locally, instead of bringing up cloud instances.	2023-10-05 14:32:19 -05:00
Dustin	a74113d95f	metricspi: Scrape Zincati metrics from CoreOS hosts Zincati is the automatic update manager on Fedora CoreOS. It exposes Prometheus metrics for host/update statistics, which are useful to track the progress of automatic updates and identify update issues. Zincati actually exposes its metrics via a Unix socket on the filesystem. Another process, [local_exporter], is required to expose the metrics from this socket via HTTP so Prometheus can scrape them. [local_exporter]: https://github.com/lucab/local_exporter	2023-10-03 10:29:12 -05:00
Dustin	d7f778b01c	metricspi: Scrape metrics from k8s-aarch64-n0 collectd is now running on k8s-aarch64-n0.pyrocufflink.blue, exposing system metrics. As it is not a member of the AD domain, it has to be explicitly listed in the `scrape_collectd_extra_targets` variable.	2023-10-03 10:29:11 -05:00
Dustin	50f4b565f8	hosts: Remove nvr1.p.b as managed system nvr1.pyrocufflink.blue has been migrated to Fedora CoreOS. As such, it is no longer managed by Ansible; its configuration is done via Butane/Ignition. It is no longer a member of the Active Directory domain, but it does still run collectd and export Prometheus metrics.	2023-09-27 20:24:47 -05:00
Dustin	7a9c678ff3	burp-server: Keep more backups New retention policy: * 7 daily backups * 4 weekly backups * 12 ~monthly backups * 5 ~yearly backups	2023-07-17 16:36:37 -05:00
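The "~" in the retention policy above is consistent with BURP's multiplicative `keep` settings, where each tier keeps every Nth backup of the tier before it. The commit does not show the actual configuration, so the following is only a sketch of the arithmetic, assuming one backup per day and `keep` values of 7, 4, 12, and 5.

```python
from itertools import accumulate
from operator import mul

# Assumed `keep` values, one per tier (daily, weekly, ~monthly, ~yearly).
keep = [7, 4, 12, 5]

# Days of history covered once each tier is full: 7, 7*4, 7*4*12, 7*4*12*5.
spans = list(accumulate(keep, mul))

# Days between retained backups within each tier.
spacings = [1] + spans[:-1]

for count, spacing, span in zip(keep, spacings, spans):
    print(f"{count} backups, {spacing} day(s) apart, reaching back {span} days")
```

Under these assumptions the "monthly" tier is spaced 28 days apart and the "yearly" tier 336 days apart, which is why they are only approximately monthly and yearly.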
Dustin	06782b03bb	vm-hosts: Update VM autostart list * dc2 has been gone for a long time, replaced by two new domain controllers * unifi0 was recently replaced by unifi1	2023-07-07 10:05:22 -05:00
Dustin	71a43ccf07	unifi: Deploy Unifi Network controller Since Ubiquiti only publishes Debian packages for the Unifi Network controller software, running it on Fedora has historically been nigh impossible. Fortunately, a modern solution is available: containers. The linuxserver.io project publishes a container image for the controller software, making it fairly easy to deploy on any host with an OCI runtime. I briefly considered creating my own image, since theirs must be run as root, but I decided the maintenance burden would not be worth it. Using Podman's user namespace functionality, I was able to work around this requirement anyway.	2023-07-07 10:05:01 -05:00
Dustin	61844e8a95	pyrocufflink: Add Luma SSH keys for root Sometimes I need to connect to a machine when there is an AD issue (e.g. domain controllers are down, clocks are out of sync, etc.) but I can't do it from my desktop.	2023-07-05 16:35:57 -05:00
Dustin	0a68d84121	metricspi: Scrape hatchlearningcenter.org To monitor site availability and certificate expiration.	2023-06-21 14:31:33 -05:00
Dustin	4e608e379f	metricspi/alerts: Correct BURP archive alert query When the RAID array is being resynchronized after the archived disk has been reconnected, md changes the disk status from "missing" to "spare." Once the synchronization is complete, it changes from "spare" to "active." We only want to trigger the "disk needs archived" alert once the synchronization process is complete; otherwise, both the "disks need swapped" and "disk needs archived" alerts would be active at the same time, which makes no sense. By adjusting the query for the "disk needs archived" alert to consider disks in both "missing" and "spare" status, we can delay firing that alert until the proper time.	2023-06-20 11:58:35 -05:00
Dustin	bf4d57b5cb	frigate: Configure journal2ntfy for MD RAID The Frigate server has a RAID array that it uses to store video recordings. Since there have been a few occasions where the array has suddenly stopped functioning, probably because of the cheap SATA controller, it will be nice to get an alert as soon as the kernel detects the problem, so as to minimize data loss.	2023-06-08 10:05:36 -05:00
Dustin	87e8ec2ed4	synapse: Back up data using BURP Most of the Synapse server's state is in its SQLite database. It also has a `media_store` directory that needs to be backed up, though. In order to back up the SQLite database while the server is running, the database must be in "WAL mode." By default, Synapse leaves the database in the default "rollback journal mode," which disallows multiple processes from accessing the database, even for read-only operations. To change the journal mode: ```sh sudo systemctl stop synapse sudo -u synapse sqlite3 /var/lib/synapse/homeserver.db 'PRAGMA journal_mode=WAL;' sudo systemctl start synapse ```	2023-05-23 09:52:50 -05:00
Dustin	78296f7198	Merge branch 'journal2ntfy'	2023-05-23 08:31:52 -05:00
Dustin	347cda74fd	metrics: Scrape metrics from Kubernetes API server Kubernetes exports a lot of metrics in Prometheus format. I am not sure what all is there, yet, but apparently several thousand time series were added. To allow anonymous access to the metrics, I added this RoleBinding: ```yaml apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: name: prometheus rules: - apiGroups: - "" resources: - nodes/metrics verbs: - get - nonResourceURLs: - /metrics verbs: - get ```	2023-05-22 21:21:08 -05:00
Dustin	c0bb387b18	metricspi: Scrape metrics from MinIO backup storage MinIO exposes metrics in Prometheus exposition format. By default, it requires an authentication token to access the metrics, but I was unable to get this to work. Fortunately, it can be configured to allow anonymous access to the metrics, which is fine, in my opinion.	2023-05-22 21:19:25 -05:00
Dustin	a7319c561d	journal2ntfy: Script to send log messages via ntfy The `journal2ntfy.py` script follows the systemd journal by spawning `journalctl` as a child process and reading from its standard output stream. Any command-line arguments passed to `journal2ntfy` are passed to `journalctl`, which allows the caller to specify message filters. For any matching journal message, `journal2ntfy` sends a message via the ntfy web service. For the BURP server, we're going to use `journal2ntfy` to generate alerts about the RAID array. When I reconnect the disk that was in the fireproof safe, the kernel will log a message from the md subsystem indicating that the resynchronization process has begun. Then, when the disks are again in sync, it will log another message, which will let me know it is safe to archive the other disk.	2023-05-17 14:51:21 -05:00
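The approach described in the commit above (follow `journalctl` as a child process, forward each message to ntfy) might look something like the sketch below. It is not the actual `journal2ntfy.py`: the function names, the choice of `journalctl` output options, and the `NTFY_URL` topic address are assumptions made for illustration.

```python
import subprocess
import urllib.request
from typing import Iterable, Iterator, List

# Hypothetical ntfy topic URL; the real service address is not in the commit.
NTFY_URL = "https://ntfy.example.org/alerts"


def journalctl_command(extra_args: List[str]) -> List[str]:
    # Follow new entries only, print bare message text, and pass the caller's
    # filter arguments through to journalctl unchanged.
    return ["journalctl", "--follow", "--lines=0", "--output=cat", *extra_args]


def messages(lines: Iterable[str]) -> Iterator[str]:
    # Yield each non-empty journal line with surrounding whitespace removed.
    for line in lines:
        text = line.strip()
        if text:
            yield text


def notify(message: str, url: str = NTFY_URL) -> None:
    # ntfy accepts the notification text as the body of an HTTP POST to a topic.
    request = urllib.request.Request(
        url, data=message.encode("utf-8"), method="POST"
    )
    with urllib.request.urlopen(request) as response:
        response.read()


def main(argv: List[str]) -> None:
    # Spawn journalctl and forward every message it prints until it exits.
    proc = subprocess.Popen(
        journalctl_command(argv), stdout=subprocess.PIPE, text=True
    )
    if proc.stdout is None:
        raise RuntimeError("journalctl stdout was not captured")
    for message in messages(proc.stdout):
        notify(message)
```

Calling `main(["--dmesg", "--grep=md"])`, for example, would restrict the stream to kernel messages matching `md`; the exact filters used on the BURP server are not given in the commit.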
Dustin	2c002aa7c5	alerts: Add alert to archive BURP disk This alert will fire once the MD RAID resynchronization process has completed and both disks in the array are online. It will clear when one disk is disconnected and moved to the safe.	2023-05-16 08:33:13 -05:00
Dustin	877dcc3879	alerts: Add alerts for missed client backups When BURP fails to even start a backup, it does not trigger a notification at all. As a result, I may not notice for a few days when backups are not happening. That was the case this week, when clients' backups were failing immediately, because of a file permissions issue on the server. To hopefully avoid missing backups for too long in the future, I've added two new alerts: * The no recent backups alert fires if there have not been any BURP backups recently. This may also fire, for example, if the BURP exporter is not working, or if there is something wrong with the BURP data volume. * The missed client backup alert fires if an active BURP client (i.e. one that has had at least one backup in the past 90 days) has not been backed up in the last 24 hours.	2023-05-14 11:48:36 -05:00
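The "missed client backup" condition in the commit above boils down to a two-sided age check. The real alerts are expressed as alerting rules that the commit does not show, so the following is just a plain-Python sketch of the logic, with hypothetical client names and timestamps.

```python
from datetime import datetime, timedelta
from typing import Dict, List

ACTIVE_WINDOW = timedelta(days=90)  # a client with a backup this recent is "active"
MISSED_AFTER = timedelta(hours=24)  # an active client should back up this often


def missed_clients(last_backup: Dict[str, datetime], now: datetime) -> List[str]:
    """Return active clients whose most recent backup is over 24 hours old."""
    missed = []
    for client, when in sorted(last_backup.items()):
        age = now - when
        # Older than 24 hours, but still within the 90-day "active" window.
        if MISSED_AFTER < age <= ACTIVE_WINDOW:
            missed.append(client)
    return missed


# Hypothetical data: one healthy client, one missed, one long retired.
now = datetime(2023, 5, 14, 12, 0)
last_backup = {
    "laptop": now - timedelta(hours=2),
    "desktop": now - timedelta(days=3),
    "retired": now - timedelta(days=200),
}
result = missed_clients(last_backup, now)
```

Here only `desktop` is reported: `laptop` backed up recently, and `retired` falls outside the 90-day window, so it no longer counts as an active client.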
Dustin	a2bcd5ccbb	alerts: Adjust BURP RAID disk swap alert Using a 30-day window for the `tlast_change_over_time` function effectively "caps out" the value at 30 days. Thus, the alert reminding me to swap the BURP backup volume will never fire, since the value will never be greater than the 30-day threshold. Using a wider window resolves that issue (though the query will still produce inaccurate results beyond the window).	2023-05-14 11:38:00 -05:00
Dustin	ad9fb6798e	samba-dc: Omit tls cafile setting The `tls cafile` setting in `smb.conf` is not necessary. It is used for verifying peer certificates for mutual TLS authentication, not to specify the intermediate certificate authority chain like I thought. The setting cannot simply be left out, though. If it is not specified, Samba will attempt to load a file from a built-in default path, which will fail, causing the server to crash. This is avoided by setting the value to the empty string.	2023-05-10 08:28:49 -05:00
Dustin	9722fed1b8	metricspi: Scrape dustinandtabitha.com	2023-05-09 21:30:11 -05:00
Dustin	f6f286ac24	alerts: Correct BURP volume swap alert The `tlast_change_over_time` function needs an interval wide enough to consider the range of time we are interested in. In this case, we want to see if the BURP volume has been swapped in the last thirty days, so the interval needs to be `30d`.	2023-05-03 11:06:34 -05:00
Dustin	5ed3ee525e	synapse: Update LDAP server URI	2023-05-01 12:36:33 -05:00
Dustin	a4cc9d0c46	metricspi: Scrape tabitha.biz	2023-04-23 20:03:43 -05:00
Dustin	6c68126a3a	grafana: Update LDAP server host name dc0.p.b has been gone for a while now. All the current domain controllers use LDAPS certificates signed by Let's Encrypt and include the pyrocufflink.blue name, so we can now use the apex domain A record to connect to the directory.	2023-04-12 14:07:51 -05:00
Dustin	78f65355fa	gitea: Back up with BURP	2023-04-12 14:07:51 -05:00
Dustin	1da4c17a8c	alerts: Add alerts for HTTPS certificates These alerts will generate notifications when websites' HTTPS certificates are not properly renewed automatically and become in danger of expiring.	2023-04-12 13:55:31 -05:00
Dustin	bf4133652c	metrics: Scrape Jenkins with blackbox exporter This is mostly to monitor the HTTPS certificate expiration.	2023-04-12 13:55:31 -05:00
Dustin	dc2a05dc8f	alerts: Add alert for BURP RAID array swap This alert counts how long it's been since the number of "active" disks in the RAID array on the BURP server has changed. The assumption is that the number will typically be `1`, but it will be `2` when the second disk is synchronized before the swap occurs.	2023-04-11 22:25:36 -05:00
Dustin	2394bf7436	metricspi: Fix vmalert links 1. Grafana 8 changed the format of the query string parameters for the Explore page. 2. vmalert no longer needs the http.pathPrefix argument when behind a reverse proxy, rather it uses the request path like the other Victoria Metrics components.	2023-04-11 21:46:43 -05:00
Dustin	6c562c9821	alerts: Ignore missing mdraid disk for BURP The way I am handling swapping out the BURP disk now is by using the Linux MD RAID driver to manage a RAID 1 mirror array. The array normally operates with one disk missing, as it is in the fireproof safe. When it is time to swap the disks, I reattach the offline disk, let the array resync, then disconnect and store the other disk. This works considerably better than the previous method, as it does not require BURP or the NFS server to be offline during the synchronization.	2023-04-11 20:08:07 -05:00
Dustin	a59f24a8b5	metricspi: Stop scraping speedtest Running the speed test periodically was just wasting bandwidth. It failed frequently, and generally did not provide useful information.	2023-04-02 11:05:16 -05:00
Dustin	94de5d6067	samba-dc: Decrease Samba log level The default log level (3) produces too much output and quickly fills the `/var/log` volume on the domain controllers.	2023-03-08 11:26:57 -06:00
Dustin	748c432334	vaultwarden: Change Domain URL The rule is "if it is accessible on the Internet, its name ends in .net" Although Vaultwarden can be accessed by either name, the one specified in the Domain URL setting is the only one that works for WebAuthn.	2023-03-03 11:17:07 -06:00
Dustin	632e1dd906	metricspi: Update LDAP configuration All domain controllers now use the Let's Encrypt wildcard certificate for the pyrocufflink.blue domain. Further, dc2.p.b is decommissioned.	2023-01-09 12:23:54 -06:00
Dustin	90f9e5eba5	samba-dc: Manage sudoers Domain controllers only allow users in the Domain Admins AD group to use `sudo` by default. dustin and jenkins need to be able to apply configuration policy to these machines, but they are not members of said group.	2022-12-23 08:47:31 -06:00
Dustin	9408ee31c3	home-assistant: Back up Zigbee/ZWave/Mosquitto Mosquitto, Zigbee2MQTT, and ZWaveJS2MQTT all have persistent state that needs to be backed up in addition to Home Assistant's own data.	2022-12-23 06:56:52 -06:00
Dustin	77191c8b5a	Fedora37: Set collectd SELinux domain permissive collectd is broken by default on Fedora 36 and 37. Several plugins generate AVC denials.	2022-12-19 10:22:00 -06:00
Dustin	637289036a	blackbox: Update pyrocufflink DNS check I changed the naming convention for domain controller machines. They are no longer "numbered," since the plan is to rotate through them quickly. For each release of Fedora, we'll create two new domain controllers, replacing the existing ones. Their names are now randomly generated and contain letters and numbers, so the Blackbox Exporter check for DNS records needs to account for this.	2022-12-19 09:04:37 -06:00
Dustin	caef7f342b	vm-hosts: Update autostart list * Remove DC0 (decommissioned) * Remove Jenkins and its build VMs (Migrated to Kubernetes) * Add pxe0 (Required for Basement HUD)	2022-12-18 19:55:48 -06:00
Dustin	77c6408187	metricspi: Remove sensors scrape job Sensor data are retrieved via Home Assistant.	2022-12-18 19:16:10 -06:00
Dustin	244482ac52	websites: Add hatchlearningcenter.org This is the website for Tabitha's new hybrid private school! 👩‍🎓	2022-11-30 22:04:29 -06:00
Dustin	772f669ab2	r/gitea: Handle encoded / characters in HTTP paths Gitea package names (e.g. OCI images, etc.) can contain `/` characters. These are encoded as %2F in request paths. Apache needs to forward these sequences to the Gitea server without decoding them. Unfortunately, the `AllowEncodedSlashes` setting, which controls this behavior, is a per-virtualhost setting that is not inherited from the main server configuration, and therefore must be explicitly set inside the `VirtualHost` block. This means Gitea needs its own virtual host definition, and cannot rely on the default virtual host.	2022-11-27 17:21:03 -06:00
Dustin	4511d5447e	vm-hosts: Add missing kube.network config When I added the systemd-networkd configuration for the Kubernetes network interface on the VM hosts, I only added the `.netdev` configuration and forgot the `.network` part. Without the latter, systemd-networkd creates the interface, but does not configure or activate it, so it is not able to handle traffic for the VMs attached to the bridge.	2022-08-22 20:00:47 -05:00
Dustin	b8b8ae5798	vm-hosts: Define machines to auto start	2022-08-20 21:19:01 -05:00
Dustin	bc60451949	metricspi: Update DNS server address DNS is now handled by the border firewall.	2022-08-20 18:19:13 -05:00
Dustin	4622240c6c	r/netboot/jenkins-agent: Configure NBD exports The netboot/jenkins-agent Ansible role configures three NBD exports: * A single, shared, read-only export containing the Jenkins agent root filesystem, as a SquashFS filesystem * For each defined agent host, a writable data volume for Jenkins workspaces * For each defined agent host, a writable data volume for Docker. Agent hosts must have some kind of unique value to identify their persistent data volumes. Raspberry Pi devices, for example, can use the SoC serial number.	2022-08-15 17:14:06 -05:00
Dustin	dbc18022f2	metricspi: Increase scrape_timeout for speedtest Running the Internet speed test can often take longer than a minute.	2022-08-12 14:54:49 -05:00
Dustin	ce3e88932d	vmalert: Allow configuring http.pathPrefix vmalert requires explicit configuration when it is behind a reverse proxy.	2022-08-12 13:10:36 -05:00
Dustin	fe87edea21	r/vmalert: Allow configuring external source URLs The `-external.url` and `-external.alert.source` command line arguments and their corresponding environment variables can be used to configure the "Source" links associated with alerts created by `vmalert`.	2022-08-12 12:58:53 -05:00
Dustin	c57500a9f4	metricspi: Update speedtest scrape target The firewall hardware is too slow to run the prometheus_speedtest program. It always showed way lower speeds than were actually available. I've moved the service to the Kubernetes cluster and it works a lot better there.	2022-08-12 12:55:52 -05:00
Dustin	4ddbc9f256	hosts: Add mtrcs0.p.r mtrcs0.pyrocufflink.red is a Raspberry Pi CM4 on a Waveshare CM4-IO-BASE-B carrier board with an NVMe SSD. It runs a custom OS built using Buildroot, and is not a member of the pyrocufflink.blue AD domain. mtrcs0.p.r hosts Victoria Metrics/`vmagent`, `vmalert`, AlertManager, and Grafana. I've created a unique group and playbook for it, metricspi, to manage all these applications together.	2022-08-11 21:40:19 -05:00
Dustin	4aedeef546	grafana: Redirect HTTP to HTTPS	2022-08-10 21:55:54 -05:00
Dustin	c48cc985b2	r/collectd: Ignore filesystems by path In addition to ignoring particular types of filesystems, e.g. OverlayFS, we can also ignore filesystems by their mount point. This could be useful, for example, for bind-mounted directories, such as those used on Kubernetes nodes.	2022-08-05 18:56:48 -05:00
Dustin	c8e89a4b16	hosts: Add Kubernetes machines There is no specific playbook or role for Kubernetes. All OS configuration is done at install time via kickstart scripts, and deploying Kubernetes itself is done (manually) using `kubeadm init` and `kubeadm join`.	2022-08-03 20:52:01 -05:00
Dustin	3b692a9de8	vm-hosts: Add Kubernetes VLAN configuration	2022-08-03 20:51:33 -05:00
Dustin	6f11a4cf3a	grafana: Set Grafana domain Necessary for Grafana CSRF protection.	2022-07-24 10:31:46 -05:00
Dustin	3e8da609e7	frigate: Keep front porch recordings for 2 days Now that there is plenty of storage in the new video server, let's keep 24/7 recordings from the front porch camera, too.	2022-07-23 17:52:26 -05:00
Dustin	c1c28a51b5	frigate: Use native MQTT/TLS support Frigate has native support for MQTT over TLS now, so there is no longer any need to use stunnel.	2022-07-23 17:27:02 -05:00
Dustin	d5ef18ccc3	frigate: Split camera config into separate file This will make it easier to manage Frigate camera settings.	2022-07-23 17:26:19 -05:00
Dustin	41582beef9	group_vars/frigate: Add second back yard camera Adding a second camera to the back yard, on the North side of the porch, to try and figure out how the possums keep getting under the porch even with the chicken wire around it!	2022-07-18 18:25:20 -05:00
Dustin	82f9ce0797	group_vars/frigate: Keep back yard recordings We're trying to discover how the possums are getting into and out of the house. Let's enable continuous video recording from the back yard camera so we can observe them and come up with a plan to get rid of them.	2022-07-18 18:20:21 -05:00
Dustin	a3608f187c	home-assistant: Enable Mosquitto persistence Configuring Mosquitto to persist its state to the filesystem will keep retained messages from MQTT sensors, etc.	2022-05-29 11:26:39 -05:00
Dustin	3c8e576841	grafana: Enable anonymous access Allow unauthenticated users to view dashboards. Useful for Heads-Up Displays.	2022-03-07 20:10:13 -06:00
Dustin	5485fc6f93	websites/d…and…t: Configure formsubmit To handle the RSVP form on dustinandtabitha.com, we are going to use formsubmit. It runs on the same machine that hosts the website, so there's no dealing with CORS. The /submit/rsvp path, which is proxied to the backend, is the RSVP form's target.	2022-02-27 17:56:54 -06:00
Dustin	3632698f37	websites/dustinandtabitha.com: Add role Wedding website 😍	2022-02-27 17:41:40 -06:00
Dustin	c12da40228	home-assistant: Correct BURP exclude syntax BURP does not support relative paths or globs in `exclude` values.	2022-01-16 10:08:27 -06:00
Dustin	5efbee725e	home-assistant: Omit history DB from backups The state history database is entirely too big. It takes over an hour to create a backup of it, which usually causes BURP to time out. The data it stores isn't particularly interesting anyway. Instead of trying to back it up and ultimately not getting any backup at all, we'll just skip it altogether to ensure we have a consistent backup of everything else that is actually important.	2022-01-02 12:07:12 -06:00
Dustin	2b27a31bee	frigate: Update config syntax for 0.9.x There were several backward-incompatible changes introduced in Frigate [0.9.0](https://github.com/blakeblackshear/frigate/releases/tag/v0.9.0). Notably, recordings and clips are now configured together.	2021-12-30 09:33:58 -06:00
Dustin	6acb25e309	nextcloud: Trust headers from public rev proxy If Nextcloud does not have the Internet-facing reverse proxy listed in its "trusted proxies" setting, it will mark all traffic as being from the proxy itself. This breaks brute force detection, etc.	2021-12-20 22:20:09 -06:00
Dustin	74deb895ae	pyrocufflink-dns: Remove dc0 forwarder Decommissioning dc0.pyrocufflink.blue. Do not forward requests for internal domain names to it.	2021-12-18 16:44:48 -06:00
Dustin	62ca80a5f0	pyrocufflink-dns: Remove FireMon zones There is no longer any point to having forward zones in the main DNS server for FireMon domains, since we don't have a network-wide VPN anymore.	2021-12-18 10:51:17 -06:00
Dustin	739ffb2845	home-assistant: Configure BURP backups Take a snapshot of the history database first, then back up everything in `/var/lib/homeassistant`.	2021-12-17 20:57:38 -06:00
Dustin	fdfdaa6fe6	bitwarden_rs: Update burp backup path Vaultwarden data are stored in a different location since the migration to Podman.	2021-12-17 20:33:31 -06:00
Dustin	14c7b1fcc1	bitwarden_rs: Update collectd process name `bitwarden_rs` is now named `vaultwarden`.	2021-11-06 19:42:07 -05:00
Dustin	c882ac45e7	nut: Add playbook for NUT NUT runs on serial0.pyrocufflink.blue and monitors the two UPSes on the server rack.	2021-10-31 14:28:27 -05:00
Dustin	881c8de625	Switch Prometheus/collectd to pull Transitioning from push-based to pull-based monitoring with Prometheus/collectd. The write_prometheus plugin will be installed on all hosts, and Prometheus will be configured to scrape them directly.	2021-10-30 16:41:17 -05:00
Dustin	8e9699810b	burp-server: Monitor burp process with collectd	2021-10-16 21:53:51 -05:00

271 Commits (010f652060107aa0167c688c8c615d218e36d470)