74 Commits

Author SHA1 Message Date
da92154d33 Remove old secrets 2025-10-01 16:40:16 +02:00
adec7f80fd Mount /ceph in owl1 and owl2 2025-10-01 16:40:16 +02:00
8a0034a867 Warn about the owl2 omnipath device 2025-10-01 16:40:16 +02:00
6828273c05 Clean owl2 configuration 2025-10-01 16:40:16 +02:00
8cedffe040 Move the ceph client config to an external module 2025-10-01 16:40:16 +02:00
8a027d8b09 Reorganize secrets and ssh keys
The agenix tools needs to read the secrets from a standalone file, but
we also need the same information for the SSH keys.
2025-10-01 16:40:16 +02:00
1f32b8409a Add anavarro user 2025-10-01 16:40:16 +02:00
bc51564a88 Set zsh inc_append_history option 2025-10-01 16:40:16 +02:00
8ba4f910c3 Set zsh shell for rarias 2025-10-01 16:40:16 +02:00
515fa49ed0 Enable zsh and fix key bindings 2025-10-01 16:40:16 +02:00
c63fa494d5 Keep a log over time with the config commits 2025-10-01 16:40:16 +02:00
a6d3f43b98 Store nixos config in /etc/nixos/config.rev 2025-10-01 16:40:16 +02:00
76e6ae2f00 Enable binary emulation for other architectures 2025-10-01 16:40:16 +02:00
409efacf5b Enable watchdog 2025-10-01 16:40:16 +02:00
e1e879178d Enable all osd on boot in lake2 2025-10-01 16:40:16 +02:00
042ca9e882 Scrape lake2 too 2025-10-01 16:40:16 +02:00
9241bda0ac Also enable monitoring in lake2 2025-10-01 16:40:16 +02:00
005a1be48a Scrape metrics from bay 2025-10-01 16:40:16 +02:00
f86114f33e Add monitoring in the bay node 2025-10-01 16:40:16 +02:00
af29f639e2 Add fio tool 2025-10-01 16:40:16 +02:00
0fe025e8be Add ceph tools in hut too 2025-10-01 16:40:16 +02:00
6a429fda1b Switch ceph logs to journal 2025-10-01 16:40:16 +02:00
3b823ee478 Move pkgs overlay to overlay.nix 2025-10-01 16:40:16 +02:00
d9dea762de Enable ceph osd daemons in lake2 2025-10-01 16:40:16 +02:00
80efd57a11 Add the lake2 hostname to the hosts 2025-10-01 16:40:16 +02:00
cced6b0dc0 Use the sda for lake2 2025-10-01 16:40:16 +02:00
b63b450111 Remove netboot module 2025-10-01 16:40:16 +02:00
81baeee5b1 Disable pixiecore in hut for now 2025-10-01 16:40:16 +02:00
686f750c06 Add PXE helper 2025-10-01 16:40:16 +02:00
33155fcb62 Enable netboot again for PXE 2025-10-01 16:40:16 +02:00
6e89b3f936 Specify the disk by path 2025-10-01 16:40:16 +02:00
f0f67f374e Prepare lake2 config after bootstrap
The disk ID is different under NixOS.
2025-10-01 16:40:16 +02:00
7443a192c6 Add lake2 bootstrap config 2025-10-01 16:40:16 +02:00
3c83996e26 Add agenix to PATH in hut 2025-10-01 16:40:16 +02:00
a4fc3d131a Store ceph secret key in age
This allows a node to mount the ceph FS without any extra ceph
configuration in /etc/ceph.
2025-10-01 16:40:16 +02:00
660a8ae163 Add rarias key for secrets 2025-10-01 16:40:16 +02:00
91270b26bb Add ceph metrics to prometheus 2025-10-01 16:40:16 +02:00
94ce6fedf9 Mount the ceph filesystem in hut 2025-10-01 16:40:16 +02:00
817c98d37b Add ceph config in bay 2025-10-01 16:40:16 +02:00
9cd013c4ed Add the bay host name 2025-10-01 16:40:16 +02:00
f707650724 Remove netboot and fixes 2025-10-01 16:40:15 +02:00
9c152ec9cc Add bay node 2025-10-01 16:40:15 +02:00
8fcb5a1079 Monitor power from other nodes via LAN 2025-10-01 16:40:15 +02:00
b80656228d Increase prometheus retention time to one year 2025-10-01 16:40:15 +02:00
cd6e6de2ad Don't set all_proxy 2025-10-01 16:40:15 +02:00
ae2007e2fe Allow access to devices for node_exporter 2025-10-01 16:40:15 +02:00
d8e366b444 GRUB version no longer needed 2025-10-01 16:40:15 +02:00
8c1bf6db42 Kill slurmd remaining processes on upgrade 2025-10-01 16:40:15 +02:00
a094093d95 koro: Add vlopez user 2025-10-01 16:40:15 +02:00
cbe53a6f0a Add koro node 2025-10-01 16:40:15 +02:00