162 Commits

Author SHA1 Message Date
76e6ae2f00 Enable binary emulation for other architectures 2025-10-01 16:40:16 +02:00
409efacf5b Enable watchdog 2025-10-01 16:40:16 +02:00
e1e879178d Enable all osd on boot in lake2 2025-10-01 16:40:16 +02:00
042ca9e882 Scrape lake2 too 2025-10-01 16:40:16 +02:00
9241bda0ac Also enable monitoring in lake2 2025-10-01 16:40:16 +02:00
005a1be48a Scrape metrics from bay 2025-10-01 16:40:16 +02:00
f86114f33e Add monitoring in the bay node 2025-10-01 16:40:16 +02:00
af29f639e2 Add fio tool 2025-10-01 16:40:16 +02:00
0fe025e8be Add ceph tools in hut too 2025-10-01 16:40:16 +02:00
6a429fda1b Switch ceph logs to journal 2025-10-01 16:40:16 +02:00
3b823ee478 Move pkgs overlay to overlay.nix 2025-10-01 16:40:16 +02:00
d9dea762de Enable ceph osd daemons in lake2 2025-10-01 16:40:16 +02:00
80efd57a11 Add the lake2 hostname to the hosts 2025-10-01 16:40:16 +02:00
cced6b0dc0 Use the sda for lake2 2025-10-01 16:40:16 +02:00
b63b450111 Remove netboot module 2025-10-01 16:40:16 +02:00
81baeee5b1 Disable pixiecore in hut for now 2025-10-01 16:40:16 +02:00
686f750c06 Add PXE helper 2025-10-01 16:40:16 +02:00
33155fcb62 Enable netboot again for PXE 2025-10-01 16:40:16 +02:00
6e89b3f936 Specify the disk by path 2025-10-01 16:40:16 +02:00
f0f67f374e Prepare lake2 config after bootstrap 2025-10-01 16:40:16 +02:00
    The disk ID is different under NixOS.
7443a192c6 Add lake2 bootstrap config 2025-10-01 16:40:16 +02:00
3c83996e26 Add agenix to PATH in hut 2025-10-01 16:40:16 +02:00
a4fc3d131a Store ceph secret key in age 2025-10-01 16:40:16 +02:00
    This allows a node to mount the ceph FS without any extra ceph configuration in /etc/ceph.
660a8ae163 Add rarias key for secrets 2025-10-01 16:40:16 +02:00
91270b26bb Add ceph metrics to prometheus 2025-10-01 16:40:16 +02:00
94ce6fedf9 Mount the ceph filesystem in hut 2025-10-01 16:40:16 +02:00
817c98d37b Add ceph config in bay 2025-10-01 16:40:16 +02:00
9cd013c4ed Add the bay host name 2025-10-01 16:40:16 +02:00
f707650724 Remove netboot and fixes 2025-10-01 16:40:15 +02:00
9c152ec9cc Add bay node 2025-10-01 16:40:15 +02:00
8fcb5a1079 Monitor power from other nodes via LAN 2025-10-01 16:40:15 +02:00
b80656228d Increase prometheus retention time to one year 2025-10-01 16:40:15 +02:00
cd6e6de2ad Don't set all_proxy 2025-10-01 16:40:15 +02:00
ae2007e2fe Allow access to devices for node_exporter 2025-10-01 16:40:15 +02:00
d8e366b444 GRUB version no longer needed 2025-10-01 16:40:15 +02:00
8c1bf6db42 Kill slurmd remaining processes on upgrade 2025-10-01 16:40:15 +02:00
a094093d95 koro: Add vlopez user 2025-10-01 16:40:15 +02:00
cbe53a6f0a Add koro node 2025-10-01 16:40:15 +02:00
1b8c3eb554 eudy: Add fcsv3 and intermediate versions for testing 2025-10-01 16:40:15 +02:00
c0335c1f95 eudy: Enable memory overcommit 2025-10-01 16:40:15 +02:00
d5857c0f7d eudy: disable all cpu mitigations 2025-10-01 16:40:15 +02:00
9097811cc0 Enable NTP using the BSC time server 2025-10-01 16:40:15 +02:00
83acd40880 Add the ssfhead node as gateway 2025-10-01 16:40:15 +02:00
ba75bf8249 Use our host names first by default 2025-10-01 16:40:15 +02:00
e9845cc76a Add DNS tools to resolve hosts 2025-10-01 16:40:15 +02:00
d5951483ee Lower perf_event_paranoid to -1 2025-10-01 16:40:15 +02:00
937d8a7637 Set perf paranoid to 0 by default 2025-10-01 16:40:15 +02:00
798e01f9e6 Add perf to packages 2025-10-01 16:40:15 +02:00
2ca7e7383e Allow srun to specify the cpu binding 2025-10-01 16:40:15 +02:00
    The task/affinity plugin needs to be selected.
b610f12133 Move authorized keys to users.nix 2025-10-01 16:40:15 +02:00