130 Commits

Author SHA1 Message Date
43c63f45d7 Add lake2 bootstrap config 2023-08-24 12:30:46 +02:00
35580a83a0 Add section to enable serial console 2023-08-24 12:29:44 +02:00
591a4c774e Add agenix to PATH in hut 2023-08-23 17:42:50 +02:00
e8d5eeb5cf Store ceph secret key in age
This allows a node to mount the ceph FS without any extra ceph
configuration in /etc/ceph.
2023-08-23 17:18:17 +02:00
2516559fac Add rarias key for secrets 2023-08-23 17:15:26 +02:00
bb8bf86051 Add ceph metrics to prometheus 2023-08-22 16:33:55 +02:00
2416ec7806 Mount the ceph filesystem in hut 2023-08-22 15:57:49 +02:00
34ebe09f66 Add ceph config in bay 2023-08-22 15:57:25 +02:00
1f270d070d Add the bay host name 2023-08-22 15:56:09 +02:00
817bea45a5 Remove netboot and fixes 2023-07-28 20:31:44 +02:00
490cdf7b95 Add bay node 2023-07-28 19:49:48 +02:00
335c77593d Update flake 2023-08-22 10:28:26 +02:00
199358a5e3 Monitor power from other nodes via LAN 2023-08-17 18:55:40 +02:00
776a582c10 Increase prometheus retention time to one year 2023-07-28 16:19:59 +02:00
b526531f20 Don't set all_proxy 2023-08-17 12:37:58 +02:00
ad78e41c8b Update nixpkgs to fix docker problem 2023-07-28 14:24:51 +02:00
b978839406 Allow access to devices for node_exporter 2023-07-28 13:48:30 +02:00
b698b9da12 GRUB version no longer needed 2023-07-27 17:22:20 +02:00
92f5c1ee19 Upgrade flake: nixpkgs, bscpkgs and agenix 2023-07-27 17:19:17 +02:00
c8ff31ec08 Kill slurmd remaining processes on upgrade 2023-07-27 14:24:21 +02:00
b408af0092 koro: Add vlopez user 2023-07-21 10:34:37 +02:00
4878b6fd8b Add koro node 2023-07-21 10:34:19 +02:00
b5d3d08706 eudy: Add fcsv3 and intermediate versions for testing 2023-07-12 13:22:42 +02:00
72497a88d4 eudy: Enable memory overcommit 2023-06-30 12:49:44 +02:00
cb90c9c73f eudy: disable all cpu mitigations 2023-06-29 09:14:39 +02:00
246226b3d3 Enable NTP using the BSC time server 2023-06-30 14:02:15 +02:00
aaa082390e Add the ssfhead node as gateway 2023-06-30 14:01:35 +02:00
cc2160f134 Use our host names first by default 2023-06-23 16:22:18 +02:00
01e7a9b8a4 Add DNS tools to resolve hosts 2023-06-23 16:12:25 +02:00
a66a4d9a43 Lower perf_event_paranoid to -1 2023-06-23 16:01:27 +02:00
31eace8400 Set perf paranoid to 0 by default 2023-06-21 16:23:16 +02:00
4997191f30 Add perf to packages 2023-06-21 15:41:06 +02:00
3ea8bdcdf1 Allow srun to specify the cpu binding
The task/affinity plugin needs to be selected.
2023-06-21 13:16:23 +02:00
7db6671ce5 Move authorized keys to users.nix 2023-06-20 14:08:34 +02:00
952541ff4a Add rpenacob user 2023-06-20 12:48:00 +02:00
d200e4b172 Add osumb to the system packages 2023-06-16 19:22:41 +02:00
cced1c2e08 flake.lock: Update
Flake lock file updates:

• Updated input 'bscpkgs':
    'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs%2fheads%2fmaster&rev=c775ee4d6f76aded05b08ae13924c302f18f9b2c' (2023-04-26)
  → 'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs%2fheads%2fmaster&rev=cbe9af5d042e9d5585fe2acef65a1347c68b2fbd' (2023-06-16)
2023-06-16 18:33:54 +02:00
197c93a2be Set mpi to mpich by default in bscpkgs 2023-06-16 16:05:17 +02:00
d9002dd028 Add missing parameter to extend 2023-06-16 16:04:36 +02:00
60ee744a54 Use explicit order in overlays 2023-06-16 16:02:25 +02:00
cd1fde4760 Replace mpi inside bsc attribute 2023-06-16 15:54:55 +02:00
3985e66fa4 Add mpich overlay 2023-06-16 14:16:51 +02:00
5010746e9c Add coments in slurm config 2023-06-16 14:16:14 +02:00
6df4924b00 Add eudy host key to known hosts 2023-06-16 17:29:48 +02:00
59a29e1af6 Rename xeon08 to eudy
From Eudyptula, a little penguin.
2023-06-16 17:16:05 +02:00
a4141301ad Update rebuild script for all nodes 2023-06-16 12:13:07 +02:00
3a07842480 Add ssh host keys 2023-06-16 12:01:12 +02:00
e2aa26a8b3 Set the name of the slurm cluster to jungle 2023-06-16 12:00:54 +02:00
ebf45be2b5 Change owl hostnames 2023-06-16 11:42:39 +02:00
e0ab4e1408 Add owl and all partition 2023-06-16 11:34:00 +02:00