180 Commits

Author SHA1 Message Date
772e0f00fb Update slurm to 23.02.05.1 2023-09-13 17:44:24 +02:00
de3a28b7df Monitor storage nodes via IPMI too 2023-09-13 15:57:13 +02:00
a05d87d4b9 Enable fstrim service 2023-09-12 16:39:45 +02:00
826d6263fd Serve the nix store from hut 2023-09-12 12:19:43 +02:00
b0b04e8fb1 Add encrypted munge key with agenix 2023-09-08 19:01:57 +02:00
a5e81fea95 Remove unused large port hole in firewall 2023-09-08 18:22:48 +02:00
dd616a7fb1 Make exporters listen in localhost only 2023-09-08 18:13:04 +02:00
e41404f619 Allow only some ports for srun 2023-09-08 17:51:37 +02:00
1c7ce3fc51 Block ssfhead from reaching our slurm daemon 2023-09-08 17:20:32 +02:00
bdd03dac60 Poweroff idle slurm nodes after 1 hour 2023-09-08 13:31:23 +02:00
21b38de26d Add IB and IPMI node host names 2023-09-08 13:21:37 +02:00
52d3794b14 flake.lock: Update
Flake lock file updates:

• Updated input 'bscpkgs':
    'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs/heads/master&rev=ee24b910a1cb95bd222e253da43238e843816f2f' (2023-09-01)
  → 'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs/heads/master&rev=6122fef92701701e1a0622550ac0fc5c2beb5906' (2023-09-07)
2023-09-07 11:13:45 +02:00
d91c9b7473 Unlock ovni gitlab runners 2023-09-05 16:24:27 +02:00
6b526f9827 flake.lock: Update
Flake lock file updates:

• Updated input 'bscpkgs':
    'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs/heads/master&rev=18d64c352c10f9ce74aabddeba5a5db02b74ec27' (2023-08-31)
  → 'git+https://pm.bsc.es/gitlab/rarias/bscpkgs.git?ref=refs/heads/master&rev=ee24b910a1cb95bd222e253da43238e843816f2f' (2023-09-01)
• Updated input 'nixpkgs':
    'github:NixOS/nixpkgs/d680ded26da5cf104dd2735a51e88d2d8f487b4d' (2023-08-19)
  → 'github:NixOS/nixpkgs/e56990880811a451abd32515698c712788be5720' (2023-09-02)
2023-09-05 15:03:26 +02:00
ae4ad95902 Add agenix to all nodes 2023-09-04 22:09:40 +02:00
3cc7b33c5a Add agenix module to ceph 2023-09-04 22:06:20 +02:00
8fc87885da Remove old secrets 2023-09-04 22:04:32 +02:00
1ea8912d6c Mount /ceph in owl1 and owl2 2023-09-04 22:00:36 +02:00
7d9e7e4e83 Warn about the owl2 omnipath device 2023-09-04 22:00:17 +02:00
779b591d40 Clean owl2 configuration 2023-09-04 21:59:56 +02:00
c13022596a Move the ceph client config to an external module 2023-09-04 21:59:04 +02:00
875622ad0f Reorganize secrets and ssh keys
The agenix tools needs to read the secrets from a standalone file, but
we also need the same information for the SSH keys.
2023-09-04 21:36:31 +02:00
a7eddecf80 Add anavarro user 2023-09-04 16:00:01 +02:00
fcddbdb72b Set zsh inc_append_history option 2023-09-03 16:57:53 +02:00
bfb5363d94 Set zsh shell for rarias 2023-09-03 16:46:27 +02:00
44c1d958f4 Enable zsh and fix key bindings 2023-09-03 11:51:53 +02:00
e334891c41 Keep a log over time with the config commits 2023-09-02 23:49:41 +02:00
ea73a72b79 Configure bscpkgs.nixpkgs to follow nixpkgs 2023-09-02 23:37:59 +02:00
13b2379d97 Store nixos config in /etc/nixos/config.rev 2023-09-02 23:37:11 +02:00
48727d3a88 Enable binary emulation for other architectures 2023-08-31 17:22:36 +02:00
b9598df864 Enable watchdog 2023-08-29 22:26:12 +02:00
a0e447301e Enable all osd on boot in lake2 2023-08-29 18:47:25 +02:00
4495cbf380 Scrape lake2 too 2023-08-29 12:33:26 +02:00
042d85ba61 Also enable monitoring in lake2 2023-08-29 12:29:41 +02:00
c47c190c79 Scrape metrics from bay 2023-08-29 11:58:00 +02:00
a1271f007f Add monitoring in the bay node 2023-08-29 11:53:32 +02:00
042e56b5b2 Add fio tool 2023-08-29 11:27:50 +02:00
a510a41eed Add ceph tools in hut too 2023-08-28 17:58:21 +02:00
a68909f96c Switch ceph logs to journal 2023-08-28 17:58:08 +02:00
3c523572cb Update ceph to 18.2.0 in overlay 2023-08-25 18:12:46 +02:00
7cd15b9732 Move pkgs overlay to overlay.nix 2023-08-25 18:12:00 +02:00
7ae2403db8 Enable ceph osd daemons in lake2 2023-08-25 14:44:53 +02:00
e8824bf72e Add the lake2 hostname to the hosts 2023-08-25 14:44:35 +02:00
e46ded9843 Use the sda for lake2 2023-08-25 13:40:10 +02:00
d6d3624617 Remove netboot module 2023-08-25 13:39:01 +02:00
300690df4c Disable pixiecore in hut for now 2023-08-25 13:21:00 +02:00
9d15c13a44 Add PXE helper 2023-08-25 12:03:30 +02:00
3c030307f1 Enable netboot again for PXE 2023-08-24 19:08:23 +02:00
d30399d31b Specify the disk by path 2023-08-24 15:27:37 +02:00
9ac05ed4c0 Prepare lake2 config after bootstrap
The disk ID is different under NixOS.
2023-08-24 13:54:22 +02:00