Setup a distributed filesystem #11
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The cluster has 3 nodes specifically configured for storage, in particular MDS and OSS-0 and OSS-1, which correspond to the names of the MetaData Service and Object Storage Service needed by Lustre.
Based on the documentation, the OSS nodes have 4 disks of 2TB each, a total of 16 TB of storage, which currently is completely disregarded. The current setup uses a single 1TB disk in the login node served via NFS to the compute nodes, which is almost full. Also the storage is served via the Ethernet port (1Gbit/s), and using the OmniPath network may be a better idea.
Lustre and Ceph seem to be appropriate candidates. However, Lustre seems to be incompatible with the latest kernel version.
changed the description
marked the checklist item Contact Ramón Nou to erase the disks in the MDS, OSS1 and OSS2 nodes (currently used by their Lustre installation). as completed
changed the description
marked the checklist item Take control over mds01 as completed
The mds01 and oss02 nodes have a lot of unused disk space:
marked the checklist item Install nixos in one of the disks as completed
Ceph setup and running on MDS, renamed as bay:
changed the description
mentioned in merge request !19
marked the checklist item Test Ceph as completed
marked the checklist item Mount the ceph FS in the other nodes as completed
The installation in bay seems to be working fine. Let's move on to the oss nodes.
The node oss01 is waiting for the voltage regulator (see #22) so I will start by oss02.
changed the description