doc: link to the user guide in the readme
This commit is contained in:
parent
7d66b34140
commit
9d38a37787
462
README
462
README
@ -1,462 +0,0 @@
|
||||
|
||||
|
||||
bscpkgs: User guide
|
||||
|
||||
|
||||
ABSTRACT
|
||||
|
||||
This repository contains a set of nix packages used in the Barcelona
|
||||
Supercomputing Center by the Programming Models group.
|
||||
|
||||
The current setup uses the xeon07 machine to build packages, which are
|
||||
automatically uploaded to MareNostrum4, due to lack of permissions in
|
||||
the latter to perform the build safely.
|
||||
|
||||
Some preliminary steps must be done manually to be able to build and
|
||||
install packages (derivations in nix jargon).
|
||||
|
||||
1. Introduction
|
||||
|
||||
To easily connect to xeon07 in one step, setup the SSH (for version
|
||||
7.3 and upwards) configuration file in ~/.ssh/config adding these
|
||||
lines:
|
||||
|
||||
Host cobi
|
||||
HostName ssflogin.bsc.es
|
||||
User your-username-here
|
||||
|
||||
Host xeon07
|
||||
ProxyJump cobi
|
||||
HostName xeon07
|
||||
User your-username-here
|
||||
|
||||
You should be able to connect with:
|
||||
|
||||
laptop$ ssh xeon07
|
||||
|
||||
1.1 Network access
|
||||
|
||||
In order to use nix you would need to be able to download the sources
|
||||
from Internet. Usually the download requires the ports 22, 80 and 443
|
||||
to be open for outgoing traffic.
|
||||
|
||||
Check that you have network access in xeon07 provided by the
|
||||
environment variables "http_proxy" and "https_proxy". Try to fetch a
|
||||
webpage with curl, to ensure the proxy is working:
|
||||
|
||||
xeon07$ curl x.com
|
||||
x
|
||||
|
||||
1.2 SSH keys
|
||||
|
||||
Package sources are usually downloaded directly from the git server,
|
||||
so you must be able to access all repositories without a password
|
||||
prompt.
|
||||
|
||||
Most repositories at https://pm.bsc.es/gitlab are open to read for
|
||||
logged in users, but there are some exceptions (for example the nanos6
|
||||
repository) where you must have explicitly granted read access.
|
||||
|
||||
If you don't have a ssh key at ~/.ssh/*.pub in xeon07 create a new one
|
||||
without password protection by running:
|
||||
|
||||
xeon07$ ssh-keygen
|
||||
Generating public/private rsa key pair.
|
||||
Enter file in which to save the key (~/.ssh/id_rsa):
|
||||
Enter passphrase (empty for no passphrase):
|
||||
Enter same passphrase again:
|
||||
Your identification has been saved in ~/.ssh/id_rsa.
|
||||
Your public key has been saved in ~/.ssh/id_rsa.pub.
|
||||
...
|
||||
|
||||
By default it will create the private key at ~/.ssh/id_rsa. Copy the
|
||||
contents of your public ssh key in ~/.ssh/id_rsa.pub and paste it in
|
||||
GitLab at:
|
||||
|
||||
https://pm.bsc.es/gitlab/profile/keys
|
||||
|
||||
Then, configure it for use in the ~/.ssh/config file, adding:
|
||||
|
||||
Host bscpm03.bsc.es
|
||||
IdentityFile ~/.ssh/id_rsa
|
||||
|
||||
Finally verify the SSH connection to the server works and you get a
|
||||
greeting from the GitLab server with your username:
|
||||
|
||||
xeon07$ ssh git@bscpm03.bsc.es
|
||||
PTY allocation request failed on channel 0
|
||||
Welcome to GitLab, @rarias!
|
||||
Connection to bscpm03.bsc.es closed.
|
||||
|
||||
Verify that you can access nanos6/nanos6 repository (otherwise you
|
||||
first need to ask to be granted read access), at:
|
||||
|
||||
https://pm.bsc.es/gitlab/nanos6/nanos6
|
||||
|
||||
Finally, you should be able to download the nanos6/nanos6 git
|
||||
repository without any password interaction by running:
|
||||
|
||||
xeon07$ git clone git@bscpm03.bsc.es:nanos6/nanos6.git
|
||||
|
||||
You will also need to access MareNostrum 4 from the xeon07 node, in
|
||||
order to submit experiments. Add the following lines as well to the
|
||||
~/.ssh/config file and set your user name:
|
||||
|
||||
Host mn0 mn1 mn2
|
||||
User your-mn4-username
|
||||
IdentityFile ~/.ssh/id_rsa
|
||||
|
||||
Then copy the key to MareNostrum 4 (it will ask you the first time for
|
||||
your password):
|
||||
|
||||
xeon07$ ssh-copy-id -i ~/.ssh/id_rsa.pub mn1
|
||||
|
||||
And ensure that you can connect without a password:
|
||||
|
||||
xeon07$ ssh mn1
|
||||
...
|
||||
login1$
|
||||
|
||||
1.3 The bscpkgs repo
|
||||
|
||||
Once you have Internet and you have granted access to the PM GitLab
|
||||
repositories you can begin building software with nix. First ensure
|
||||
that the nix binaries are available from your shell in xeon07:
|
||||
|
||||
xeon07$ nix --version
|
||||
nix (Nix) 2.3.6
|
||||
|
||||
Now you are ready to build and install packages with nix. Clone the
|
||||
bscpkgs repository:
|
||||
|
||||
xeon07$ git clone git@bscpm03.bsc.es:rarias/bscpkgs.git
|
||||
|
||||
Nix looks in the current folder for a file named "default.nix" for
|
||||
packages, so go to the repo directory:
|
||||
|
||||
xeon07$ cd bscpkgs
|
||||
|
||||
Now you should be able to build nanos6:
|
||||
|
||||
xeon07$ nix-build -A bsc.nanos6
|
||||
..
|
||||
/nix/store/3i0qkdywm9xjv2cm1ldx9smb552sf6r1-nanos6-2.4-6f10a32
|
||||
|
||||
The installation is placed in the nix store (with the path stated in
|
||||
the last line of the build process), with the "result" symbolic link
|
||||
pointing to the same location:
|
||||
|
||||
xeon07$ readlink result
|
||||
/nix/store/3i0qkdywm9xjv2cm1ldx9smb552sf6r1-nanos6-2.4-6f10a32
|
||||
|
||||
1.4 Configuration of mn4 (MareNostrum 4)
|
||||
|
||||
In order to execute the programs built at xeon07, you first need to
|
||||
enter nix environment. To do so, add to the end of the file ~/.bashrc
|
||||
in mn4 the following line:
|
||||
|
||||
export PATH=/gpfs/projects/bsc15/nix/bin:$PATH
|
||||
|
||||
Then logout and login again (our source the ~/.bashrc file) and you
|
||||
will now have the `nix-setup` command available. This command executes
|
||||
a new shell where the /nix store is available. To execute it:
|
||||
|
||||
mn4$ nix-setup
|
||||
|
||||
Now you will see a new shell, where you can access the nix store:
|
||||
|
||||
nix|mn4$ ls /nix
|
||||
gcroots profiles store var
|
||||
|
||||
The last build of nanos6 can be also found in mn4 at the same
|
||||
location:
|
||||
|
||||
/nix/store/3i0qkdywm9xjv2cm1ldx9smb552sf6r1-nanos6-2.4-6f10a32
|
||||
|
||||
Remember to enter the nix environment by running `nix-setup` when you
|
||||
need something from the nix store.
|
||||
|
||||
You cannot perform any build operations from mn4: to do so use the
|
||||
xeon07 machine.
|
||||
|
||||
2. Basic usage of nix
|
||||
|
||||
Nix is a package manager which handles easily reproducibility and
|
||||
configuration of packages and dependencies. See more info here:
|
||||
|
||||
https://nixos.org/nix/manual/
|
||||
|
||||
We will only cover the basic usage of nix for the BSC packages.
|
||||
|
||||
2.1 The user environment
|
||||
|
||||
All nix packages are stored under the /nix directory. When you need to
|
||||
"install" some binary from nix, a symlink is added to a folder
|
||||
included in the $PATH variable. In particular, you should have
|
||||
something similar added to your $PATH:
|
||||
|
||||
xeon07$ echo $PATH | sed 's/:/\n/g' | grep nix
|
||||
/home/Computational/rarias/.nix-profile/bin
|
||||
/nix/var/nix/profiles/default/bin
|
||||
|
||||
The first one is your custom installation of packages that are stored
|
||||
in your home directory and the second one is the default installation
|
||||
which contains the nix tools (which are installed in the /nix
|
||||
directory as well).
|
||||
|
||||
Use `nix search` to look for official packages in the "nixpkgs"
|
||||
channel (the default repository of packages):
|
||||
|
||||
xeon07$ nix search cowsay
|
||||
warning: using cached results; pass '-u' to update the cache
|
||||
* cowsay (cowsay)
|
||||
A program which generates ASCII pictures of a cow with a message
|
||||
|
||||
* neo-cowsay (neo-cowsay)
|
||||
Cowsay reborn, written in Go
|
||||
|
||||
* ponysay (ponysay-3.0.3)
|
||||
Cowsay reimplemention for ponies
|
||||
|
||||
* tewisay (tewisay-unstable-2017-04-14)
|
||||
Cowsay replacement with unicode and partial ansi escape support
|
||||
|
||||
When you need a program that is not available in your environment,
|
||||
much like when you use "module load ..." you can use nix-env to modify
|
||||
what is currently loaded. For example:
|
||||
|
||||
xeon07$ nix-env -iA nixpkgs.cowsay
|
||||
|
||||
Notice that you should specify the prefix "nixpkgs." before. The
|
||||
command will download (if not found already in the nix store), compile
|
||||
(if necessary) and load the program `cowsay` from the nixpkgs
|
||||
repository in the environment. You should be able to run it as:
|
||||
|
||||
xeon07$ cowsay "hello world"
|
||||
_____________
|
||||
< hello world >
|
||||
-------------
|
||||
\ ^__^
|
||||
\ (oo)\_______
|
||||
(__)\ )\/\
|
||||
||----w |
|
||||
|| ||
|
||||
|
||||
You can now inspect the ~/.nix-profile/bin folder, and see that a new
|
||||
symlink was added to the actual installation of the binary:
|
||||
|
||||
xeon07$ file ~/.nix-profile/bin/cowsay
|
||||
/home/Computational/rarias/.nix-profile/bin/cowsay: symbolic link to
|
||||
`/nix/store/673gczmhr5b449521srz2n7g1klykz6n-cowsay-3.03+dfsg2/bin/cowsay'
|
||||
|
||||
You can list the current packages installed in your environment by
|
||||
running:
|
||||
|
||||
xeon07$ nix-env -q
|
||||
cowsay-3.03+dfsg2
|
||||
nix-2.3.6
|
||||
|
||||
Notice that this setup only affects your user environment. Also, it is
|
||||
permanent for any new session until you modify the environment again
|
||||
and is immediate, all sessions will have the new environment
|
||||
instantaneously.
|
||||
|
||||
You can remove any package from the environment using:
|
||||
|
||||
xeon07$ nix-env -e cowsay
|
||||
|
||||
See the manual with `nix-env --help` if you want to know more details.
|
||||
|
||||
2.2 Building packages
|
||||
|
||||
Usually, all official packages are already compiled and distributed
|
||||
from a cache server so you don't need to rebuild them again. However,
|
||||
BSC packages are distributed only in source code form as we don't have
|
||||
any binary cache server yet.
|
||||
|
||||
Nix will handle the build process without any user interaction (with a
|
||||
few exceptions which you shouldn't have to worry). If any other user
|
||||
has already built the package then the build process is not needed,
|
||||
and the package is used as is.
|
||||
|
||||
In order to build a BSC package go to the `bscpkgs` directory, and
|
||||
run:
|
||||
|
||||
xeon07$ nix-build -A bsc.dummy
|
||||
|
||||
Notice the "bsc." prefix for BSC packages. The package will be built
|
||||
and installed in the /nix directory, then a symlink is placed in the
|
||||
result directory:
|
||||
|
||||
xeon07$ find result/ -type f
|
||||
result/
|
||||
result/bin
|
||||
result/bin/dummy
|
||||
|
||||
The way in which nix handles the packages and dependencies ensures
|
||||
that the environment of the build process of any package is exactly
|
||||
the same, so the generated output should be the same if the builds are
|
||||
deterministic.
|
||||
|
||||
You can check the reproducibility of the build by adding the "--check"
|
||||
flag, which will rebuild the package and compare the checksum of every
|
||||
file with the ones previously built:
|
||||
|
||||
xeon07$ nix-build -A bsc.dummy --check
|
||||
...
|
||||
xeon07$ echo $?
|
||||
0
|
||||
|
||||
A return code of zero ensures the output is bit by bit identical to
|
||||
the one installed. There are some packages that include
|
||||
indeterministic information in the build process (such as the
|
||||
timestamp of the current time) which will produce an error. Those
|
||||
packages must be patched to ensure the output is deterministic.
|
||||
|
||||
Notice that if you "cd" into the "result/" directory you will be at
|
||||
/nix directory (as you have follow the symlink) where you don't have
|
||||
write permission. Therefore if your program attempts to write to the
|
||||
current directory it will fail. It is recommended to instead run your
|
||||
program from the top directory:
|
||||
|
||||
xeon07$ result/bin/dummy
|
||||
Hello world!
|
||||
|
||||
Or you can install it in the environment:
|
||||
|
||||
xeon07$ nix-env -i ./result
|
||||
|
||||
And "cd" into any directory where you want to output some files and
|
||||
just run it by the name:
|
||||
|
||||
xeon07$ cd /tmp
|
||||
xeon07$ dummy
|
||||
Hello world!
|
||||
|
||||
Finally, you can remove it from the environment if you don't need it:
|
||||
|
||||
xeon07$ nix-env -e dummy
|
||||
|
||||
If you want to know more details use "nix-build --help" to see the
|
||||
manual.
|
||||
|
||||
2.3 The build process
|
||||
|
||||
Each package is built following a programmable configuration
|
||||
description in the nix language. Builds in nix are performed under
|
||||
very strict conditions. No access to any file in the file system is
|
||||
allowed, unless stated in the dependencies, which are in the /nix
|
||||
store only.
|
||||
|
||||
There is no network access in the build process and other restrictions
|
||||
are enforced so that the build environment is reproducible. See more
|
||||
details here:
|
||||
|
||||
https://nixos.wiki/wiki/Nix#Sandboxing
|
||||
|
||||
The top level "default.nix" file of the bscpkgs serves as a index
|
||||
of all BSC packages. You can see the definition for each package, for
|
||||
example the nbody app:
|
||||
|
||||
nbody = callPackage ./bsc/apps/nbody/default.nix {
|
||||
stdenv = pkgs.gcc9Stdenv;
|
||||
mpi = intel-mpi;
|
||||
icc = icc;
|
||||
tampi = tampi;
|
||||
nanos6 = nanos6-git;
|
||||
};
|
||||
|
||||
The compilation details are specified in the
|
||||
"bsc/apps/nbody/default.nix" file. You can configure the package by
|
||||
changing the inputs, for example, what specific implementation of
|
||||
nanos6 or MPI you want to use. To change the MPI implementation to the
|
||||
official MPICH package use:
|
||||
|
||||
nbody = callPackage ./bsc/apps/nbody/default.nix {
|
||||
stdenv = pkgs.gcc9Stdenv;
|
||||
mpi = pkgs.mpich; # Notice pkgs prefix for official packages
|
||||
icc = icc;
|
||||
tampi = tampi;
|
||||
nanos6 = nanos6-git;
|
||||
};
|
||||
|
||||
Then you can rebuild the nbody package:
|
||||
|
||||
xeon07$ nix-build -A bsc.nbody
|
||||
...
|
||||
|
||||
And verify that the binary is indeed linked to MPICH now:
|
||||
|
||||
xeon07$ ldd result/bin/nbody_mpi.N2.2048.exe | grep mpi
|
||||
libmpi.so.12 => /nix/store/dwkkcv78a5bs8smflpx9ppp3klhz3i98-mpich-3.3.2/lib/libmpi.so.12 (0x00007f6be0f07000)
|
||||
|
||||
If you modify a package which another package requires as a
|
||||
dependency, nix will rebuild all required packages to propagate your
|
||||
changes on demand.
|
||||
|
||||
However, if you come back to the original configuration, the package
|
||||
will still be in the /nix store (unless the garbage collector was
|
||||
manually run and removed your old build), so you don't need to rebuild
|
||||
it again.
|
||||
|
||||
For example if nbody is configured back to use Intel MPI:
|
||||
|
||||
nbody = callPackage ./bsc/apps/nbody/default.nix {
|
||||
stdenv = pkgs.gcc9Stdenv;
|
||||
mpi = intel-mpi;
|
||||
icc = icc;
|
||||
tampi = tampi;
|
||||
nanos6 = nanos6-git;
|
||||
};
|
||||
|
||||
The build process now is not required:
|
||||
|
||||
xeon07$ nix-build -A bsc.nbody
|
||||
/nix/store/rbq7wrjcmg6fzd6yhrlnkfvzcavdbdpc-nbody
|
||||
xeon07$ ldd result/bin/nbody_mpi.N2.2048.exe | grep mpi
|
||||
libmpifort.so.12 => /nix/store/jvsjvxj2a08340fpdrqbqix9z3mpp3bd-intel-mpi-2019.7.217/lib/libmpifort.so.12 (0x00007f3a00402000)
|
||||
libmpi.so.12 => /nix/store/jvsjvxj2a08340fpdrqbqix9z3mpp3bd-intel-mpi-2019.7.217/lib/libmpi.so.12 (0x00007f39fed34000)
|
||||
|
||||
Take a look at the different package description files in the
|
||||
bscpkgs repository if you want to understand more details. Also
|
||||
the nix pills are a very good reference:
|
||||
|
||||
https://nixos.org/nixos/nix-pills/
|
||||
|
||||
2.4 Debugging the build process
|
||||
|
||||
It may happen that the build process fails in an unexpected way. Most
|
||||
problems are related to missing dependencies and can be easily found
|
||||
by looking at the error messages.
|
||||
|
||||
Other build problems are more subtle and require more debugging time.
|
||||
One way of inspecting a build problem is by adding the breakpointHook
|
||||
hook to the nativeBuildInputs array in a nix derivation (see
|
||||
https://nixos.org/nixpkgs/manual/#ssec-setup-hooks for more info),
|
||||
which will stop the build process and allow a shell to be attached to
|
||||
the sandbox.
|
||||
|
||||
xeon07$ nix-build -A bsc.nbody
|
||||
...
|
||||
/nix/store/gvqm2yc9xx4vh3nglgckz8siya66jnkx-stdenv-linux/setup: line
|
||||
83: fake-missing-command: command not found
|
||||
build failed in buildPhase with exit code 127
|
||||
To attach install cntr and run the following command as root:
|
||||
|
||||
cntr attach -t command \
|
||||
cntr-/nix/store/sk2nsj7xfr62cjk6m3725ydfyswqz7n1-nbody
|
||||
|
||||
The command must run as root user, so you can use `sudo -i` to run it,
|
||||
(the -i option is required to load the shell profile which provides
|
||||
the nix path containing the cntr tool):
|
||||
|
||||
xeon$ sudo -i cntr attach -t command \
|
||||
cntr-/nix/store/sk2nsj7xfr62cjk6m3725ydfyswqz7n1-nbody
|
||||
nixbld@localhost:/var/lib/cntr> ls
|
||||
bin build dev etc nix proc tmp var
|
||||
|
||||
Then you can inspect the build environment to see why the build
|
||||
failed. Source the build/env-vars file to get the same environment
|
||||
variables (which include the $PATH) of the build process.
|
||||
|
||||
/* vim: set ts=2 sw=2 tw=72 fo=watqc expandtab spell autoindent: */
|
8
README.md
Normal file
8
README.md
Normal file
@ -0,0 +1,8 @@
|
||||
This repository contains the garlic benchmark with all the BSC
|
||||
packages and patches required to compile it.
|
||||
|
||||
See the user guide at
|
||||
<https://pm.bsc.es/ftp/garlic/ug.html> (also available in
|
||||
[PDF](https://pm.bsc.es/ftp/garlic/ug.pdf)).
|
||||
|
||||
Questions? <garlic@bsc.es>
|
Loading…
Reference in New Issue
Block a user