Commit Graph

16 Commits

Author SHA1 Message Date
5c0f179830 stdexp: rename "name" to "clusterName" 2021-04-16 09:29:33 +02:00
71c06d02da stages: add baywatch stage to check the exit code
This workaround stage prevents srun from returning 0 to the upper stages
when a signal happens after MPI_Finalize. It writes the return code to a
file named .srun.rc.$rank and later checks that exists and contains a 0.

When the program is killed, exits with non-zero and the error is
propagated to the baywatch stage, which aborts immediately without
creating the rc file.
2021-04-16 09:29:26 +02:00
b0af9b8608 srun: add postSrun hook 2021-04-12 17:41:59 +02:00
872ad1a289 stdexp: allow preSrun attribute in the srun stage
This option allows an experiment to inject commands before srun starts,
while keeping the standard srun stage options.
2021-03-29 17:46:19 +02:00
0b95ea20b7 garlicd: allow manual experiment executions 2021-02-25 11:37:58 +01:00
e89139284a stdexp: add result and timetable targets
These targets allow one experiment to directly refer to another
experiment results, thus a dependency chain can be formed to ensure
execution order.

It also simplifies the dataset definition, as they can be automatically
fetched from the experiment directly.
2021-02-03 12:37:54 +01:00
5a8cc1e514 stdexp: Run python snippets and import the result 2020-12-10 15:41:49 +01:00
734d494d96 stdexp: Allow extra mounts 2020-11-20 15:30:47 +01:00
b4a3bb0ede New stdexp resource specification
Now the options	cpusPerTask ntasksPerNode nodes and jobName are required
for the sbatch stage. Also cpuBind has been removed and is always set to
"cores,verbose" in the srun stage.
2020-11-17 11:01:34 +01:00
18afcb1f44 Avoid nixpkgs reevaluation
The bsc attrset is now extensible: replacing a few bsc packages is very
fast. Also we allow the complete bscpkgs to be within other custom
overlays (not tested yet).
2020-11-17 10:49:45 +01:00
42f2227a9f sbatch: Use experiment reservation if given 2020-11-13 10:17:54 +01:00
7a37913b4e Set the ssh host from the machine config 2020-10-13 14:30:03 +02:00
a38ff31cca Introduce the runexp stage 2020-10-13 13:00:59 +02:00
298c7362b3 New config design 2020-10-09 19:33:06 +02:00
53dca32469 Simplify experiment 2020-10-09 17:19:00 +02:00
332b738889 Move apps into garlic/apps 2020-10-09 16:42:06 +02:00