Commit Graph

487 Commits

Author SHA1 Message Date
Raúl Peñacoba
4d629fe8f7 hpcg: remove old comments 2021-04-16 09:32:28 +02:00
Raúl Peñacoba
f5c8d0cb88 hpcg: choose a smaller strong scaling problem size 2021-04-16 09:32:28 +02:00
Raúl Peñacoba
cb6577b439 hpcg: add strongscaling
HPCG rounds problem size axis when its value is < 16
2021-04-16 09:32:28 +02:00
Raúl Peñacoba
b60a46b683 hpcg: add weakscaling over some nblocks to check which axis is better 2021-04-16 09:32:28 +02:00
Raúl Peñacoba
1a6075a2b1 hpcg: add first granularity/scalability exps for tampi+isend+oss+task
- oss.nix runs valid hpcg layouts whereas slices.nix does not
2021-04-16 09:32:28 +02:00
12ff1fd506 garlicd: send logs to the builder 2021-04-16 09:29:33 +02:00
732b0c0e9c garlic tool: improve unit status information 2021-04-16 09:29:33 +02:00
64f077c4f6 stages: prepend the stage name to messages 2021-04-16 09:29:33 +02:00
7c94997023 control: add trap for bad exit 2021-04-16 09:29:33 +02:00
fb0dee4b61 exp: move exit1 experiment to slurm 2021-04-16 09:29:33 +02:00
bde54c69c5 sbatch: store queued status 2021-04-16 09:29:33 +02:00
2151e20bd6 exp: add exit1 experiment
Tests unit bad exits
2021-04-16 09:29:33 +02:00
886d16bcc6 garlic tool: add jq as dependency
So we can parse the experiment configuration in JSON
2021-04-16 09:29:33 +02:00
5c0f179830 stdexp: rename "name" to "clusterName" 2021-04-16 09:29:33 +02:00
422d359b48 script: stop on error by default 2021-04-16 09:29:33 +02:00
60248ab06b article: remove not used figures 2021-04-16 09:29:33 +02:00
1cb63b464d osu: adjust figures for publication 2021-04-16 09:29:33 +02:00
821b4f0d15 rplot: patch scales and fontconfig 2021-04-16 09:29:33 +02:00
0cf35decc5 osu: add mtu and eager experiments 2021-04-16 09:29:33 +02:00
26e3a86c78 garlic tool: check the presence of all the units
This check prevents a user from removing units between the
execution of the experiment and the fetch.
2021-04-16 09:29:33 +02:00
f842f1e01d slurm: add sigsegv experiment
Ensure that we can catch a sigsegv signal before and after the
MPI_Finalize call.
2021-04-16 09:29:33 +02:00
71c06d02da stages: add baywatch stage to check the exit code
This workaround stage prevents srun from returning 0 to the upper stages
when a signal happens after MPI_Finalize. It writes the return code to a
file named .srun.rc.$rank and later checks that exists and contains a 0.

When the program is killed, exits with non-zero and the error is
propagated to the baywatch stage, which aborts immediately without
creating the rc file.
2021-04-16 09:29:26 +02:00
07253c3fa0 fwi: update figure index 2021-04-14 17:18:46 +02:00
eab323a13a fwi: update io figure 2021-04-14 17:18:24 +02:00
8ce2a68cd7 fwi: update strong scaling figure script 2021-04-14 17:16:12 +02:00
99c6196734 fwi: update granularity figure 2021-04-14 17:05:09 +02:00
dd75a840ce fwi: use enableIO instead of ioFreq 2021-04-12 20:09:17 +02:00
e49e3b087f fwi: rename big io experiment 2021-04-12 19:49:31 +02:00
59040d9355 fwi: fix inverted resources 2021-04-12 19:31:35 +02:00
6422741cb7 fwi: merge io experiments into one file
The enableExtended parameter control if the experiment runs with
multiple nodes or only one.
2021-04-12 19:27:45 +02:00
99beac9b23 fwi: generate the model in every node
As we are using local storage, we need a copy of the input in every
node. The current method is to run the generator only in the rank which
has assigned the cpu 0 in the mask.
2021-04-12 19:01:10 +02:00
58dc277d3d fwi: refactor ss-io with common.nix
Also, keep the names short and consistent.
2021-04-12 17:57:46 +02:00
47b326c646 fwi: generate the input at runtime 2021-04-12 17:46:07 +02:00
419e7f95cc fwi: avoid input generation
The ModelGenerator is now included in the fwi-params, so that the input
can be generated at runtime.
2021-04-12 17:43:30 +02:00
b0af9b8608 srun: add postSrun hook 2021-04-12 17:41:59 +02:00
4afda7dbfb fwi: use common.nix in sync_io experiment 2021-04-12 16:27:18 +02:00
02a103565c fwi: use common.nix in reuse experiment 2021-04-12 15:48:59 +02:00
788dd13ebd fwi: merge mpi pure experiment
The getResources function is used to assign the proper cpu binding
depending on the version. However, additional contraints are required to
ensure that we have enough points in Y.

By default the mpi+send+seq branch is disabled.
2021-04-12 15:37:39 +02:00
41665bc6fc fwi: refactor config generation into common.nix 2021-04-12 15:01:25 +02:00
9aa07993b2 fwi: refactor ss and granularity experiments
A common.nix file contains the shared stages
2021-04-12 14:41:26 +02:00
e0a68c077c fwi: merge forkjoin ss experiment into one file
Additional options are only active with enableExtended = true
2021-04-12 12:51:10 +02:00
989f6ee018 fwi: adjust input size to meet timing constraints
The previous iniput size for both granularity and strong scaling tests
where too big to meet the timing constrains needed for garlic. This
patch sets a new, smaller, input size.

Also, a minor cleanup is applied to the rest of the fwi experiments
and figures.
2021-04-07 12:44:14 +02:00
3e5a56ebdb fwi: add tampi non-blocking variant 2021-04-07 12:44:14 +02:00
3ef4a505d3 fwi: add strong scalability tests 2021-04-07 12:44:14 +02:00
aadce016e1 fwi: add granularity and data reuse experiments
The data reuse experiment shows the effect of poor data locality versus
task granularity.
2021-04-07 12:44:14 +02:00
1d9a5c4721 fwi: fix input derivation
The fwiInput derivation must be the same used when compiled the fwi app
as the fwi-input used in the experiment.
2021-04-07 12:44:14 +02:00
11e400abb5 fwi: remove old experiment 2021-04-07 12:44:14 +02:00
a8477b1b05 fwi: add test figure with the time 2021-04-07 12:44:14 +02:00
7a6cbd3a9e fwi: update test experiment 2021-04-07 12:44:14 +02:00
3de7b5a0b6 fwi: save the params and frequencies files 2021-04-07 12:44:14 +02:00
485b9150e5 fwi: add problem size parameters 2021-04-07 12:44:14 +02:00
fa0e9f591f fwi: update repo url to PM server 2021-04-07 12:44:13 +02:00
de175b2380 fwi: fix input name 2021-04-07 12:44:13 +02:00
bfbbc294ae fwi: split into input and solver
All branches compile with several hacks.
2021-04-07 12:44:13 +02:00
9bea3cc264 fwi: add oss experiment 2021-04-07 12:44:13 +02:00
f10f8472ac fwi: add seq test experiment 2021-04-07 12:44:13 +02:00
26ad3e49f7 fwi: add gitBranch and copy params 2021-04-07 12:44:13 +02:00
312656ce54 heat: rename granul -> granularity experiment 2021-04-06 18:42:49 +02:00
63aa07dad5 heat: update granularity plot with modern ggplot 2021-04-06 18:40:19 +02:00
d1c32869c1 heat: split granularity with extended mode
The HWC version is not yet complete.
2021-04-06 18:38:15 +02:00
3566cf0152 develop: add paraver package 2021-04-06 11:14:30 +02:00
0b7e92b6f9 heat: add bar plot with time distribution 2021-04-06 11:05:56 +02:00
f8122f3c8b heat: use the hcut tool to limit the cpus 2021-04-06 11:05:56 +02:00
d68ce914ba heat: use cut to partition the trace
The awk script doesn't take in consideration the events close to the cut
points, which are significative with low parallelism.
2021-04-06 11:05:51 +02:00
cb482fa3ea heat: remove perf from the ctf experiment
As we would be extracting perf stats from the trace processing steps.
2021-04-06 11:05:10 +02:00
8a97fefafa saiph: simplify and update figure scripts 2021-04-01 19:25:38 +02:00
10b1ff8f7a saiph: simplify granularity and ss experiments 2021-04-01 19:25:38 +02:00
0e0f1b265f saiph: add extra parameters for the app 2021-04-01 19:25:38 +02:00
5ea9ff5ad8 machines: add cache sizes 2021-04-01 19:25:38 +02:00
Sandra
2b36e33b7e saiph: modify apps parameters 2021-04-01 19:25:37 +02:00
Sandra
b64b864194 saiph: clean exps and figs 2021-04-01 19:25:37 +02:00
Sandra
72e7a8dab7 shell: add clangOmpss2 and gdb 2021-04-01 19:25:24 +02:00
Sandra
46536548ca saiph: update scaling exp and figures 2021-04-01 19:24:38 +02:00
Sandra
8406c1c4e5 saiph: add total number of local blocks (#tasks) parameter 2021-04-01 19:24:38 +02:00
Sandra
bc912162a0 index: add vtk and boost 2021-04-01 19:24:38 +02:00
Sandra
4e727bf632 shell: add nix-diff 2021-04-01 19:24:38 +02:00
Sandra
5c7af00dfa saiph: add debug/asan flags parameters 2021-04-01 19:24:38 +02:00
Sandra
5caf2f79f3 saiph: change scaling R script 2021-04-01 19:24:38 +02:00
Sandra
a90c044c3e saiph: add manual global blocking
Ensure cuts in a single dimension
2021-04-01 19:24:38 +02:00
Sandra
99532c9c60 saiph: add manual distribution and nbl/nbg 2021-04-01 19:24:34 +02:00
Sandra
ddef901e2f saiph: add nsteps parameter to experiments 2021-03-30 18:54:35 +02:00
Sandra
1ae5acfe6a saiph: add nsteps in saiph app 2021-03-30 18:54:35 +02:00
d108306a29 saiph: add blocking experiments to index
Remove unused environment variables as well.
2021-03-30 18:54:35 +02:00
e0fbbe32a6 saiph: update granularity experiment and R script 2021-03-30 18:54:35 +02:00
Sandra
37e11c749f saiph: add cacheline compilation parameter 2021-03-30 18:54:35 +02:00
Sandra
02a62c18ac saiph: add strong scaling experiment 2021-03-30 18:54:35 +02:00
Sandra
0ac0205366 saiph: add figures for blocking experiment 2021-03-30 18:54:35 +02:00
Sandra
a2306eb941 saiph: add some blocking experiments 2021-03-30 18:54:35 +02:00
Sandra
38d4d0b48c saiph: delete extrae XML configuration files 2021-03-30 18:54:35 +02:00
63b08fa4e8 saiph: use nby for granularity plot 2021-03-30 18:54:35 +02:00
992af14c7f saiph: add scaling experiment 2021-03-30 18:54:35 +02:00
99f3326609 saiph: allow custom gitCommit 2021-03-30 18:54:35 +02:00
a4b2dfddb4 saiph: update granularity experiment 2021-03-30 18:54:35 +02:00
830d648925 saiph: reduce the number of loops
The current app Heat3D_vect has a long initialization time
2021-03-30 18:54:16 +02:00
e4ab177d6c saiph: remove dangerous Intel MPI envvar
It is no longer used, as we have moved to the release library version.
2021-03-30 17:56:26 +02:00
b7dcf7bc69 rplot: add support for gziped datasets 2021-03-30 16:35:47 +02:00
5ac581b573 creams: remove pure mpi from granularity 2021-03-30 16:14:32 +02:00
b900cb95f0 creams: make configurations unique 2021-03-30 16:14:11 +02:00
389d3f6310 creams: simplify granularity figure 2021-03-30 16:07:14 +02:00
76deac0a63 creams: update figures using one single pipeline 2021-03-30 15:59:52 +02:00