This vignette lists a few runs of different simulations to give an idea of the runtimes that are to be expected, as well as how much disk space is required for different sizes of simulations.
id | n | m | q | hsq | k | sibs | run time (s) | cores |
---|---|---|---|---|---|---|---|---|
1 | 100000 | 100000 | 1000 | 0.5 | 0.05 | 2 | 2974.25 | 7 |
2 | 100000 | 100000 | 500 | 0.5 | 0.05 | 2 | 2936.11 | 7 |
3 | 100000 | 100000 | 10 | 0.5 | 0.05 | 4 | 3355.88 | 7 |
4 | 100000 | 100000 | 1000 | 0.5 | 0.05 | 0 | 4540.06 | 3 |
5 | 100000 | 100000 | 5000 | 0.5 | 0.05 | 3 | 9551.67 | 3 |
6 | 100000 | 100000 | 500 | 0.5 | 0.05 | 4 | 11207.75 | 3 |
7 | 100000 | 100000 | 25 | 0.5 | 0.05 | 2 | 11581.89 | 3 |
The parameters that have the biggest impact on the runtimes of the simulations are n
, m
and sibs
with the rest of the parameters not really affecting the runtimes.
The 7 core system is better than the 3 core system in several other important aspects, so please bear in mind that the above table simplifies matters quite a bit. The clock frequency and amount of available memory, the clock frequency of the CPU and the write speed of the disk also plays a significant role. The latter may be the most important of these, as our tests have shown that the most time consuming part of the simulation is writing results to disk.
n | m | .ped/.map total file size (GB) | .bed/.bim/.fam total file size (GB) |
---|---|---|---|
100000 | 100000 | 39.62 | 2.33 |
10000 | 10000 | 0.37 | 0.02 |
10000 | 100000 | 3.73 | 0.23 |
100000 | 10000 | 3.73 | 0.23 |
50000 | 50000 | 9.31 | 0.58 |
The size of the .ped and .map files pretty much follow the linear function \(file\_size \approx n \cdot m \cdot 3.7 \cdot 10^{-9}\). The same goes for .bed, .bim and .fam with the function \(file\_size \approx n \cdot m \cdot 2.3 \cdot 10^{-10}\).
As seen in the table it is favorable to store the data in .bed/.bim/.fam format, since it only uses approximately \(\frac{1}{17}\) of the space .ped/.map uses. convert_geno_file()
can be used to convert from the .ped-format to the .bed-format.