Here, we briefly explain different ways to use R in parallel on the Harvard University FASRC Cannon cluster.
- For R basics, refer to R and R Studio
- For R Parallel, refer to R-Parallel
- Working with large data that does not fit into memory
- Processing Single instruction multiple data problem on shared and distributed memory systems
-
Package
parallel- FAS RC embarrassingly parallel documentation
- FAS RC embarrassingly parallel Cannon example (using
parLapply) - FAS RC Embarrassingly parallel VDI example (using
parLapply) - parallel documentation
-
Package
future- Install future on Cannon
- Example of
multisession(not shared memory) andmulticore(shared memory) and its submit script - future documentation
-
Package
Rmpi -
Package
pbdMPI(programming big data MPI)- Install pbdMPI on Cannon
- Examples based on the
pbdMPIdemos – after installingpbdMPIpackage, all demos can be found in your R library folder$HOME/apps/R/4.0.5/pbdMPI/demo - pbdMPI documentation and GitHub
- pbdR website
Using nested futures and package future.batchtools, we can perform a multi-node and multi-core job.
- Package
futureandfuture.batchtools