git clone
the project into~/cluster_tool
, and thencd ~/cluster_tool
- upload the
zookeeper
file into./controller/zookeeper
- upload the executable file into
./workers/betterpob
chmod +x start.py
./start.py
- use
./delete.sh
to delete the log files, and./cancel.sh
to cancel the running experiments
./start.py
can be run with the following options:
--number
,-n
: number of machines needed, default value is4
--partition
,-p
: partition to filter by, default partition ismedium
--cpu
,-c
: number of cpus per machine, default value is10
--step
,-s
: number of nodes per machine, default value is50
--time
,-t
: maximum experiment time, default time is5:00:00
. Note that format of time is DD-HH:MM:SS.