PowerGraph is not designed for fault tolerant execution, and thus if a node fails you will need to restart the computation on all nodes.
The only mechanism which is designed for fault tolerance is the snapshot mechanism (See https://www.usenix.org/system/files/conference/osdi12/osdi12-final-167.pdf - section 7.5)
To operate the snapshot mechanism you need to execute the synchronous engine. Here is the command line option documentation:
snapshot_interval: (default: -1) If set to a positive value, a snapshot
is taken every this number of iterations. If set to 0, a snapshot
is taken before the first iteration. If set to a negative value,
no snapshots are taken. A snapshot is a binary dump of the graph.
snapshot_path: If snapshot_interval is set to a value >=0,
this option must be specified and should contain a target basename
for the snapshot. The path including folder and file prefix in
which the snapshots should be saved