A hypergraph consists of vertices and hyperedges. A hyperedge connects one or more vertices. A graph is a special case of a hypergraph where each edge has size two (two vertices). The hypergraph model is well suited to parallel computing, where vertices correspond to data objects and hyperedges represent the communication requirements. The basic partitioning problem is to partition the vertices into k approximately equal sets such that the number of cut hyperedges is minimized. Most partitioners (including Zoltan-PHG) allows a more general model where both vertices and hyperedges can be assigned weights. It has been shown that the hypergraph model gives a more accurate representation of communication cost (volume) than the graph model. In particular, for sparse matrix-vector multiplication, the hypergraph model exactly represents communication volume. Sparse matrices can be partitioned either along rows or columns; in the row-net model the columns are vertices and each row corresponds to an hyperedge, while in the column-net model the roles of vertices and hyperedges are reversed.
Zoltan contains a native parallel hypergraph partitioner, called PHG
(Parallel HyperGraph partitioner). In addition, Zoltan provides
access to PaToH,
a serial hypergraph partitioner.
Note that PaToH is not part of Zoltan and should be obtained
separately from the
PaToH web site.
Zoltan-PHG is a fully parallel multilevel hypergraph partitioner. For
further technical description, see [Devine et al, 2006].
Method String: | HYPERGRAPH |
Parameters: | |
HYPERGRAPH_PACKAGE |
PHG (parallel) or PaToH (serial) |
CHECK_HYPERGRAPH |
Check if input data is valid.
(Slows performance;intended for debugging.) |
PHG_OUTPUT_LEVEL |
Level of verbosity; 0 is silent. |
PHG_FINAL_OUTPUT |
Print stats about final
partition? (0/1) |
PHG_NPROC_VERTEX |
Desired number of processes in the vertex direction (for 2D internal layout) |
PHG_NPROC_HEDGE |
Desired number of processes in the hyperedge direction (for 2D internal layout) |
PHG_COARSENING_METHOD | The method to use in matching/coarsening; currently these are
available. agg - agglomerative inner product matching (a.k.a. heavy connectivity matching) ipm - inner product matching (a.k.a. heavy connectivity matching) c-ipm - column ipm; faster method based on ipm within processor columns a-ipm - alternate between fast method (l-ipm ) and ipm l-ipm - local ipm on each processor. Fastest option but often gives poor quality. h-ipm - hybrid ipm that uses partial c-ipm followed by ipm on each level |
PHG_COARSENING_LIMIT |
Number of vertices at which to stop coarsening. |
PHG_VERTEX_VISIT_ORDER |
Ordering of vertices in greedy
matching scheme: 0 - random 1 - natural order (as given by the query functions) 2 - increasing vertex weights 3 - increasing vertex degree 4 - increasing vertex degree, weighted by pins |
PHG_EDGE_SCALING |
Scale edge weights by some
function of size of the hyperedges: 0 - no scaling 1 - scale by 1/(size-1) [absorption scaling] 2 - scale by 2/((size*size-1)) [clique scaling] |
PHG_VERTEX_SCALING |
Variations in "inner product"
similarity metric (for matching): 0 - Euclidean inner product: <x,y> 1 - cosine similarity: <x,y>/(|x|*|y|) 2 - <x,y>/(|x|^2 * |y|^2) 3 - scale by sqrt of vertex weights 4 - scale by vertex weights |
PHG_COARSEPARTITION_METHOD | Method to partition the coarsest (smallest) hypergraph;
typically done in serial: random - random linear - linear (natural) order greedy - greedy method based on minimizing cuts auto - automatically select from the above methods (in parallel, the processes will do different methods) |
PHG_REFINEMENT_METHOD |
Refinement algorithm: fm - two-way approximate FM none - no refinement |
PHG_REFINEMENT_LOOP_LIMIT | Loop limit in FM refinement. Higher number means more
refinement. |
PHG_REFINEMENT_MAX_NEG_MOVE |
Maximum number of negative moves allowed in FM. |
PHG_BAL_TOL_ADJUSTMENT |
Controls how the balance tolerance is adjusted at
each level of bisection. |
PHG_RANDOMIZE_INPUT |
Randomize layout of vertices and
hyperedges in internal parallel 2D layout? (0/1) |
PHG_EDGE_WEIGHT_OPERATION | Operation to be applied to edge
weights supplied by different processes for the same hyperedge: add - the hyperedge weight will be the sum of the supplied weights max - the hyperedge weight will be the maximum of the supplied weights error - if the hyperedge weights are not equal, Zoltan will flag an error, otherwise the hyperedge weight will be the value returned by the processes |
EDGE_SIZE_THRESHOLD |
Ignore hyperedges greater than this fraction times
number of vertices. |
PATOH_ALLOC_POOL0 |
Memory allocation for PaToH; see
the PaToH manual for details. |
PATOH_ALLOC_POOL1 |
Memory allocation for PaToH; see the PaToH manual for details. |
Default values: | |
HYPERGRAPH_PACKAGE = PHG |
|
CHECK_HYPERGRAPH
= 0 |
|
PHG_OUTPUT_LEVEL=0 | |
PHG_FINAL_OUTPUT=0 | |
PHG_REDUCTION_METHOD=ipm | |
PHG_REDUCTION_LIMIT=100 | |
PHG_VERTEX_VISIT_ORDER=0 | |
PHG_EDGE_SCALING=0 | |
PHG_VERTEX_SCALING=0 | |
PHG_COARSEPARTITION_METHOD=greedy | |
PHG_REFINEMENT_METHOD=fm | |
PHG_REFINEMENT_LOOP_LIMIT=10 | |
PHG_REFINEMENT_MAX_NEG_MOVE=100 | |
PHG_BAL_TOL_ADJUSTMENT=0.7 | |
PHG_RANDOMIZE_INPUT=0 | |
PHG_EDGE_WEIGHT_OPERATION=max | |
EDGE_SIZE_THRESHOLD=0.25 | |
PATOH_ALLOC_POOL0=0 | |
PATOH_ALLOC_POOL1=0 | |
Required Query Functions: | |
ZOLTAN_NUM_OBJ_FN | |
ZOLTAN_OBJ_LIST_FN or ZOLTAN_FIRST_OBJ_FN/ZOLTAN_NEXT_OBJ_FN pair | |
ZOLTAN_HG_SIZE_CS_FN
ZOLTAN_HG_CS_FN |
|
Optional Query Functions: | |
ZOLTAN_HG_SIZE_EDGE_WTS_FN | |
ZOLTAN_HG_EDGE_WTS_FN |
It is possible to provide the graph query functions instead of the hypergraph queries, though this is not recommended. If only graph query functions are registered, Zoltan will automatically create a hypergraph from the graph, but some information (specifically, edge weights) will be lost.