disjoint set

Posted Wujunde

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了disjoint set相关的知识,希望对你有一定的参考价值。

MAKE-SET.x/ creates a new set whose only member (and thus representative)

is x. Since the sets are disjoint, we require that x not already be in some other

set.

UNION.x; y/ unites the dynamic sets that contain x and y, say Sx and Sy, into a

new set that is the union of these two sets. We assume that the two sets are disjoint

prior to the operation. The representative of the resulting set is any member

of Sx [ Sy, although many implementations of UNION specifically choose the

representative of either Sx or Sy as the new representative. Since we require

the sets in the collection to be disjoint, conceptually we destroy sets Sx and Sy,

removing them from the collection S. In practice, we often absorb the elements

of one of the sets into the other set.

FIND-SET.x/ returns a pointer to the representative of the (unique) set containing

x.

 

 

 

linklist implement

 

 

In the worst case, the above implementation of the UNION procedure requires an

average of ‚.n/ time per call because we may be appending a longer list onto

a shorter list; we must update the pointer to the set object for each member of

the longer list. Suppose instead that each list also includes the length of the list

(which we can easily maintain) and that we always append the shorter list onto the

longer, breaking ties arbitrarily. With this simple weighted-union heuristic, a single

UNION operation can still take _.n/ time if both sets have _.n/ members. As

the following theorem shows, however, a sequence of m MAKE-SET, UNION, and

FIND-SET operations, n of which are MAKE-SET operations, takes O.m C n lg n/

time.

Tree implement

 

 

 

Heuristics to improve the running time

So far, we have not improved on the linked-list implementation. A sequence of

n 1 UNION operations may create a tree that is just a linear chain of n nodes. By

using two heuristics, however, we can achieve a running time that is almost linear

in the total number of operations m.

The first heuristic, union by rank, is similar to the weighted-union heuristic we

used with the linked-list representation. The obvious approach would be to make

the root of the tree with fewer nodes point to the root of the tree with more nodes.

Rather than explicitly keeping track of the size of the subtree rooted at each node,

we shall use an approach that eases the analysis. For each node, we maintain a

rank, which is an upper bound on the height of the node. In union by rank, we

make the root with smaller rank point to the root with larger rank during a UNION

operation.

The second heuristic, path compression, is also quite simple and highly effective.

As shown in Figure 21.5, we use it during FIND-SET operations to make each

node on the find path point directly to the root. Path compression does not change

any ranks(more nodes linked to the root will cause much possibility to find it).

 

 

When we use both union by rank and path compression, the worst-case running
time is O.m ˛.n//, where ˛.n/ is a very slowly growing function, which we define
in Section 21.4. In any conceivable application of a disjoint-set data structure,
˛.n/  4; thus, we can view the running time as linear in m in all practical situations.
Strictly speaking, however, it is superlinear. In Section 21.4, we prove this
upper bound.

 1 package disjoint_sets;
 2 // there have two ways,one is the linkedlist,the other is the tree,use the tree here
 3 public class disjoint_set {
 4     private static class Node{
 5         private Node p;
 6         private int rank;
 7         private String name;
 8         public Node(String na){
 9             p = this; rank = 0;name = na;
10         }
11     }
12     public static void union(Node x,Node y){
13         link(findset(x),findset(y));
14     }
15     public static void link(Node x,Node y){
16         if(x.rank > y.rank){
17             y.p = x;
18         }
19         else if(y.rank > x.rank){
20             x.p = y;
21         }
22         else{
23             y.p = x;
24             x.rank = x.rank + 1;
25         }
26     }
27     public static Node findset(Node x){
28         if(x != x.p){
29             x.p = findset(x.p);  //path compression
30         }
31         return x.p;
32     }
33     public static void print(Node x){
34         
35             System.out.println(x.name);
36             if(x != x.p){
37             x = x.p;
38             print(x);
39             }
40             return;
41     }
42     public static void main(String[] args) {
43         Node a = new Node("a");
44         Node b = new Node("b");
45         Node c = new Node("c");
46         Node d = new Node("d");
47         union(a,b);
48         union(b,c);
49         union(a,d);
50         print(d);
51         
52         
53     }
54 
55 }

 

以上是关于disjoint set的主要内容,如果未能解决你的问题,请参考以下文章

c_cpp 实现disjoint_set数据结构

Disjoint Sets

The Tree-planting Day and Simple Disjoint Sets

并查集(Data Structure for Disjoint Sets)

并查集(Data Structure for Disjoint Sets)

三路集合不相交(Three-Way Set Disjointness)算法分析