Erdős–Ko–Rado theorem

Two intersecting families of two-element subsets of a four-element set. The sets in the left family all contain the bottom left element. The sets in the right family avoid this element.

In mathematics, the Erdős–Ko–Rado theorem limits the number of sets in a family of sets for which every two sets have at least one element in common. Paul Erdős, Chao Ko, and Richard Rado proved the theorem in 1938, but did not publish it until 1961. It is part of the field of combinatorics, and one of the central results of extremal set theory.[1]

The theorem applies to families of sets that all have the same size, , and are all subsets of some larger set of size . One way to construct a family of sets with these parameters, each two sharing an element, is to choose a single element to belong to all the subsets, and then form all of the subsets that contain the chosen element. The Erdős–Ko–Rado theorem states that when is large enough for the problem to be nontrivial () this construction produces the largest possible intersecting families. When there are other equally-large families, but for larger values of only the families constructed in this way can be largest.

The Erdős–Ko–Rado theorem can also be described in terms of hypergraphs or independent sets in Kneser graphs. Several analogous theorems apply to other kinds of mathematical object than sets, including linear subspaces, permutations, and strings. They again describe the largest possible intersecting families as being formed by choosing an element and forming the family of all objects that contain the chosen element.

Statement

[edit]

Suppose that is a family of distinct -element subsets of an -element set with , and that each two subsets share at least one element. Then the theorem states that the number of sets in is at most the binomial coefficient The requirement that is necessary for the problem to be nontrivial: when , all -element sets intersect, and the largest intersecting family consists of all -element sets, with size .[2]

The same result can be formulated as part of the theory of hypergraphs. A family of sets may also be called a hypergraph, and when all the sets (which are called "hyperedges" in this context) are the same size , it is called an -uniform hypergraph. The theorem thus gives an upper bound for the number of pairwise overlapping hyperedges in an -uniform hypergraph with vertices and .[3]

The Kneser graph , with a vertex for each 2-element subset of the 5-element set {1,2,3,4,5} and an edge for each pair of disjoint subsets. According to the Erdős–Ko–Rado theorem, the independent sets in this graph have at most four vertices.

The theorem may also be formulated in terms of graph theory: the independence number of the Kneser graph for is This is a graph with a vertex for each -element subset of an -element set, and an edge between every pair of disjoint sets. An independent set is a collection of vertices that has no edges between its pairs, and the independence number is the size of the largest independent set.[4] Because Kneser graphs have symmetries taking any vertex to any other vertex (they are vertex-transitive graphs), their fractional chromatic number equals the ratio of their number of vertices to their independence number, so another way of expressing the Erdős–Ko–Rado theorem is that these graphs have fractional chromatic number exactly .[5]

History

[edit]

Paul Erdős, Chao Ko, and Richard Rado proved this theorem in 1938 after working together on it in England. Rado had moved from Berlin to the University of Cambridge and Erdős from Hungary to the University of Manchester, both escaping the influence of Nazi Germany; Ko was a student of Louis J. Mordell at Manchester.[6] However, they did not publish the result until 1961,[7] with the long delay occurring in part because of a lack of interest in combinatorial set theory in the 1930s, and increased interest in the topic in the 1960s.[6] The 1961 paper stated the result in an apparently more general form, in which the subsets were only required to be size at most , and to satisfy the additional requirement that no subset be contained in any other.[7] A family of subsets meeting these conditions can be enlarged to subsets of size exactly either by an application of Hall's marriage theorem,[8] or by choosing each enlarged subset from the same chain in a symmetric chain decomposition of sets.[9]

Maximum and maximal families

[edit]

Families of maximum size

[edit]
The Johnson graph , with a vertex for each two-element subset of {1,2,3,4} and an edge connecting intersecting pairs of subsets, arranged geometrically as an octahedron with disjoint sets at opposite vertices. The largest intersecting families for and come from the triangular faces of this octahedron. Replacing a set in one of these families by its complement corresponds to moving from a triangle to one of its three neighboring triangles.

A simple way to construct an intersecting family of -element sets whose size exactly matches the Erdős–Ko–Rado bound is to choose any fixed element , and let consist of all -element subsets that include . For instance, for 2-element subsets of the 4-element set , with , this produces the family

, , and .

Any two sets in this family intersect, because they both include . The number of sets is , because after the fixed element is chosen there remain other elements to choose, and each set chooses of these remaining elements.[10]

When this is the only intersecting family of this size. However, when , there is a more general construction. Each -element set can be matched up to its complement, the only -element set from which it is disjoint. Then, choose one set from each of these complementary pairs. For instance, for the same parameters above, this more general construction can be used to form the family

, , and ,

where every two sets intersect despite no element belonging to all three sets. In this example, all of the sets have been complemented from the ones in the first example, but it is also possible to complement only some of the sets.[10]

When , families of the first type (variously known as stars,[1] dictatorships,[11] juntas,[11] centered families,[12] or principal families[13]) are the unique maximum families. In this case, a family of nearly-maximum size has an element which is common to almost all of its sets.[14] This property has been called stability,[13] although the same term has also been used for a different property, the fact that (for a wide range of parameters) deleting randomly-chosen edges from the Kneser graph does not increase the size of its independent sets.[15]

Maximal intersecting families

[edit]
The seven points and seven lines (one drawn as a circle) of the Fano plane form a maximal intersecting family.

An intersecting family of -element sets may be maximal, in that no further set can be added (even by extending the ground set) without destroying the intersection property, but not of maximum size. An example with and is the set of seven lines of the Fano plane, much less than the Erdős–Ko–Rado bound of 15.[16] More generally, the lines of any finite projective plane of order form a maximal intersecting family that includes only sets, for the parameters and . The Fano plane is the case of this construction.[17]

The smallest possible size of a maximal intersecting family of -element sets, in terms of , is unknown but at least for .[18] Projective planes produce maximal intersecting families whose number of sets is , but for infinitely many choices of there exist smaller maximal intersecting families of size .[17]

The largest intersecting families of -element sets that are maximal but not maximum have size They are formed from an element and an -element set not containing , by adding to the family of -element sets that include both and at least one element of . This result is called the Hilton–Milner theorem, after its proof by Anthony Hilton and Eric Charles Milner in 1967.[19]

Proofs

[edit]

The original proof of the Erdős–Ko–Rado theorem used induction on . The base case, for , follows easily from the facts that an intersecting family cannot include both a set and its complement, and that in this case the bound of the Erdős–Ko–Rado theorem is exactly half the number of all -element sets. The induction step for larger uses a method called shifting, of substituting elements in intersecting families to make the family smaller in lexicographic order and reduce it to a canonical form that is easier to analyze.[20]

In 1972, Gyula O. H. Katona gave the following short proof using double counting.[21]

Let be any intersecting family of -element subsets of an -element set. Arrange all elements into any cyclic order, and consider the sets from that form intervals of length within this chosen cyclic order. For example if and , one possible cyclic order for the numbers is the order , which has eight 3-element intervals (including the ones that wrap around):
, , , , , , , and .

However, only some of these intervals can belong to , because they do not all intersect. Katona's key observation is that at most intervals from a single cyclic order may belong to . This is because, if is one of these intervals, then every other interval of the same cyclic order that belongs to separates from , for some , by containing precisely one of these two elements. The two intervals that separate these elements are disjoint, so at most one of them can belong to . Thus, the number of intervals in is at most one plus the number of pairs that can be separated.[21]

Based on this idea, it is possible to count the pairs , where is a set in and is a cyclic order for which is an interval, in two ways. First, for each set one may generate by choosing one of permutations of and permutations of the remaining elements, showing that the number of pairs is . And second, there are cyclic orders, each of which has at most intervals of , so the number of pairs is at most . Comparing these two counts gives the inequality and dividing both sides by gives the result[21]

It is also possible to derive the Erdős–Ko–Rado theorem as a special case of the Kruskal–Katona theorem, another important result in extremal set theory.[22] Many other proofs are known.[23]

[edit]

Generalizations

[edit]

A generalization of the theorem applies to subsets that are required to have large intersections. This version of the theorem has three parameters: , the number of elements the subsets are drawn from, , the size of the subsets, as before, and , the minimum size of the intersection of any two subsets. For the original form of the Erdős–Ko–Rado theorem, . In general, for large enough with respect to the other two parameters, the generalized theorem states that the size of a -intersecting family of subsets is at most[24] More precisely, this bound holds when , and does not hold for smaller values of . When , the only -intersecting families of this size are obtained by designating elements as the common intersection of all the subsets, and constructing the family of all -element subsets that include these designated elements.[25] The maximal size of a t-intersecting family when was determined by Ahlswede and Khachatrian, in their Ahlswede–Khachatrian theorem.[26]

The corresponding graph-theoretic formulation of this generalization involves Johnson graphs in place of Kneser graphs.[27] For large enough values of and in particular for , both the Erdős–Ko–Rado theorem and its generalization can be strengthened from the independence number to the Shannon capacity of a graph: the Johnson graph corresponding to the -intersecting -element subsets has Shannon capacity .[28]

The theorem can also be generalized to families in which every subsets have a common intersection. Because this strengthens the condition that every pair intersects (for which ), these families have the same bound on their maximum size, when is sufficiently large. However, in this case the meaning of "sufficiently large" can be relaxed from to .[29]

Analogs

[edit]

Many results analogous to the Erdős–Ko–Rado theorem, but for other classes of objects than finite sets, are known. These generally involve a statement that the largest families of intersecting objects, for some definition of intersection, are obtained by choosing an element and constructing the family of all objects that include that chosen element. Examples include the following:

There is a q-analog of the Erdős–Ko–Rado theorem for intersecting families of linear subspaces over finite fields. If is an intersecting family of -dimensional subspaces of an -dimensional vector space over a finite field of order , and , then where the subscript q marks the notation for the Gaussian binomial coefficient, the number of subspaces of a given dimension within a vector space of a larger dimension over a finite field of order q. In this case, a largest intersecting family of subspaces may be obtained by choosing any nonzero vector and constructing the family of subspaces of the given dimension that all contain the chosen vector.[30]

Two permutations on the same set of elements are defined to be intersecting if there is some element that has the same image under both permutations. On an -element set, there is an obvious family of intersecting permutations, the permutations that fix one of the elements (the stabilizer subgroup of this element). The analogous theorem is that no intersecting family of permutations can be larger, and that the only intersecting families of size are the cosets of one-element stabilizers. These can be described more directly as the families of permutations that map some fixed element to another fixed element. More generally, for any and sufficiently large , a family of permutations each pair of which has elements in common has maximum size , and the only families of this size are cosets of pointwise stabilizers.[31] Alternatively, in graph theoretic terms, the -element permutations correspond to the perfect matchings of a complete bipartite graph and the theorem states that, among families of perfect matchings each pair of which share edges, the largest families are formed by the matchings that all contain chosen edges.[32] Another analog of the theorem, for partitions of a set, includes as a special case the perfect matchings of a complete graph (with even). There are matchings, where denotes the double factorial. The largest family of matchings that pairwise intersect (meaning that they have an edge in common) has size and is obtained by fixing one edge and choosing all ways of matching the remaining vertices.[33]

A partial geometry is a system of finitely many abstract points and lines, satisfying certain axioms including the requirement that all lines contain the same number of points and all points belong to the same number of lines. In a partial geometry, a largest system of pairwise-intersecting lines can be obtained from the set of lines through any single point.[34]

A signed set consists of a set together with sign function that maps each element to . Two signed sets may be said to intersect when they have a common element that has the same sign in each of them. Then an intersecting family of -element signed sets, drawn from an -element universe, consists of at most signed sets. This number of signed sets may be obtained by fixing one element and its sign and letting the remaining elements and signs vary.[35]

For strings of length over an alphabet of size , two strings can be defined to intersect if they have a position where both share the same symbol. The largest intersecting families are obtained by choosing one position and a fixed symbol for that position, and letting the rest of the positions vary arbitrarily. These families consist of strings, and are the only pairwise intersecting families of this size. More generally, the largest families of strings in which every two have positions with equal symbols are obtained by choosing positions and symbols for those positions, for a number that depends on , , and , and constructing the family of strings that each have at least of the chosen symbols. These results can be interpreted graph-theoretically in terms of the Hamming scheme.[36]

Unsolved problem in mathematics:
Is the largest family of intersecting triangulations of a convex polygon obtained by cutting off one vertex and choosing all triangulations of the remaining polygon?

An unproven conjecture, posed by Gil Kalai and Karen Meagher, concerns another analog for the family of triangulations of a convex polygon with vertices. The number of all triangulations is a Catalan number , and the conjecture states that a family of triangulations every pair of which shares an edge has maximum size . An intersecting family of size exactly may be obtained by cutting off a single vertex of the polygon by a triangle, and choosing all ways of triangulating the remaining -vertex polygon.[37]

Applications

[edit]

The Erdős–Ko–Rado theorem can be used to prove the following result in probability theory. Let be independent 0–1 random variables with probability of being one, and let be any fixed convex combination of these variables. Then The proof involves observing that subsets of variables whose indicator vectors have large convex combinations must be non-disjoint and using the Erdős–Ko–Rado theorem to bound the number of these subsets.[38]

The stability properties of the Erdős–Ko–Rado theorem play a key role in an efficient algorithm for finding monochromatic edges in improper colorings of Kneser graphs.[39] The Erdős–Ko–Rado theorem has also been used to characterize the symmetries of the space of phylogenetic trees.[40]

See also

[edit]
  • Helly's theorem, on conditions ensuring that intersecting families of convex sets have a common intersection
  • Sperner's theorem, an upper bound on families of pairwise non-nested sets
  • Steiner system, maximum-sized uniform set families in which no pair (rather than every pair) has a large intersection
  • Sunflower (mathematics), a family of sets where (unlike the maximum intersecting families here) all pairs have equal intersections
  • Thrackle, an unsolved problem on the size of families of intersecting curves

References

[edit]

Notes

[edit]
  1. ^ a b Das & Tran (2016).
  2. ^ Aigner & Ziegler (2018); Godsil & Meagher (2015), p. xiii.
  3. ^ Füredi (1995).
  4. ^ Harvey & Wood (2014); Godsil & Meagher (2015), p. xiv.
  5. ^ Godsil & Meagher (2015), p. 43.
  6. ^ a b Anderson (2013).
  7. ^ a b Erdős, Ko & Rado (1961); Erdős (1987).
  8. ^ van Lint & Wilson (1992).
  9. ^ Anderson (1987).
  10. ^ a b Aigner & Ziegler (2018).
  11. ^ a b Dinur & Friedgut (2009).
  12. ^ Borg (2012).
  13. ^ a b Friedgut (2008).
  14. ^ Friedgut (2008); Dinur & Friedgut (2009); Das & Tran (2016).
  15. ^ Bollobás, Narayanan & Raigorodskii (2016); Balogh, Krueger & Luo (2022).
  16. ^ Polcyn & Ruciński (2017).
  17. ^ a b Füredi (1980).
  18. ^ Dow et al. (1985).
  19. ^ Hilton & Milner (1967); Frankl & Füredi (1986); Godsil & Meagher (2015), Section 1.6: The Hilton–Milner theorem, pp. 15–17.
  20. ^ Erdős, Ko & Rado (1961); Godsil & Meagher (2015), Section 1.1: The original proof, pp. 2–6.
  21. ^ a b c Katona (1972); Anderson (1987); van Lint & Wilson (1992); Aigner & Ziegler (2018).
  22. ^ Godsil & Meagher (2015), Section 1.5: The Kruskal–Katona theorem, pp. 11–15.
  23. ^ Frankl & Graham (1989); Godsil & Meagher (2015), p. 22.
  24. ^ Erdős, Ko & Rado (1961); Godsil & Meagher (2015), Theorem 0.0.2, p. xiv.
  25. ^ Wilson (1984); Godsil & Meagher (2015), p. 2.
  26. ^ Ahlswede & Khachatrian (1997)
  27. ^ Godsil & Meagher (2015), p. xiv.
  28. ^ Schrijver (1981); Deza & Frankl (1983).
  29. ^ Frankl (1976); Anderson (1987).
  30. ^ Frankl & Wilson (1986); Godsil & Meagher (2015), Chapter 9: The Grassmann scheme, pp. 161–183.
  31. ^ Frankl & Deza (1977); Cameron & Ku (2003); Larose & Malvenuto (2004); Godsil & Meagher (2009); Ellis, Friedgut & Pilpel (2011); Godsil & Meagher (2015), Chapter 14: Permutations, pp. 260–278.
  32. ^ Godsil & Meagher (2015), Section 7.5: Perfect matchings in complete bipartite graphs.
  33. ^ Godsil & Meagher (2015), Section 15.2: Perfect matchings, pp. 282–284.
  34. ^ Godsil & Meagher (2015), Section 5.6: Partial geometries, pp. 100–103.
  35. ^ Bollobás & Leader (1997).
  36. ^ Ahlswede & Khachatrian (1998); Godsil & Meagher (2015), Chapter 10: The Hamming scheme, pp. 184–209.
  37. ^ Olarte et al. (2020).
  38. ^ Liggett (1977); Anderson (1987).
  39. ^ Haviv (2022).
  40. ^ Grindstaff (2020).

Works cited

[edit]
[edit]