Association scheme

The theory of association schemes arose in statistics, in the theory of experimental design for the analysis of variance.[1][2][3] In mathematics, association schemes belong to both algebra and combinatorics. In algebraic combinatorics, association schemes provide a unified approach to many topics, for example combinatorial designs and the theory of error-correcting codes.[4][5] In algebra, association schemes generalize groups, and the theory of association schemes generalizes the character theory of linear representations of groups.[6][7][8]

Definition

An n-class association scheme consists of a set X together with a partition S of X × X into n + 1 binary relations, R0, R1, ..., Rn which satisfy:

  • R 0 = { ( x , x ) : x X } {\displaystyle R_{0}=\{(x,x):x\in X\}} ; it is called the identity relation.
  • Defining R := { ( x , y ) : ( y , x ) R } {\displaystyle R^{*}:=\{(x,y):(y,x)\in R\}} , if R in S, then R* in S.
  • If ( x , y ) R k {\displaystyle (x,y)\in R_{k}} , the number of z X {\displaystyle z\in X} such that ( x , z ) R i {\displaystyle (x,z)\in R_{i}} and ( z , y ) R j {\displaystyle (z,y)\in R_{j}} is a constant p i j k {\displaystyle p_{ij}^{k}} depending on i {\displaystyle i} , j {\displaystyle j} , k {\displaystyle k} but not on the particular choice of x {\displaystyle x} and y {\displaystyle y} .

An association scheme is commutative if p i j k = p j i k {\displaystyle p_{ij}^{k}=p_{ji}^{k}} for all i {\displaystyle i} , j {\displaystyle j} and k {\displaystyle k} . Most authors assume this property. Note, however, that while the notion of an association scheme generalizes the notion of a group, the notion of a commutative association scheme only generalizes the notion of a commutative group.

A symmetric association scheme is one in which each R i {\displaystyle R_{i}} is a symmetric relation. That is:

  • if (x, y) ∈ Ri, then (y, x) ∈ Ri. (Or equivalently, R* = R.)

Every symmetric association scheme is commutative.

Two points x and y are called i th associates if ( x , y ) R i {\displaystyle (x,y)\in R_{i}} . The definition states that if x and y are i th associates then so are y and x. Every pair of points are i th associates for exactly one i {\displaystyle i} . Each point is its own zeroth associate while distinct points are never zeroth associates. If x and y are k th associates then the number of points z {\displaystyle z} which are both i th associates of x {\displaystyle x} and j th associates of y {\displaystyle y} is a constant p i j k {\displaystyle p_{ij}^{k}} .

Graph interpretation and adjacency matrices

A symmetric association scheme can be visualized as a complete graph with labeled edges. The graph has v {\displaystyle v} vertices, one for each point of X {\displaystyle X} , and the edge joining vertices x {\displaystyle x} and y {\displaystyle y} is labeled i {\displaystyle i} if x {\displaystyle x} and y {\displaystyle y} are i {\displaystyle i}  th associates. Each edge has a unique label, and the number of triangles with a fixed base labeled k {\displaystyle k} having the other edges labeled i {\displaystyle i} and j {\displaystyle j} is a constant p i j k {\displaystyle p_{ij}^{k}} , depending on i , j , k {\displaystyle i,j,k} but not on the choice of the base. In particular, each vertex is incident with exactly p i i 0 = v i {\displaystyle p_{ii}^{0}=v_{i}} edges labeled i {\displaystyle i} ; v i {\displaystyle v_{i}} is the valency of the relation R i {\displaystyle R_{i}} . There are also loops labeled 0 {\displaystyle 0} at each vertex x {\displaystyle x} , corresponding to R 0 {\displaystyle R_{0}} .

The relations are described by their adjacency matrices. A i {\displaystyle A_{i}} is the adjacency matrix of R i {\displaystyle R_{i}} for i = 0 , , n {\displaystyle i=0,\ldots ,n} and is a v × v matrix with rows and columns labeled by the points of X {\displaystyle X} .

( A i ) x , y = { 1 , if  ( x , y ) R i , 0 , otherwise. ( 1 ) {\displaystyle \left(A_{i}\right)_{x,y}={\begin{cases}1,&{\mbox{if }}(x,y)\in R_{i},\\0,&{\mbox{otherwise.}}\end{cases}}\qquad (1)}

The definition of a symmetric association scheme is equivalent to saying that the A i {\displaystyle A_{i}} are v × v (0,1)-matrices which satisfy

I. A i {\displaystyle A_{i}} is symmetric,
II. i = 0 n A i = J {\displaystyle \sum _{i=0}^{n}A_{i}=J} (the all-ones matrix),
III. A 0 = I {\displaystyle A_{0}=I} ,
IV. A i A j = k = 0 n p i j k A k = A j A i , i , j = 0 , , n {\displaystyle A_{i}A_{j}=\sum _{k=0}^{n}p_{ij}^{k}A_{k}=A_{j}A_{i},i,j=0,\ldots ,n} .

The (x, y)-th entry of the left side of (IV) is the number of paths of length two between x and y with labels i and j in the graph. Note that the rows and columns of A i {\displaystyle A_{i}} contain v i {\displaystyle v_{i}} 1 {\displaystyle 1} 's:

A i J = J A i = v i J . ( 2 ) {\displaystyle A_{i}J=JA_{i}=v_{i}J.\qquad (2)}

Terminology

  • The numbers p i j k {\displaystyle p_{ij}^{k}} are called the parameters of the scheme. They are also referred to as the structural constants.

History

The term association scheme is due to (Bose & Shimamoto 1952) but the concept is already inherent in (Bose & Nair 1939).[9] These authors were studying what statisticians have called partially balanced incomplete block designs (PBIBDs). The subject became an object of algebraic interest with the publication of (Bose & Mesner 1959) and the introduction of the Bose–Mesner algebra. The most important contribution to the theory was the thesis of P. Delsarte (Delsarte 1973) who recognized and fully used the connections with coding theory and design theory.[10] Generalizations have been studied by D. G. Higman (coherent configurations) and B. Weisfeiler (distance regular graphs).

Basic facts

  • p 00 0 = 1 {\displaystyle p_{00}^{0}=1} , i.e., if ( x , y ) R 0 {\displaystyle (x,y)\in R_{0}} then x = y {\displaystyle x=y} and the only z {\displaystyle z} such that ( x , z ) R 0 {\displaystyle (x,z)\in R_{0}} is z = x {\displaystyle z=x} .
  • i = 0 k p i i 0 = | X | {\displaystyle \sum _{i=0}^{k}p_{ii}^{0}=|X|} ; this is because the R i {\displaystyle R_{i}} partition X {\displaystyle X} .

The Bose–Mesner algebra

The adjacency matrices A i {\displaystyle A_{i}} of the graphs ( X , R i ) {\displaystyle \left(X,R_{i}\right)} generate a commutative and associative algebra A {\displaystyle {\mathcal {A}}} (over the real or complex numbers) both for the matrix product and the pointwise product. This associative, commutative algebra is called the Bose–Mesner algebra of the association scheme.

Since the matrices in A {\displaystyle {\mathcal {A}}} are symmetric and commute with each other, they can be diagonalized simultaneously. Therefore, A {\displaystyle {\mathcal {A}}} is semi-simple and has a unique basis of primitive idempotents J 0 , , J n {\displaystyle J_{0},\ldots ,J_{n}} .

There is another algebra of ( n + 1 ) × ( n + 1 ) {\displaystyle (n+1)\times (n+1)} matrices which is isomorphic to A {\displaystyle {\mathcal {A}}} , and is often easier to work with.

Examples

  • The Johnson scheme, denoted by J(v, k), is defined as follows. Let S be a set with v elements. The points of the scheme J(v, k) are the ( v k ) {\displaystyle {v \choose k}} subsets of S with k elements. Two k-element subsets A, B of S are i th associates when their intersection has size k − i.
  • The Hamming scheme, denoted by H(n, q), is defined as follows. The points of H(n, q) are the qn ordered n-tuples over a set of size q. Two n-tuples x, y are said to be i th associates if they disagree in exactly i coordinates. E.g., if x = (1,0,1,1), y = (1,1,1,1), z = (0,0,1,1), then x and y are 1st associates, x and z are 1st associates and y and z are 2nd associates in H(4,2).
  • A distance-regular graph, G, forms an association scheme by defining two vertices to be i th associates if their distance is i.
  • A finite group G yields an association scheme on X = G {\displaystyle X=G} , with a class Rg for each group element, as follows: for each g G {\displaystyle g\in G} let R g = { ( x , y ) x = g y } {\displaystyle R_{g}=\{(x,y)\mid x=g*y\}} where {\displaystyle *} is the group operation. The class of the group identity is R0. This association scheme is commutative if and only if G is abelian.
  • A specific 3-class association scheme:[11]
Let A(3) be the following association scheme with three associate classes on the set X = {1,2,3,4,5,6}. The (i, j ) entry is s if elements i and j are in relation Rs.
  1 2 3 4 5 6
1  0   1   1   2   3   3 
2  1   0   1   3   2   3 
3  1   1   0   3   3   2 
4  2   3   3   0   1   1 
5  3   2   3   1   0   1 
6  3   3   2   1   1   0 

Coding theory

The Hamming scheme and the Johnson scheme are of major significance in classical coding theory.

In coding theory, association scheme theory is mainly concerned with the distance of a code. The linear programming method produces upper bounds for the size of a code with given minimum distance, and lower bounds for the size of a design with a given strength. The most specific results are obtained in the case where the underlying association scheme satisfies certain polynomial properties; this leads one into the realm of orthogonal polynomials. In particular, some universal bounds are derived for codes and designs in polynomial-type association schemes.

In classical coding theory, dealing with codes in a Hamming scheme, the MacWilliams transform involves a family of orthogonal polynomials known as the Krawtchouk polynomials. These polynomials give the eigenvalues of the distance relation matrices of the Hamming scheme.

See also

Notes

References

  • Bailey, Rosemary A. (2004), Association Schemes: Designed Experiments, Algebra and Combinatorics, Cambridge University Press, ISBN 978-0-521-82446-0, MR 2047311. (Chapters from preliminary draft are available on-line.)
  • Bannai, Eiichi; Ito, Tatsuro (1984), Algebraic combinatorics I: Association schemes, Menlo Park, CA: Benjamin/Cummings, ISBN 0-8053-0490-8, MR 0882540
  • Bose, R. C.; Mesner, D. M. (1959), "On linear associative algebras corresponding to association schemes of partially balanced designs", Annals of Mathematical Statistics, 30 (1): 21–38, doi:10.1214/aoms/1177706356, JSTOR 2237117, MR 0102157
  • Bose, R. C.; Nair, K. R. (1939), "Partially balanced incomplete block designs", Sankhyā, 4 (3): 337–372, JSTOR 40383923
  • Bose, R. C.; Shimamoto, T. (1952), "Classification and analysis of partially balanced incomplete block designs with two associate classes", Journal of the American Statistical Association, 47 (258): 151–184, doi:10.1080/01621459.1952.10501161
  • Camion, P. (1998), "18. Codes and Association Schemes: Basic Properties of Association Schemes Relevant to Coding", in Pless, V.S.; Huffman, W.C.; Brualdi, R.A. (eds.), Handbook of Coding Theory, vol. 1, Elsevier, pp. 1441–, ISBN 978-0-444-50088-5
  • Delsarte, P. (1973), "An Algebraic Approach to the Association Schemes of Coding Theory", Philips Research Reports (Supplement No. 10), OCLC 641852316
  • Delsarte, P.; Levenshtein, V. I. (1998). "Association schemes and coding theory". IEEE Transactions on Information Theory. 44 (6): 2477–2504. doi:10.1109/18.720545.
  • Dembowski, P. (1968), Finite Geometries, Springer, ISBN 978-3-540-61786-0
  • Godsil, C. D. (1993), Algebraic Combinatorics, New York: Chapman and Hall, ISBN 0-412-04131-6, MR 1220704
  • MacWilliams, F.J.; Sloane, N.J.A. (1977), The Theory of Error Correcting Codes, North-Holland Mathematical Library, vol. 16, Elsevier, ISBN 978-0-444-85010-2
  • Street, Anne Penfold; Street, Deborah J. (1987), Combinatorics of Experimental Design, Oxford U. P. [Clarendon], ISBN 0-19-853256-3
  • van Lint, J.H.; Wilson, R.M. (1992), A Course in Combinatorics, Cambridge University Press, ISBN 0-521-00601-5
  • Zieschang, Paul-Hermann (2005a), "Association Schemes: Designed Experiments, Algebra and Combinatorics by Rosemary A. Bailey, Review" (PDF), Bulletin of the American Mathematical Society, 43 (2): 249–253, doi:10.1090/S0273-0979-05-01077-3
  • Zieschang, Paul-Hermann (2005b), Theory of association schemes, Springer, ISBN 3-540-26136-2
  • Zieschang, Paul-Hermann (2006), "The exchange condition for association schemes", Israel Journal of Mathematics, 151 (3): 357–380, doi:10.1007/BF02777367, MR 2214129, S2CID 120009352
  • v
  • t
  • e
Scientific
methodTreatment
and blockingModels
and inferenceDesigns

Completely
randomized
  • v
  • t
  • e
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
  • Z-test (normal)
  • Student's t-test
  • F-test
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
  • Category
  • icon Mathematics portal
  • Commons
  • WikiProject
Authority control databases: National Edit this at Wikidata
  • Israel
  • United States