Abstract
A statistical approach to a posteriori blockmodeling for graphs
is proposed. The model assumes that the vertices of the graph are partitioned into two unknown blocks and that the probability of an edge between two vertices depends only on the blocks to which they belong.
Statistical procedures are derived for estimating the probabilities of edges
and for predicting the block structure from observations of the edge pattern only. ML estimators can be computed using the EM algorithm, but this
strategy is practical only for small graphs. A Bayesian estimator,
based on Gibbs sampling, is proposed. This estimator is practical also
for large graphs. When ML estimators are used, the block structure can be
predicted based on predictive likelihood. When Gibbs sampling is used,
the block structure can be predicted from posterior predictive probabilities.
A side result is that when the number of vertices tends to infinity while
the probabilities remain constant, the block structure can be recovered
correctly with probability tending to 1.
is proposed. The model assumes that the vertices of the graph are partitioned into two unknown blocks and that the probability of an edge between two vertices depends only on the blocks to which they belong.
Statistical procedures are derived for estimating the probabilities of edges
and for predicting the block structure from observations of the edge pattern only. ML estimators can be computed using the EM algorithm, but this
strategy is practical only for small graphs. A Bayesian estimator,
based on Gibbs sampling, is proposed. This estimator is practical also
for large graphs. When ML estimators are used, the block structure can be
predicted based on predictive likelihood. When Gibbs sampling is used,
the block structure can be predicted from posterior predictive probabilities.
A side result is that when the number of vertices tends to infinity while
the probabilities remain constant, the block structure can be recovered
correctly with probability tending to 1.
Original language | English |
---|---|
Pages (from-to) | 75-100 |
Journal | Journal of Classification |
Volume | 14 |
DOIs | |
Publication status | Published - 1997 |
Subject classification (UKÄ)
- Probability Theory and Statistics
Free keywords
- Colored graph
- EM algorithm
- Latent class model
- Social networks
- Gibbs sampling