Below are the hands-on exercises on the advanced methods. They cover heuristic search and how to perform Bayesian model averaging.

Note: in the following text, Bayesian network, structure, and DAG are used as synonyms.

MCMC over the structures

The output of a Bayesian network analysis of a dataset is usually a single, well-adjusted DAG. From the researcher's point of view, this can be frustrating: the model is the one best supported by the data, but any quantification of uncertainty is missing. In epidemiology, researchers are used to reporting a point estimate together with a measure of uncertainty. An arc in a single selected Bayesian network is only a point estimate; we will see how model averaging attaches an uncertainty to it. The link strength measure is designed to account for that. A more natural alternative is to perform MCMC over structures (Friedman & Koller, 2003).
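To make the idea of model averaging concrete, here is a minimal base-R sketch that does not use any package. It assumes a toy sample of three hypothetical DAGs over nodes A, B, and C, stored as adjacency matrices in which entry [child, parent] = 1 denotes an arc from parent to child. The helper `make_dag()` and all object names are illustrative only.

```r
# Toy illustration of model averaging over a sample of DAGs.
# Assumed convention: entry [child, parent] = 1 means an arc parent -> child.
nodes <- c("A", "B", "C")

# Hypothetical helper: build an adjacency matrix from a list of c(child, parent) pairs.
make_dag <- function(arcs) {
  m <- matrix(0, nrow = 3, ncol = 3, dimnames = list(nodes, nodes))
  for (a in arcs) m[a[1], a[2]] <- 1
  m
}

dags <- list(
  make_dag(list(c("B", "A"), c("C", "B"))),  # A -> B -> C
  make_dag(list(c("B", "A"))),               # A -> B
  make_dag(list(c("A", "B"), c("C", "B")))   # B -> A and B -> C
)

# The support of an arc is the fraction of sampled DAGs that contain it.
arc_support <- Reduce(`+`, dags) / length(dags)
print(round(arc_support, 2))
```

Each entry of `arc_support` lies between 0 and 1 and plays the role of an uncertainty measure for the corresponding arc, which a single best DAG cannot provide.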

We use the mcmcabn() function on the cache of pre-computed network scores. One needs to define: the type of score used (here the marginal likelihood, mlik); the maximum number of parents per node (the same as the one used in buildscorecache()); the MCMC learning scheme, given as the number of MCMC samples, the number of thinned samples (to reduce autocorrelation), and the length of the burn-in phase; and, optionally, a starting DAG and a structural prior. We also need to select the relative probability of performing radical moves (shuffling), because a naive MCMC approach is known to get stuck very easily in local maxima (for more details, see Grzegorczyk & Husmeier, 2008; Su & Borsuk, 2016).

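mcmc.out <- mcmcabn(score.cache = mycache,          # cache of pre-computed scores
                    score = "mlik",                 # marginal likelihood
                    data.dists = dist,
                    max.parents = 4,                # same as in buildscorecache()
                    mcmc.scheme = c(1000, 9, 500),  # samples, thinned samples, burn-in
                    seed = 321,
                    verbose = FALSE,
                    start.dag = "random",           # starting DAG
                    prob.rev = 0.07,                # radical move: edge reversal
                    prob.mbr = 0.07,                # radical move: Markov blanket resampling
                    prior.choice = 1)               # structural prior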

This step is, again, computationally intensive!
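To get a feel for the cost, one can count the chain steps implied by the scheme c(1000, 9, 500) used above. The sketch below assumes the common reading of thinning, in which 9 steps are discarded between two retained samples; the exact accounting inside mcmcabn() may differ. The names `n.samples`, `n.thin`, and `n.burnin` are illustrative labels, not package arguments.

```r
# Step count implied by an MCMC scheme, assuming that "9 thinned samples"
# means 9 discarded steps between two consecutive retained samples.
scheme <- c(n.samples = 1000, n.thin = 9, n.burnin = 500)

steps_after_burnin <- scheme[["n.samples"]] * (scheme[["n.thin"]] + 1)
total_steps <- scheme[["n.burnin"]] + steps_after_burnin

# Indices of the chain steps that would be retained under this assumption.
kept <- scheme[["n.burnin"]] +
  seq_len(scheme[["n.samples"]]) * (scheme[["n.thin"]] + 1)

cat("total steps:", total_steps, "\n")
cat("first retained steps:", head(kept, 3), "\n")
```

Under this reading, the sampler performs roughly ten times as many moves as the number of structures it returns, in addition to the burn-in.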

Here is a plot of the MCMC samples. The y-axis shows the scores of the structures as a function of the step index. The dots mark the radical moves. On the right-hand side, a histogram shows the number of structures with a given score. As one can see, the histogram is sharply peaked at the maximum score.

One can also display the cumulative maximum score (used for network score optimization).
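The relation between a score trace and its cumulative maximum can be sketched in base R. The trace below is simulated and purely hypothetical; it only stands in for the scores of sampled structures, and the sketch does not reproduce the package's own plotting method.

```r
# Hypothetical score trace standing in for the scores of sampled structures.
set.seed(1)
scores <- -1000 + cumsum(rnorm(200))

# Cumulative maximum: the best score seen up to each step.
running_max <- cummax(scores)

plot(scores, type = "l", xlab = "MCMC step", ylab = "Network score")
lines(running_max, col = "red", lty = 2)
```

The cumulative maximum never decreases and always lies on or above the trace, which is why it is the natural summary when the sampler is used as a score optimizer rather than for model averaging.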

The major advantage of this method, however, is that the MCMC sample can be queried using a formula statement.

# average individual arc support
query(mcmcabn = mcmc.out)
                  AR     pneumS     female     livdam       eggs  wormCount
AR        0.00000000 0.01069893 0.01139886 0.01159884 0.07599240 0.00349965
pneumS    0.00589941 0.00000000 0.01539846 0.00829917 0.10398960 0.00429957
female    0.00579942 0.01879812 0.00000000 0.01769823 0.00659934 0.00249975
livdam    0.00689931 0.00999900 0.01119888 0.00000000 0.44275572 0.00439956
eggs      0.14908509 0.09239076 0.01019898 0.29377062 0.00000000 0.02959704
wormCount 0.76062394 0.08119188 0.03349665 0.07039296 0.91700830 0.00000000
age       0.13568643 0.07349265 0.15768423 0.11338866 0.09739026 0.01399860
adg       0.03019698 0.01589841 0.08359164 0.00959904 0.17288271 0.00349965
                 age        adg
AR        0.77212279 0.08649135
pneumS    0.19238076 0.03989601
female    0.29857014 0.12168783
livdam    0.06649335 0.01169883
eggs      0.25057494 0.36466353
wormCount 0.88481152 0.31056894
age       0.00000000 0.47095290
adg       0.52864714 0.00000000
# probability that wormCount is directly linked to age but not to female
query(mcmcabn = mcmc.out,formula = ~wormCount|age-wormCount|female)
[1] 0.01889811
# probability that wormCount is directly linked to both age and adg, and that adg is linked to age (in either direction)
query(mcmcabn = mcmc.out,formula = ~wormCount|age + wormCount|adg + age|adg)+
  query(mcmcabn = mcmc.out,formula = ~wormCount|age + wormCount|adg + adg|age)
[1] 0.2451755
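What such structural queries compute can be illustrated in base R, without the package. The sketch assumes a toy sample of four hypothetical DAGs and the convention that entry [child, parent] = 1 denotes an arc from parent to child; the helpers `set_arcs()` and `has()` are illustrative and not part of mcmcabn.

```r
# Toy sample of DAGs over four nodes; entry [child, parent] = 1 is an arc parent -> child.
nodes <- c("wormCount", "age", "female", "adg")
empty <- matrix(0, 4, 4, dimnames = list(nodes, nodes))

# Hypothetical helper: each argument is a c(child, parent) pair.
set_arcs <- function(...) {
  m <- empty
  for (a in list(...)) m[a[1], a[2]] <- 1
  m
}

dags <- list(
  set_arcs(c("wormCount", "age"), c("wormCount", "adg"), c("age", "adg")),
  set_arcs(c("wormCount", "age"), c("wormCount", "adg"), c("adg", "age")),
  set_arcs(c("wormCount", "age"), c("wormCount", "female")),
  set_arcs(c("wormCount", "age"))
)

has <- function(m, child, parent) m[child, parent] == 1

# Fraction of DAGs with the arc into wormCount from age present and from female absent.
p_age_not_female <- mean(sapply(dags, function(m)
  has(m, "wormCount", "age") && !has(m, "wormCount", "female")))

# "Undirected" link between age and adg: sum over the two possible directions.
p_dir1 <- mean(sapply(dags, function(m)
  has(m, "wormCount", "age") && has(m, "wormCount", "adg") && has(m, "age", "adg")))
p_dir2 <- mean(sapply(dags, function(m)
  has(m, "wormCount", "age") && has(m, "wormCount", "adg") && has(m, "adg", "age")))
p_undirected <- p_dir1 + p_dir2

print(c(p_age_not_female = p_age_not_female, p_undirected = p_undirected))
```

Adding the two directional frequencies is legitimate because a DAG cannot contain an arc in both directions between the same pair of nodes, so the two events are mutually exclusive.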

References

Friedman, N., & Koller, D. (2003). Being Bayesian about network structure: A Bayesian approach to structure discovery in Bayesian networks. Machine Learning, 50(1-2), 95–125.
Grzegorczyk, M., & Husmeier, D. (2008). Improving the structure MCMC sampler for Bayesian networks by introducing a new edge reversal move. Machine Learning, 71(2-3), 265.
Koivisto, M., & Sood, K. (2004). Exact Bayesian structure discovery in Bayesian networks. Journal of Machine Learning Research, 5(May), 549–573.
Korb, K. B., & Nicholson, A. E. (2010). Bayesian artificial intelligence. CRC Press.
Su, C., & Borsuk, M. E. (2016). Improving structure MCMC for Bayesian networks through Markov blanket resampling. Journal of Machine Learning Research, 17(1), 4042–4061.
