buildNetwork.Rd
Wrapper for microbial network inference methods.
SPIEC-EASI, CoNet (see function barebonesCoNet
) and bnlearn are supported.
Abundances are assumed to have been already filtered and transformed.
The function returns the network as an igraph object,
with nodes labeled and colored at the selected taxonomic levels
and, for SPIEC-EASI and CoNet, edges in red or green, depending on whether they
represent negative or positive associations.
buildNetwork( abundances, lineages = NULL, min.occ = 0, norm = FALSE, clr = FALSE, method = "mb", repNum = 20, minStrength = 0.5, alpha = 0.05, renorm = FALSE, initEdgeNum = 100, methodNum = 2, directed = FALSE, nameLevel = "Genus", colorLevel = "Class", widthFactor = 10, legend = TRUE, left.margin = 10 )
abundances | a matrix with taxa as rows and samples as columns |
---|---|
lineages | a matrix with lineages where row names match row names of abundances; first column gives kingdom, following columns give phylum, class, order, family, genus and species (obligatory for SPIEC-EASI) |
min.occ | only keep rows with at least the given number of non-zero values, keep the sum of the other rows (carried out before network construction) |
norm | normalize matrix (carrried out after filtering); not carried out for SPIEC-EASI (where it is already applied) |
clr | apply CLR transform (after filtering and normalization; |
method | mb or glasso (SPIEC-EASI), any combination of pearson, spearman, bray and/or kld (CoNet), hc, tabu, h2pc, mmhc, rsmax2, aracne, pc.stable, gs, fast.iamb, iamb.fdr, mmpc, hpc or si.hiton.pc (bnlearn, for aracne mi is set to mi-g) |
repNum | the number of permutation/bootstrap iterations for CoNet with dissimilarities and of bootstrap iterations in SPIEC-EASI and bnlearn; can be omitted (set to 0) for time-intensive bnlearn methods or for CoNet without p-values |
minStrength | bnlearn only: the minimum probability for an arc to appear across bootstraps (only applicable if repNum is set larger 1) |
alpha | p-value threshold for bnlearn and CoNet (ignored in methods aracne, hc, tabu, rsmax2, mmhc and h2pc) |
renorm | compute correlation p-values in CoNet with renorm enabled (slow) |
initEdgeNum | CoNet parameter for the number of initial top and bottom edges |
methodNum | CoNet threshold on the minimum method number needed to keep an edge (only used when more than one method is provided) |
directed | bnlearn only: infer a directed network (hc, tabu, rsmax2, mmhc and h2pc only infer directed networks, aracne only undirected networks) |
nameLevel | the taxonomic level used for node names (only if lineages provided) |
colorLevel | the taxonomic level used for node colors (only if lineages provided) |
widthFactor | edge width scaling factor |
legend | if lineages are provided: show node color code |
left.margin | if lineages are provided and legend is true: set the left margin (the margin is restored to its previous value afterwards) |
igraph object
For SPIEC-EASI, edge width represents the abolute of the regression coefficients, whereas for bnlearn,
it represents the probability of an arc to re-appear across bootstrap iterations. For CoNet, it either
represents absolute association strength, method number supporting an edge or significance,
depending on whether more than one method was used to compute the network and whether or not
p-values were computed (significance is defined as -log10(pval)). For single dissimilarities in CoNet, edge weight
is scaled (see barebonesCoNet
for details) such that for all network construction methods,
a thicker edge means a stronger/more stable/more significant association.
For CoNet, p-values are computed by default with cor.test for correlations and permutations and bootstraps
for dissimilarities and are multiple-testing corrected with Benjamini-Hochberg.
P-value computation in CoNet can be disabled by setting repNum to 0.
If lineages are provided and legend is true, the color code for the node colors is plotted in a separate bar plot.
SPIEC-EASI and CoNet return undirected graphs, whereas bnlearn methods support the inference
of directed graphs. SPIEC-EASI methods tend to be faster than bnlearn methods.
The returned network can be visualized with igraph's interactive network visualisation
function tkplot or simply with igraph's plot function. When RCy3 is installed and Cytoscape is open,
the graph can also be sent to Cytoscape directly with RCy3's function createNetworkFromIgraph().
data(ibd_taxa) data(ibd_lineages) top.indices=sort(rowSums(ibd_taxa),decreasing = TRUE,index.return=TRUE)$ix[1:30] taxa=ibd_taxa[top.indices,] lineages=ibd_lineages[top.indices,] hc.out=buildNetwork(taxa,lineages,method="hc",nameLevel="species")#> #> Bayesian network learned via Score-based methods #> #> model: #> [Eubacterium_rectale][Ruminococcus_sp_5_1_39BFAA][Bifidobacterium_longum] #> [Akkermansia_muciniphila][Dialister_invisus][Bacteroides_vulgatus] #> [Prevotella_copri][Roseburia_intestinalis][Bacteroides_ovatus] #> [Streptococcus_thermophilus][Bacteroides_stercoris][Parabacteroides_merdae] #> [Enterococcus_faecium][Faecalibacterium_prausnitzii|Eubacterium_rectale] #> [Ruminococcus_gnavus|Eubacterium_rectale] #> [Ruminococcus_torques|Ruminococcus_sp_5_1_39BFAA] #> [Collinsella_aerofaciens|Bifidobacterium_longum] #> [Alistipes_onderdonkii|Roseburia_intestinalis] #> [Bacteroides_dorei|Parabacteroides_merdae] #> [Bacteroides_caccae|Parabacteroides_merdae] #> [Parabacteroides_unclassified|Bacteroides_vulgatus] #> [Escherichia_coli|Faecalibacterium_prausnitzii] #> [Subdoligranulum_unclassified|Ruminococcus_gnavus:Ruminococcus_torques] #> [Bifidobacterium_adolescentis|Collinsella_aerofaciens] #> [Eubacterium_hallii|Eubacterium_rectale:Ruminococcus_torques] #> [Ruminococcus_bromii|Subdoligranulum_unclassified] #> [Alistipes_putredinis|Subdoligranulum_unclassified:Bacteroides_stercoris] #> [Dorea_longicatena|Ruminococcus_sp_5_1_39BFAA:Bifidobacterium_longum:Eubacterium_hallii] #> [Coprococcus_comes|Eubacterium_hallii] #> [Bacteroides_uniformis|Alistipes_putredinis] #> nodes: 30 #> arcs: 22 #> undirected arcs: 0 #> directed arcs: 22 #> average markov blanket size: 1.87 #> average neighbourhood size: 1.47 #> average branching factor: 0.73 #> #> learning algorithm: Hill-Climbing #> score: BIC (Gauss.) #> penalization coefficient: 2.521713 #> tests used in the learning procedure: 1073 #> optimized: TRUE #> #> [1] "Carrying out 20 bootstraps to assess edge strengths..." #> [1] "logLik: 8099.57779267286" #> [1] "AIC: 8017.57779267286" #> [1] "BIC: 7892.79736287917"plot(hc.out)