pubmed.ncbi.nlm.nih.gov

Mapping the ecological networks of microbial communities - PubMed

  • ️Sun Jan 01 2017

Mapping the ecological networks of microbial communities

Yandong Xiao et al. Nat Commun. 2017.

Abstract

Mapping the ecological networks of microbial communities is a necessary step toward understanding their assembly rules and predicting their temporal behavior. However, existing methods require assuming a particular population dynamics model, which is not known a priori. Moreover, those methods require fitting longitudinal abundance data, which are often not informative enough for reliable inference. To overcome these limitations, here we develop a new method based on steady-state abundance data. Our method can infer the network topology and inter-taxa interaction types without assuming any particular population dynamics model. Additionally, when the population dynamics is assumed to follow the classic Generalized Lotka-Volterra model, our method can infer the inter-taxa interaction strengths and intrinsic growth rates. We systematically validate our method using simulated data, and then apply it to four experimental data sets. Our method represents a key step towards reliable modeling of complex, real-world microbial communities, such as the human gut microbiota.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing financial interests.

Figures

Fig. 1
Fig. 1

Inferring ecological interaction types for a small microbial community. The interaction types are coded as the sign-pattern of the Jacobian matrix. a For a microbial community of two taxa, its ecological network and the sign-pattern of the corresponding Jacobian matrix are shown here. b There are three possible steady-state samples (shown as colored pie charts), and two of them x{1,2},x1 share taxon 1. We calculate the green line that passes through the origin and is perpendicular to the vector (x1,2-x1) (shown as a blue line segment). This green line crosses the origin, and two other orthants (shown in light cyan and green), offering a set of possible sign-patterns: 0,0,(-,-) and (+,-), for which s1=(signJ11,sign(J12)) may belong to. Provided that J11<0, we conclude that s1=(-,+). c For a microbial community of three taxa, its ecological network and the sign-pattern of the corresponding Jacobian matrix are shown here. d There are seven possible steady-state samples, and we plot four of them that share taxon 1. Consider a line segment (x{1,3}-x{1}) (solid blue). We calculate the orange plane that passes through the origin and is perpendicular to this solid blue line. This orange plane crosses nine regions: the origin and the other eight regions (denoted in different color cubes, color lines), offering nine possible sign-patterns for s 1. We can consider another line segment that connects two steady-state samples sharing taxon 1, say, x{1,3} and x{1,2,3}, and repeat the above procedure. We do this for all the sample pairs (dashed blue lines), record the regions crossed by the corresponding orthogonal planes. Finally, the intersection of the regions crossed by all those orthogonal hyperplanes yields a minimum set of sign-patterns S^1={-,0,+,0,0,0,+,0,-} that s 1 may belong to. If we know that J11<0, then we can uniquely determine s1=(-,0,+)

Fig. 2
Fig. 2

Consistency check of the GLV model and the observed steady-state samples. For a microbial community following exactly the GLV dynamics, all its steady-state samples sharing one common taxon will align onto a hyperplane in the state space. a Here we consider a microbial community of three taxa. There are four steady-state samples {x{1},x{1,2},x{1,3},x{1,2,3}} that share common taxon 1. Those four steady-state samples represent four points in the state space, and they align onto a plane (light red). The normal vector of this plane is parallel to the first row a 1 of the interaction matrix A in the GLV model. Given any one of non-zero entries in a 1, we can determine the exact values of all other entries. Otherwise, we can always express the inter-taxa interaction strengths a ij (j≠i) as a function of the intra-taxa interaction strength a ii. b Here we again consider a microbial community of three taxa. Taxon-1 and taxon-2 follow the GLV dynamics, but taxon-3 does not. Then those steady-state samples that share taxon-3 do not align onto a plane anymore. Here we show the best fitted plane (in green) by minimizing the distance between this plane and the four steady states, with the coefficient of determination R2=0.77

Fig. 3
Fig. 3

Validation of our method in inferring interaction types using simulated data. a,b Consider a small microbial community of N=8 taxa. We generate steady-state samples using four different population dynamics models: Generalized Lotka–Volterra (GLV), Holling Type II (Holling II), DeAngelis–Beddington (DB), and Crowley–Martin (CM). We compare the performance of the brute-force algorithm (with solution space ~38=6,561) and the heuristic algorithm (with solution space ~Ψ=5N=40). a In the noiseless case, we plot the inference accuracy (defined as the percentage of correctly inferred signs in the Jacobian matrix) as a function of sample size Ω. b In the presence of noise, we plot the inference accuracy as a function of the noise level η. Here the sample size is fixed: Ω=5N=40. The error bar represents standard deviation for 10 different realizations. c, d We calculate the minimal sample size Ω* required for the heuristic algorithm to achieve high accuracy (100% for GLV, 95% for Holling II, DB, and CM) at different system sizes. We consider two different taxa presence patterns: uniform and heterogenous (see insets). Here the simulated data is generated in the noiseless case and we chose Ψ=10N. ad The underlying ecological network is generated from a directed random graph model with connectivity 0.4 (i.e., with probability 0.4 there will be a directed edge between any two taxa)

Fig. 4
Fig. 4

Validation of our method in inferring interaction strengths using simulated data. Here, we simulate steady-state samples using the GLV model with N taxa and intra-taxa interaction strength aii=-1 for each taxon. The underlying ecological network is generated from a directed random graph model with connectivity 0.4. The noise is added to steady-state samples as follows: xiI→xiI+ηu, where the random number u follows a uniform distribution U[-xiI,xiI], and η is the noise level. a, b Consider the ideal case: (1) noiseless η=0; and (2) we know exactly aii=-1. a We can perfectly infer a ij’s and r i’s. b The minimal sample size Ω* required to correctly infer the interaction strengths scales linearly with the system size N. Here we consider two different taxa presence pattern: uniform and heterogenous. cf In the presence of noise, and during the we just assume that the intra-taxa interaction strengths âii follows a half-normal distribution. c Using the Lasso regularization induces high false-discovery rate (FDR) ~ 0.448 at η=0.1. d Using classical Lasso with cross validation, both NRMSE and FDR increase with increasing η. e For the same data set used in c, we use the knockoff filter to control the FDR below a certain level q=0.2. f With increasing noise level η, FDR can still be successfully controlled below q=0.2 by applying the knockoff filter. In subfigures a, cf, we have N=50. The error bar represents standard deviation for 10 different realizations

Fig. 5
Fig. 5

Inferring interaction types of a synthetic soil microbial community. The steady-state samples were experimentally collected from a synthetic soil microbial community of eight bacterial species. Those steady-state samples involve 101 different species combinations: all eight solos, 28 duos, 56 trios, all eight septets, and 1 octet. a From the eight solos (monoculture experiments) and 28 duos (pair-wise co-culture experiments), one can calculate the relative yield R ij, quantifying the promotion (positive) or inhibition (negative) impact of species j on species i. The values shown in the relative yield matrix R=(Rij) quantify the strengths of promotion and inhibition effects. The sign-pattern of this matrix serves as the ground truth of that of the Jacobian matrix associated with the unknown population dynamics of this microbial community. b Without considering the 8 solos and 28 duos, we analyze the other steady-state samples. We use the brute-force method to infer the ecological interaction types, i.e., the sign-pattern of the Jacobian matrix. Blue (or red) means inhibition (or promotion) effect of species j on species i, respectively. 10 signs (labeled by ‘×’) are falsely inferred, four signs (gray) are undetermined by the analyzed steady-state samples. c, d The robustness of the inference results in the presence of artificially added noise: xiI→xiI+ηu, where the random number u follows a uniform distribution U[-xiI,xiI], and η is the noise level. At each noise level, we run 50 different realizations. Yellow (or blue) means many (or few) inferred J ij keep the same sign among 50 different realizations. c. Many of the inferred J ij keep their signs in the presence of noise up to noise level η=0.3. d At η=0.04, we plot the percentage of unchanged signs for inferred Jacobian matrix in 50 different realizations. The ‘×’ labels correspond to the 10 falsely inferred signs shown in b. Five of the 10 falsely inferred interactions change their signs frequently even when the perturbation is very small, implying that the falsely inferred signs in b could be due to measurement noise in the experiments

Fig. 6
Fig. 6

Inferring interaction types in a synthetic community of maize roots with seven bacterial species. The data set consists of seven sextets and one septet. a Without considering the one septet, we analyze the seven sextets (steady-state samples involving six of the seven species). We use the brute-force method to infer the ecological interaction types, i.e., the sign-pattern of the Jacobian matrix. Blue (or red) means inhibition (or promotion) effect of species j on species i, respectively. One sign (gray) is undetermined from the seven sextets. b The changes of species abundance before and after, respectively, adding one species into a sextet. Blue (or red) corresponds to the decrease (or increase) of one species’ abundance after introducing a new species into sextets, respectively. Each column corresponds to a sextet (a six-baterial community), the name of the newly introduced species is marked in the top of each column. ‘×’ indicates the false prediction. There are in total 12 false predictions

Similar articles

Cited by

References

    1. Clemente J, Ursell L, Parfrey L, Knight R. The impact of the gut microbiota on human health: an integrative view. Cell. 2012;148:1258–1270. doi: 10.1016/j.cell.2012.01.035. - DOI - PMC - PubMed
    1. Flint H, Scott K, Louis P, Duncan S. The role of the gut microbiota in nutrition and health. Nat. Rev. Gastroenterol. Hepatol. 2012;9:577–589. doi: 10.1038/nrgastro.2012.156. - DOI - PubMed
    1. Schimel, J. & Schaeffer, S. Microbial control over carbon cycling in soil. Front. Microbiol.3, 348 (2012). - PMC - PubMed
    1. Nannipieri P, et al. Microbial diversity and soil functions. Eur. J. Soil Sci. 2003;54:655–670. doi: 10.1046/j.1351-0754.2003.0556.x. - DOI
    1. Berendsen R, Pieterse C, Bakker P. The rhizosphere microbiome and plant health. Trends Plant. Sci. 2012;17:478–486. doi: 10.1016/j.tplants.2012.04.001. - DOI - PubMed

Publication types

MeSH terms

LinkOut - more resources