Variational Bayesian Neural Network to Prioritize Important Relationships of High-Dimensional Microbiome Multiomics Data
Method | Accuracy (SMAPE) | Feature Selection | Uncertainty Quantification | Computational Speed | Interpretability |
---|---|---|---|---|---|
VBayesMM
|
34.7% - 55.1%
Best across all datasets |
Probabilistic
Spike-and-Slab Prior |
Bayesian Posterior
Full uncertainty quantification |
1.5h - 120h+
Moderate (scales with complexity) |
Selection Probabilities
Highly interpretable |
MiMeNet
|
38.8% - 60.1%
Second best performance |
L2 Regularization
Standard MLP approach |
Point estimates
No uncertainty |
1.2h - 99h
Fastest among neural methods |
Limited context
Hyperparameter dependent |
MMvec
|
47.6% - 76.5%
Third place performance |
Equal treatment
No feature selection |
MAP estimates
No uncertainty |
1.5h - 120h+
Similar to VBayesMM |
Black box
Limited interpretability |
sPLS
|
67.5% - 93.1%
Poorest performance |
L1/L2 regularization
Manual tuning required |
No uncertainty
Deterministic output |
Slowest
Poor scalability |
Linear assumptions
Limited to linear relationships |
Given paired microbiome-metabolome dataset D = [X, Y] with K samples, N taxonomic units, and M metabolites, VBayesMM learns conditional probabilities:
Where $$\mathbf{u}_i, \mathbf{v}_j \in \mathbb{R}^L$$ are latent embeddings for microbes and metabolites respectively.
The core innovation lies in modeling each taxonomic unit with a mixture distribution that probabilistically determines feature relevance:
Where $$\gamma_i \in \{0,1\} : \text{acts as a binary selector}$$ $$\delta_0 : \text{represents the Dirac spike at zero for irrelevant features}$$ $$\sigma^2_{\text{slab}} : \text{controls the variance of important features}$$
We employ mean-field variational inference to approximate the intractable posterior distribution:
The variational parameters $$\Theta = \{\alpha_U, \beta_U, \xi, \alpha_V, \beta_V\}$$ are optimized via gradient descent on the Evidence Lower Bound (ELBO).
The optimization objective combines data likelihood with Bayesian regularization:
The first term ensures data fidelity while the KL divergence term provides automatic complexity control through Bayesian regularization.
Unlike traditional L1/L2 regularization, our spike-and-slab approach provides principled Bayesian feature selection with interpretable selection probabilities.
Bayesian neural network architecture provides full posterior distributions over parameters, enabling robust uncertainty estimation.
Variational inference transforms intractable MCMC sampling into efficient gradient-based optimization, achieving O(n log n) complexity.
Models microbe-metabolite relationships through conditional probabilities using multinomial likelihood with neural network embeddings.
Variational Bayesian inference pipeline with spike-and-slab priors in encoder network for sparse feature selection
Choose your preferred deep learning framework with identical functionality
v2.x Compatible
v1.x Compatible
Analysis of microbiome-metabolome relationships in mouse models of sleep apnea using 16S rRNA sequencing and LC-MS/MS metabolomics.
Investigation of diet-induced changes in microbiome-metabolome interactions in murine models of metabolic dysfunction.
Clinical application in cancer patients using whole-genome shotgun sequencing and CE-TOFMS metabolomics.
Let us first load the simple sample data to see an example of the VBayesMM. VBayesMM supports for loading arbitrary biom, tsv, and csv formats.
examples/
Train data | Test data |
---|---|
![]() |
![]() |
Latent microbiome matrix | Microbial species selection |
---|---|
![]() |
![]() |
Train data | Test data |
---|---|
![]() |
![]() |
Latent microbiome matrix | Microbial species selection |
---|---|
![]() |
![]() |
All of the code is in the src/
folder, you can use to re-make the analyses in the paper:
tensorflow/VBayesMM.py
:
file contains Python codes for VBayesMM method for TensorFlow User.
pytorch/VBayesMM.py
:
file contains Python codes for VBayesMM method for PyTorch User.
If you have any problem, please contact me via email: dangthanhtung91@vn-bml.com