A two-stage statistical procedure for feature selection and comparison in functional analysis of metagenomes:

 

In the first stage of the proposed procedure, the informative features are selected using elastic net as reducing the dimension of metagenomic data; in the second stage the differentially abundant features are detected using generalized linear models with a negative binomial distribution.

 

This is an introduction (README file) to using the R code for the proposed method for detection of significantly differentially abundant features of different metagenomic communities/conditions.

 

R code

 

Example data (feature count and phenotype info)