Lingling An's home page

An effective normalization method for taxonomic analysis of metagenomic sequence count data:

We propose a novel scaling normalization method to meet with the challenges of taxonomic analysis of multiple metagenomic communities based on next generation sequencing data. A key issue on this topic is how to make the samples comparable among communities. The proposed normalization takes into account of abundance proportions of taxonomic units, and the under-sampling issue.

This is an introduction (README file) to using the R code for the proposed normalization method, and to generate simulated metagenomic data.

R codes: WSS Simulation

Example data (feature count and phenotype info)