- Room No: G.2.11
- Telephone: +64 7 838 4466
- Extension: 8766
- Facsimile: +64 7 838 4155
BCMS Honours Topic: "Information-based metagenomics."
Metagenomics is the study of metagenomes, genetic material recovered directly from environmental samples. Metagenomic samples can contain reads from a huge number of organisms, a significant proportion of which may be unknown. We want to be able to classify these reads the best we can into their individual species, a problem compounded by the fact that these reads are also inherently noisy.
Working from the assumption that genetic material within a species has recurring patterns, a possible method of classifying these reads would be to apply compression. The more similar the data, the more compressed it will be. Analysis on how much a set of data was compressed will then bring insights to how well a read fits in with other reads, which can now be a basis for classification. My project aims to apply this approach to DNA clustering and assembly for metagenomics.