Sankhya: The Indian Journal of Statistics

2001, Volume 63, Series B, Pt. 2, pp. 218--233

NON-DECIMATED WAVELET ANALYSIS OF BIOLOGICAL SEQUENCES: APPLICATIONS TO PROTEIN STRUCTURE AND GENOMICS

By

MARINA VANNUCCI, Texas A&M University, U.S.A.

and

PIETRO LIÒ, University of Cambridge, U.K.

SUMMARY. We investigate the potential of wavelet methods for the analysis of biological sequences, as a complement to currently available methods. Specifically, we show how non-decimated wavelet transforms, together with scale-by-scale decompositions of the wavelet variance and correlation, can be used to extract relevant structural features from proteins, such as helices in membrane proteins; to highlight similarities among different amino acid sequences; and to detect genomic regions whose composition differs from that of nearby regions.

AMS (1991) subject classification. 42C40, 65T60, 92D10, 92D20.

Key words and phrases. Non-decimated wavelet transform, wavelet variance and correlation, membrane proteins, genome analysis, G+C plot.
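
For readers unfamiliar with the techniques named in the summary, the following is a minimal illustrative sketch, not the authors' implementation. It assumes a Haar filter, circular boundary handling, and a simple estimate of the wavelet variance that averages all squared coefficients at each scale, boundary coefficients included. The function names (`gc_indicator`, `haar_modwt`, `wavelet_variance`) and the toy DNA string are hypothetical, chosen only for this example.

```python
def gc_indicator(seq):
    """Code a DNA string as 1.0 for G or C and 0.0 otherwise (assumed coding)."""
    return [1.0 if base in "GC" else 0.0 for base in seq.upper()]


def haar_modwt(x, levels):
    """Non-decimated Haar wavelet transform with circular boundaries.

    Returns (details, smooth). details[j - 1] holds the level-j wavelet
    coefficients; every series has the same length as x because no
    downsampling is performed.
    """
    v = [float(a) for a in x]
    n = len(v)
    details = []
    for j in range(1, levels + 1):
        shift = 2 ** (j - 1)
        # v[t - shift], wrapping around the start of the sequence
        lagged = [v[(t - shift) % n] for t in range(n)]
        # Rescaled Haar wavelet filter (1/2, -1/2)
        details.append([(a - b) / 2 for a, b in zip(v, lagged)])
        # Rescaled Haar scaling filter (1/2, 1/2)
        v = [(a + b) / 2 for a, b in zip(v, lagged)]
    return details, v


def wavelet_variance(details):
    """Mean squared wavelet coefficient at each scale (simplified estimate)."""
    return [sum(c * c for c in w) / len(w) for w in details]


if __name__ == "__main__":
    toy = "ATATATTAATGCGCGGCCGCGCATATTATA"  # toy sequence, not real data
    x = gc_indicator(toy)
    details, smooth = haar_modwt(x, levels=3)
    for j, var_j in enumerate(wavelet_variance(details), start=1):
        print(f"scale 2^{j - 1}: wavelet variance = {var_j:.4f}")
```

With these filters, the wavelet variances across scales plus the variance of the final smooth series sum to the sample variance of the input, which is what makes a scale-by-scale decomposition of variability possible.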
