skip to main content
10.1145/1722024.1722048acmotherconferencesArticle/Chapter ViewAbstractPublication PagesisbConference Proceedingsconference-collections
research-article

Improving motif refinement using hybrid expectation maximization and random projection

Published: 15 February 2010 Publication History

Abstract

The main goal of the motif finding problem is to detect novel, over-represented unknown signals in a set of sequences. Popular algorithms like Expectation Maximization (EM) and Gibbs sampling are sensitive to the initial guesses and are known to converge to the nearest local maximum very quickly. A novel optimization framework searches the neighborhood regions of the initial alignments in a systematic manner to explore the multiple local optimal solutions. This effective search is achieved by transforming the original optimization problem into its corresponding dynamical system and estimating the practical stability boundary of the local maximum. The work aims at implementing the hybrid algorithm and enhancing it by trying different global methods and other techniques. Then aggregation methods rather than projection methods are tried.

References

[1]
Sagot M, 'Spelling Approximate Repeated or Common Motifs Using a Suffix Tree', Lecture Notes in Computer Science, 1380:111--127, 1998.
[2]
Vilo J, Brazma A, Jonassen I, Robinson A, Ukonnen E, 'Mining for Putative Regulatory Elements in the Yeast Genome using Gene Expression Data', In Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology AAAI Press San Diego, CA, pp:384--394, 2000.
[3]
E. Eskin, 'From Profiles to Patterns and Back Again: A Branch and Bound Algorithm for Finding Near Optimal Motif Profiles', Proceedings of the Eighth Annual International Conference on Research in Computational Molecular Biology, 115--124, 2004.
[4]
Lawrence CE, Reilly A A, 'An Expectation Maximization (EM) Algorithm for the Identification and Characterization of Common Sites in Unaligned Biopolymer Sequences', Proteins, 1990, 7:41--51.
[5]
Bailey TL, Elkan C, 'Unsupervised Learning of Multiple Motifs in Biopolymers using Expectation Maximization', Machine Learning, 21:51--80, 1995.
[6]
Lawrence CE, Altschul SF, Boguski MS, Liu JS, Neuwald AF, Wootton JC, 'Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment', Science, 262:208--214, 1993.
[7]
Liu JS, 'Monte Carlo Strategies in Scientific Computing', Springer Series in Statistics; 2001.
[8]
Liu J S, Neuwald A F, Lawrence CE, 'Bayesian Models for Multiple Local Sequence Alignment and Gibbs Sampling Strategies', J Amer Statist Assoc, 90:1156--1170, 1995.
[9]
Roth F. P., Hughes J. D., Estep P. W. and Church G. M, 'Finding DNA Regulatory Motifs within Unaligned Noncoding Sequences Clustered by Whole-Genome MRNA Quantization', Nat. Biotechnol., 16, 939--945, 1998.
[10]
Frith M C, Hansen U, Sponge J L, Weng Z, 'Finding Functional Sequence Elements by Multiple Local Alignment', Nucleic acid Res 32 189--200, 2004.
[11]
Liu X, Brutlag D. L., Liu J S, 'BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-expressed Genes', Symposium on Biocomputing, 6, 127--138, 2001.
[12]
Eskin E, Pevzner P, 'Finding Composite Regulatory Patternsin DNA Sequence', Bioinformatics, 18: 354--363, 2002.
[13]
K. Rose, 'Deterministic Annealing for Clustering, Compression, Classification, Regression, and Related Optimization Problems', Proceedings. IEEE, vol. 86, no. 11, pp. 2210--2239, 1998.
[14]
N. Ueda and R. Nakano, 'Deterministic Annealing EM Algorithm', Neural Networks, vol. 11, no. 2, pp. 271--282, 1998.
[15]
F. Pernkopf, D. Bouchaffra, 'Genetic-Based EM Algorithm for Learning Gaussian Mixture Models', IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 8, pp. 1344--1348, Aug. 2005.
[16]
A. M. Martinez and J. Vitri, 'Learning Mixture Models Using a Genetic Version of the EM Algorithm', Pattern Recognition Letters, vol. 21, no. 8, pp. 759--769, 2000.

Cited By

View all

Index Terms

  1. Improving motif refinement using hybrid expectation maximization and random projection

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ISB '10: Proceedings of the International Symposium on Biocomputing
    February 2010
    312 pages
    ISBN:9781605587226
    DOI:10.1145/1722024
    • Conference Chair:
    • Dan Tulpan,
    • Program Chairs:
    • Mathew J. Palakal,
    • K. A. Abdul Nazeer,
    • Aswati Nair R.
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Indiana University Purdue University, Indianapolis
    • National Institute of Technology, Calicut, India

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. expectation maximization
    2. motif finding
    3. random projection
    4. refinement

    Qualifiers

    • Research-article

    Conference

    ISB '10
    Sponsor:
    ISB '10: International Symposium on BioComputing
    February 15 - 17, 2010
    Kerala, Calicut, India

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 06 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media