• Login
    View Item 
    •   Repository
    • Open Access Articles
    • Open Access Journals
    • Biology and Biotechnology
    • View Item
    •   Repository
    • Open Access Articles
    • Open Access Journals
    • Biology and Biotechnology
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis

    Thumbnail
    View/Open
    Full text (561.6Kb)
    Date
    2011-07
    Author
    Meher, Jayakishan K.
    Dash, Gananath N.
    Meher, Pramod K.
    Raval, Mukesh K.
    Metadata
    Show full item record
    Abstract
    Development of efficient gene prediction algorithms is one of the fundamental efforts in gene prediction study in the area of genomics. In genomic signal processing the basic step of the identification of protein coding regions in DNA sequences is based on the period-3 property exhibited by nucleotides in exons. Several approaches based on signal processing tools and numerical representations have been applied to solve this problem, trying to achieve more accurate predictions. This paper presents a new indicator sequence based on amino acid sequence, called as aminoacid indicator sequence, derived from DNA string that uses the existing signal processing based time-domain and frequency domain methods to predict these regions within the billions long DNA sequence of eukaryotic cells which reduces the computational load by one-third. It is known that each triplet of bases, called as codon, instructs the cell machinery to synthesize an amino acid. The codon sequence therefore uniquely identifies an amino acid sequence which defines a protein. Thus the protein coding region is attributed by the codons in amino acid sequence. This property is used for detection of period-3 regions using amino acid sequence. Physico-chemical properties of amino acids are used for numerical representation. Various accuracy measures such as exonic peaks, discriminating factor, sensitivity, specificity, miss rate, wrong rate and approximate correlation are used to demonstrate the efficacy of the proposed predictor. The proposed method is validated on various organisms using the standard data-set HMR195, Burset and Guigo and KEGG. The simulation result shows that the proposed method is an effective approach for protein coding prediction.
    URI
    http://dx.doi.org/10.4236/ajmb.2011.12010
    http://hdl.handle.net/123456789/970
    Collections
    • Biology and Biotechnology [193]

    University of Embu copyright © 2025 2021
    Contact us | Send Feedback
    Library ER 
    Atmire NV
     

     

    Browse

    All of RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    University of Embu copyright ©  2021
    Contact us | Send Feedback
    Library ER 
    Atmire NV