Evolutionary Bioinformatics
Synopsis: An open access, peer reviewed electronic journal that covers computational evolutionary biology and evolutionary bioinformatics.
Indexed by: Pubmed, Pubmed Central, CAS, DOAJ, Embase, Embiology, Science Citation Index Expanded (SciSearch®), Journal Citation Reports/Science Edition, Current Contents®/Agriculture, Biology and Environmental Sciences, Current Contents®/Life Sciences, Google Scholar, EBSCO Academic Search Complete, OAIster.
Visibility: Most popular article read 3200+ times.
Processing time: Decision in 2 weeks for 90% of papers.
A Fast and Specific Alignment Method for Minisatellite Maps
Sèverine Bérard1, François Nicolas2, Jérôme Buard3, Olivier Gascuel2 and Eric Rivals2
1INRA, Département MIA, Toulouse, France. 2LIRMM, UMR 5506 CNRS-Université de Montpellier II, Montpellier, France. 3Institut de Génétique Humaine, UPR-CNRS 1142, Montpellier, France.
Abstract
Background: Variable minisatellites count among the most polymorphic markers of eukaryotic and prokaryotic genomes. This variability can affect gene coding regions, like in the prion protein gene, or gene regulation regions, like for the cystatin B gene, and be associated or implicated in diseases: the Creutzfeld-Jakob disease and the myoclonus epilepsy type 1, for our examples. When it affects neutrally evolving regions, the polymorphism in length (i.e., in number of copies) of minisatellites proved useful in population genetics.
Motivation: In these tandem repeat sequences, different mutational mechanisms let the number of copies, as well as the copies themselves, vary. Especially, the interspersion of events of tandem duplication/contraction and of punctual mutation makes the succession of variant repeats much more informative than the sole allele length. To exploit this information requires the ability to align minisatellite alleles by accounting for both punctual mutations and tandem duplications.
Results: We propose a minisatellite maps alignment program that improves on previous solutions. Our new program is faster, simpler, considers an extended evolutionary model, and is available to the community. We test it on the data set of 609 alleles of the MSY1 (DYF155S1) human minisatellite and confirm its ability to recover known evolutionary signals. Our experiments highlight that the informativeness of minisatellites resides in their length and composition polymorphisms. Exploiting both simultaneously is critical to unravel the implications of variable minisatellites in the control of gene expression and diseases.
Availability: Software is available at http://atgc.lirmm.fr/ms_align/
Readers of this also read:
- Identification of Conflicting Selective Effects on Highly Expressed Genes
- MapToGenome: A Comparative Genomic Tool that Aligns Transcript Maps to Sequenced Genomes
- On the Adaptive Design Rules of Biochemical Networks in Evolution
- LASER: A Maximum Likelihood Toolkit for Detecting Temporal Shifts in Diversification Rates From Molecular Phylogenies
- Improved Heuristics for Minimum-Flip Supertree Construction
- 02/Oct/2009
A Model for Protein Sequence Evolution Based on Selective Pressure for Protein Stability: Application to Hemoglobins
- 27/Aug/2009
On the Use of Bootstrapped Topologies in Coalescent-Based Bayesian MCMC Inference: A Comparison of Estimation and Computational Efficiencies
- 31/Jul/2009
PhyLIS: A Simple GNU/Linux Distribution for Phylogenetics and Phyloinformatics
- 30/Jul/2009
A Comprehensive Analysis of Gene Expression Evolution Between Humans and Mice
- 06/Jul/2009
Initial Implementation of a Comparative Data Analysis Ontology
- 03/Jul/2009
A Mixture Model and a Hidden Markov Model to Simultaneously Detect Recombination Breakpoints and Reconstruct Phylogenies
- 25/Jun/2009
Phylogenetic Applications of the Minimum Contradiction Approach on Continuous Characters
- 11/Jun/2009
Phylogenetic Analyses Reveal Monophyletic Origin of the Ergot Alkaloid Gene dmaW in Fungi
- 04/Jun/2009
Evolution and Functional Diversification of the GLI Family of Transcription Factors in Vertebrates
- 18/May/2009
REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era
- 06/May/2009
The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics
- 03/Nov/2008
Evidence for a Complex Mosaic Genome Pattern in a Full-length Hepatitis C Virus Sequence
- 30/Oct/2008
Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
- 09/Oct/2008
Bayesian, Maximum Parsimony and UPGMA Models for Inferring the Phylogenies of Antelopes Using Mitochondrial Markers
- 06/Oct/2008
Minimum Contradiction Matrices in Whole Genome Phylogenies
- 21/Jul/2008
Estimation of Genetic Effects and Genotype-Phenotype Maps
- 28/Jun/2008
Testing the Accuracy of Eukaryotic Phylogenetic Profiles for Prediction of Biological Function
- 18/Jun/2008
Electrostatic and Functional Analysis of the Seven-Bladed WD β-Propellers
- 13/Jun/2008
Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification
- 26/May/2008
Exploring the Evolutionary History of the Differentially Expressed Genes between Human Populations: Action of Recent Positive Selection
- 15/May/2008
Maximum Gene-Support Tree
- 15/May/2008
Examining Ancient Inter-domain Horizontal Gene Transfer
- 09/May/2008
Strategies for Reliable Exploitation of Evolutionary Concepts in High Throughput Biology
- 08/May/2008
Dissection of Functional Residues in Receptor Activity-Modifying Proteins Through Phylogenetic and Statistical Analyses
- 28/Apr/2008
Assessing the Evolution of Gene Expression Using Microarray Data
- 24/Apr/2008
Identifying Coevolving Partners from Paralogous Gene Families
- 24/Apr/2008
The ABCs of MGR with DCJ
- 10/Apr/2008
Comparative Evolutionary Histories of the Fungal Chitinase Gene Family Reveal Non-Random Size Expansions and Contractions due to Adaptive Natural Selection
- 18/Mar/2008
Expected Anomalies in the Fossil Record
- 18/Mar/2008
How to Summarize Estimates of Ancestral Divergence Times
- 18/Mar/2008
A Simple Derivation of the Distribution of Pairwise Local Protein Sequence Alignment Scores
- 14/Feb/2008
Why Should We Care About Molecular Coevolution?
- 14/Feb/2008
Detecting Microsatellites in Genome Data: Variance in Definitions and Bioinformatic Approaches Cause Systematic Bias
- 09/Feb/2008
Reverse Conservation Analysis Reveals the Specificity Determining Residues of Cytochrome P450 Family 2 (CYP 2)
- 09/Feb/2008
SPR Distance Computation for Unrooted Trees
- 09/Feb/2008
Evolution of the Influenza A Virus: Some New Advances
- 30/Jan/2008
The 2006 NESCent Phyloinformatics Hackathon: A Field Report
- 14/Dec/2007
Phylogenetic Study of the Evolution of PEP-Carboxykinase
- 11/Dec/2007
Efficient Tree Searches with Available Algorithms
- 12/Nov/2007
Structural Evolution of the ABC Transporter Subfamily B
- 08/Nov/2007
An Insulin-Like Modular Basis for the Evolution of Glucose Transporters (GLUT) with Implications for Diabetes
- 15/Oct/2007
Minimizing Contradictions on Circular Order of Phylogenic Trees
- 11/Oct/2007
Selection Effects on the Positioning of Genes and Gene Structures from the Interplay of Replication and Transcription in Bacterial Genomes
- 09/Oct/2007
Phylogenomic Analysis Reveals Extensive Phylogenetic Mosaicism in the Human GPCR Superfamily
- 26/Sep/2007
Non-Molecular-Clock-Like Evolution following Viral Origins in Homo sapiens
- 26/Sep/2007
Underlying Principles of Natural Selection in Network Evolution: Systems Biology Approach
- 26/Sep/2007
Hedging Our Bets: The Expected Contribution of Species to Future Phylogenetic Diversity
- 25/Sep/2007
Genome Signatures, Self-Organizing Maps and Higher Order Phylogenies: A Parametric Analysis
- 17/Sep/2007
The GC Skew Index: A Measure of Genomic Compositional Asymmetry and the Degree of Replicational Selection
- 06/Sep/2007
Interactions Between SNP Alleles at Multiple Loci and Variation in Skin Pigmentation in 122 Caucasians
- 06/Sep/2007
Identifcation of Novel Retroid Agents in Danio rerio, Oryzias latipes, Gasterosteus aculeatus and Tetraodon nigroviridis
- 06/Sep/2007
Gathering Computational Genomics and Proteomics to Unravel Adaptive Evolution
- 06/Sep/2007
Prediction of Protein-protein Interactions on the Basis of Evolutionary Conservation of Protein Functions
- 08/Aug/2007
Characterization and Evolution of the Cell Cycle-Associated Mob Domain-Containing Proteins in Eukaryotes
- 08/Aug/2007
Environmental Quality, Developmental Plasticity and the Thrifty Phenotype: A Review of Evolutionary Models
- 19/Jun/2007
Choosing and Using Introns in Molecular Phylogenetics
- 14/Jun/2007
Fast Genes and Slow Clades: Comparative Rates of Molecular Evolution in Mammals
- 31/May/2007
A Reduction Algorithm for Computing The Hybridization Number of Two Trees
- 30/May/2007
An Improved Implementation of Codon Adaptation Index
- 17/May/2007
Mammoth and Elephant Phylogenetic Relationships: Mammut Americanum, the Missing Outgroup
- 29/Mar/2007
Mlcoalsim: Multilocus Coalescent Simulations
- 02/Mar/2007
Why Bacteriophage Encode Exotoxins and other Virulence Factors
- 28/Feb/2007
On the Adaptive Design Rules of Biochemical Networks in Evolution
- 28/Feb/2007
Improved Heuristics for Minimum-Flip Supertree Construction
- 28/Feb/2007
The identification of functional motifs in temporal gene expression analysis
- 27/Feb/2007
Outsourcing the Nucleus: Nuclear Pore Complex Genes are no Longer Encoded in Nucleomorph Genomes
- 27/Feb/2007
MySSP: Non-stationary evolutionary sequence simulation, including indels
- 26/Feb/2007
Topological Bias in Distance-Based Phylogenetic Methods: Problems with Over- and Underestimated Genetic Distances
- 26/Feb/2007
Estimation of Phylogeny Using a General Markov Model
- 25/Feb/2007
Pattern-Based Phylogenetic Distance Estimation and Tree Reconstruction
- 25/Feb/2007
Evolution of proteins and proteomes: a phylogenetics approach
- 24/Feb/2007
Multiple Property Tolerance Analysis for the Evaluation of Missense Mutations
- 24/Feb/2007
Arlequin (version 3.0): An integrated software package for population genetics data analysis
- 23/Feb/2007
ALFRED: An Allele Frequency Database for Microevolutionary Studies
- 22/Feb/2007
A Fast and Specific Alignment Method for Minisatellite Maps
- 22/Feb/2007
Phylogenetic biodiversity assessment based on systematic nomenclature
- 21/Feb/2007
Genome Halving with an Outgroup
- 21/Feb/2007
Fast Structural Search in Phylogenetic Databases
- 20/Feb/2007
Estimating the Relative Order of Speciation or Coalescence Events on a Given Phylogeny
- 20/Feb/2007
The Role of the Phylogenetic Diversity Measure, PD, in Bio-informatics: Getting the Definition Right
- 19/Feb/2007
Co-evolutionary Rates of Functionally Related Yeast Genes
- 18/Feb/2007
A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences
- 17/Feb/2007
Phylogenetic diversity (PD) and biodiversity conservation: some bioinformatics challenges
- 17/Feb/2007
Minimum Conflict Individual Haplotyping from SNP Fragments and Related Genotype
- 16/Feb/2007
Gaussian Quadrature Formulae for Arbitrary Positive Measures
- 15/Feb/2007
Identification of Conflicting Selective Effects on Highly Expressed Genes
- 14/Feb/2007
MapToGenome: A Comparative Genomic Tool that Aligns Transcript Maps to Sequenced Genomes
- 14/Feb/2007
LASER: A Maximum Likelihood Toolkit for Detecting Temporal Shifts in Diversification Rates From Molecular Phylogenies
- 14/Feb/2007
Total Evidence, Average Consensus and Matrix Representation with Parsimony: What a Difference Distances Make
- 13/Feb/2007
Estimating Population Parameters using the Structured Serial Coalescent with Bayesian MCMC Inference when some Demes are Hidden
- 12/Feb/2007
Unravelling Selection Shifts Among Foot-and-Mouth Disease Virus (FMDV) Serotypes
- 11/Feb/2007
PIRSF Family Classification System for Protein Functional and Evolutionary Analysis
- 10/Feb/2007
Spectral Analysis of Sequence Variability in Basic-Helix-loop-helix (bHLH) Protein Domains
- 09/Feb/2007
CoMET: A Mesquite package for comparing models of continuous character evolution on phylogenies
- 07/Feb/2007
MBEToolbox 2.0: An enhanced version of a MATLAB toolbox for Molecular Biology and Evolution
- 06/Feb/2007
Modelling prokaryote gene content
- 05/Feb/2007
Assessing the Applicability of the GTR Nucleotide Substitution Model Through Simulations
- 04/Feb/2007
Using minimum bootstrap support for splits to construct confidence regions for trees
- 03/Feb/2007
A new effective method for estimating missing values in the sequence data prior to phylogenetic analysis
- 01/Feb/2007