Affordable Access

deepdyve-link
Publisher Website

MapRepeat: an approach for effective assembly of repetitive regions in prokaryotic genomes.

Authors
  • Mariano, Diego Cb1
  • Pereira, Felipe L2
  • Ghosh, Preetam3
  • Barh, Debmalya4
  • Figueiredo, Henrique Cp2
  • Silva, Artur5
  • Ramos, Rommel Tj5
  • Azevedo, Vasco Ac1
  • 1 Laboratório de Genética Celular e Molecular, Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, CEP 31270-901, Belo Horizonte, Minas Gerais, Brazil. , (Brazil)
  • 2 National Reference Laboratory for Aquatic Animal Diseases of Ministry of Fisheries and Aquaculture, Universidade Federal de Minas Gerais, CEP 31270-901, Belo Horizonte, Minas Gerais, Brazil. , (Brazil)
  • 3 Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, PurbaMedinipur, WB-721172, India ; Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia, USA. , (India)
  • 4 Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, PurbaMedinipur, WB-721172, India. , (India)
  • 5 Instituto de Ciências Biológicas, Universidade Federal do Pará, Rua Augusto Corrêa, 01 - Guamá, Belém, PA, Brazil. , (Brazil)
Type
Published Article
Journal
Bioinformation
Publisher
Biomedical Informatics
Publication Date
Jan 01, 2015
Volume
11
Issue
6
Pages
276–279
Identifiers
DOI: 10.6026/97320630011276
PMID: 26229287
Source
Medline
Keywords
Language
English
License
Unknown

Abstract

The newest technologies for DNA sequencing have led to the determination of the primary structure of the genomes of organisms, mainly prokaryotes, with high efficiency and at lower costs. However, the presence of regions with repetitive sequences, in addition to the short reads produced by the Next-Generation Sequencing (NGS) platforms, created a lot of difficulty in reconstructing the original genome in silico. Thus, even today, genome assembly continues to be one of the major challenges in bioinformatics specifically when repetitive sequences are considered. In this paper, we present an approach to assemble repetitive regions in prokaryotic genomes. Our methodology enables (i) the identification of these regions through visual tools, (ii) the characterization of sequences on the extremities of gaps and (iii) the extraction of consensus sequences based on mapping of raw data to a reference genome. We also present a case study on the assembly of regions that encode ribosomal RNAs (rRNA) in the genome of Corynebacterium ulcerans FRC11, in order to show the efficiency of the strategies presented here. The proposed methods and tools will help in finishing genome assemblies, besides reducing the running time and associated costs. All scripts are available at http://github.com/dcbmariano/maprepeat.

Report this publication

Statistics

Seen <100 times