Considerations and complications of mapping small RNA high-throughput data to transposable elements

Authors

BOUSIOS A.; GAUT B.S.; DARZENTAS Nikos

Year of publication 2017
Type Article in Periodical
Magazine / Source Mobile DNA
MU Faculty or unit

Central European Institute of Technology

Citation
Web https://mobilednajournal.biomedcentral.com/track/pdf/10.1186/s13100-017-0086-z?site=mobilednajournal.biomedcentral.com
Doi http://dx.doi.org/10.1186/s13100-017-0086-z
Keywords Transposable elements; Small RNAs; High-throughput sequencing; siRNAs; Genome mapping; Annotation; Bioinformatics; RNA-seq
Description Background: High-throughput sequencing (HTS) has revolutionized the way in which epigenetic research is conducted. When coupled with fully sequenced genomes, millions of small RNA (sRNA) reads are mapped to regions of interest and the results are scrutinized for clues about epigenetic mechanisms. However, this approach requires careful experimental design, especially when one investigates repetitive parts of genomes such as transposable elements (TEs), or when such genomes are large, as is often the case in plants. Results: Here, to shed light on the complications of mapping sRNAs to TEs, we focus on the 2,300 Mb maize genome, 85% of which is derived from TEs, and scrutinize methodological strategies that are commonly employed in TE studies. These include the choice of reference dataset, the normalization of multiply mapping sRNAs, and the selection among sRNA metrics. We further examine how these choices influence the relationship between sRNAs and the critical feature of TE age, and contrast their effect on low-copy genomic regions and other popular HTS data. Conclusions: Based on our analyses, we share a series of take-home messages that may help with the design, implementation, and interpretation of high-throughput TE epigenetic studies specifically, but our conclusions may also apply to any work that involves analysis of HTS data.
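To illustrate what "normalization of multiply mapping sRNAs" can mean in practice, the following is a minimal sketch, not the authors' pipeline. It assumes one common convention: a read that maps to n locations contributes 1/n to each location (weighted), compared with contributing a full count to every location (unweighted). The function name `count_srna_per_te`, the input layout, and the TE identifiers are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def count_srna_per_te(alignments, weighted=True):
    """Sum sRNA read support per TE.

    alignments: iterable of (read_id, te_ids), where te_ids holds one entry
    per mapping location of the read (hypothetical input layout).
    weighted=True  -> each location receives 1 / (number of locations).
    weighted=False -> each location receives a full count of 1.
    """
    totals = defaultdict(float)
    for _read_id, te_ids in alignments:
        n_hits = len(te_ids)
        if n_hits == 0:
            continue  # unmapped read: nothing to assign
        share = 1.0 / n_hits if weighted else 1.0
        for te in te_ids:
            totals[te] += share
    return dict(totals)


# Toy data (invented for illustration): three reads with 1, 2 and 4 locations.
alignments = [
    ("read1", ["TE_A"]),
    ("read2", ["TE_A", "TE_B"]),
    ("read3", ["TE_A", "TE_B", "TE_B", "TE_C"]),
]

weighted = count_srna_per_te(alignments, weighted=True)
unweighted = count_srna_per_te(alignments, weighted=False)
```

With weighting, the totals across all TEs add up to the number of mapped reads (3 here), whereas the unweighted totals add up to the number of mapping locations (7 here). In this toy case, the unweighted scheme gives repetitive targets more total signal than there are reads, which is the kind of effect that makes the normalization choice matter for TEs.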