Orzechowski, Patryk Moore, Jason H
Published in
Bioinformatics (Oxford, England)
In this paper, we present an open source package with the latest release of Evolutionary-based BIClustering (EBIC), a next-generation biclustering algorithm for mining genetic data. The major contribution of this paper is adding a full support for multiple graphics processing units (GPUs) support, which makes it possible to run efficiently large ge...
Jiang, Tao Zhou, Zuji Zhang, Zhendong Cao, Shuqi Wang, Yadong Liu, Yadong
Published in
Bioinformatics (Oxford, England)
Mobile genetic elements (MEs) are heritable mutagens that significantly contribute to genetic diseases. The advent of long-read sequencing technologies, capable of resolving large DNA fragments, offers promising prospects for the comprehensive detection of ME variants (MEVs). However, achieving high precision while maintaining recall performance re...
Rimal, Prawin Panday, Shailesh Kumar Xu, Wang Peng, Yunhui Alexov, Emil
Published in
Bioinformatics (Oxford, England)
Mutations in protein-protein interactions can affect the corresponding complexes, impacting function and potentially leading to disease. Given the abundance of membrane proteins, it is crucial to assess the impact of mutations on the binding affinity of these proteins. Although several methods exist to predict the binding free energy change due to ...
Mohammad-Taheri, Sara Navada, Pruthvi Prakash Hoyt, Charles Tapley Zucker, Jeremy Sachs, Karen Gyori, Benjamin M Vitek, Olga
Published in
Bioinformatics (Oxford, England)
We introduce Eliater, a Python package for estimating the effect of perturbation of an upstream molecule on a downstream molecule in a biomolecular network. The estimation takes as input a biomolecular network, observational biomolecular data, and a perturbation of interest, and outputs an estimated quantitative effect of the perturbation. We showc...
Lyu, Yue Lin, Steven H Wu, Hao Li, Ziyi
Published in
Bioinformatics (Oxford, England)
The growing number of single-cell RNA-seq (scRNA-seq) studies highlights the potential benefits of integrating multiple datasets, such as augmenting sample sizes and enhancing analytical robustness. Inherent diversity and batch discrepancies within samples or across studies continue to pose significant challenges for computational analyses. Questio...
Linderman, Michael D Wallace, Jacob van der Heyde, Alderik Wieman, Eliza Brey, Daniel Shi, Yiran Hansen, Peter Shamsi, Zahra Liu, Jeremiah Gelb, Bruce D
...
Published in
Bioinformatics (Oxford, England)
Structural variants (SVs) play a causal role in numerous diseases but can be difficult to detect and accurately genotype (determine zygosity) with short-read genome sequencing data (SRS). Improving SV genotyping accuracy in SRS data, particularly for the many SVs first detected with long-read sequencing, will improve our understanding of genetic va...
Gisdon, Florian J Zunker, Mariella Wolf, Jan Niclas Prüfer, Kai Ackermann, Jörg Welsch, Christoph Koch, Ina
Published in
Bioinformatics (Oxford, England)
The functional complexity of biochemical processes is strongly related to the interplay of proteins and their assembly into protein complexes. In recent years, the discovery and characterization of protein complexes have substantially progressed through advances in cryo-electron microscopy, proteomics, and computational structure prediction. This d...
Groot Koerkamp, Ragnar Ivanov, Pesho
Published in
Bioinformatics (Oxford, England)
Sequence alignment has been at the core of computational biology for half a century. Still, it is an open problem to design a practical algorithm for exact alignment of a pair of related sequences in linear-like time. We solve exact global pairwise alignment with respect to edit distance by using the A* shortest path algorithm. In order to efficien...
Liu, Erhu Lyu, Hongqiang Liu, Yuan Fu, Laiyi Cheng, Xiaoliang Yin, Xiaoran
Published in
Bioinformatics (Oxford, England)
Topologically associating domains (TADs) are fundamental building blocks of 3D genome. TAD-like domains in single cells are regarded as the underlying genesis of TADs discovered in bulk cells. Understanding the organization of TAD-like domains helps to get deeper insights into their regulatory functions. Unfortunately, it remains a challenge to ide...
Ji, Fahu Zhou, Qian Ruan, Jue Zhu, Zexuan Liu, Xianming
Published in
Bioinformatics (Oxford, England)
Seeding is a rate-limiting stage in sequence alignment for next-generation sequencing reads. The existing optimization algorithms typically utilize hardware and machine-learning techniques to accelerate seeding. However, an efficient solution provided by professional next-generation sequencing compressors has been largely overlooked by far. In addi...