PATO: genome-wide prediction of lncRNA-DNA triple helices. 2023

Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
Computer Architecture Group, Department of Computer Engineering, CITIC, Universidade da Coruña, Campus de Elviña, A Coruña 15071, Spain.

Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different molecular mechanisms, including direct RNA-DNA hybridization via triplexes, cotranscriptional RNA-RNA interactions, and RNA-DNA binding mediated by protein complexes. While the functional annotation of lncRNA transcripts has been widely studied over the last 20 years, barely a handful of tools have been developed with the specific purpose of detecting and evaluating lncRNA-DNA triple helices. What is worse, some of these tools have nearly grown a decade old, making new triplex-centric pipelines depend on legacy software that cannot thoroughly process all the data made available by next-generation sequencing (NGS) technologies. We present PATO, a modern, fast, and efficient tool for the detection of lncRNA-DNA triplexes that matches NGS processing capabilities. PATO enables the prediction of triple helices at the genome scale and can process in as little as 1 h more than 60 GB of sequence data using a two-socket server. Moreover, PATO's efficiency allows a more exhaustive search of the triplex-forming solution space, and so PATO achieves higher levels of prediction accuracy in far less time than other tools in the state of the art. Source code, user manual, and tests are freely available to download under the MIT License at https://github.com/UDC-GAC/pato.

UI MeSH Term Description Entries
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D062085 RNA, Long Noncoding A class of untranslated RNA molecules that are typically greater than 200 nucleotides in length and do not code for proteins. Members of this class have been found to play roles in transcriptional regulation, post-transcriptional processing, CHROMATIN REMODELING, and in the epigenetic control of chromatin. LincRNA,RNA, Long Untranslated,LINC RNA,LincRNAs,Long Intergenic Non-Protein Coding RNA,Long Non-Coding RNA,Long Non-Protein-Coding RNA,Long Noncoding RNA,Long ncRNA,Long ncRNAs,RNA, Long Non-Translated,lncRNA,Long Intergenic Non Protein Coding RNA,Long Non Coding RNA,Long Non Protein Coding RNA,Long Non-Translated RNA,Long Untranslated RNA,Non-Coding RNA, Long,Non-Protein-Coding RNA, Long,Non-Translated RNA, Long,Noncoding RNA, Long,RNA, Long Non Translated,RNA, Long Non-Coding,RNA, Long Non-Protein-Coding,Untranslated RNA, Long,ncRNA, Long,ncRNAs, Long

Related Publications

Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
February 1995, Archives of biochemistry and biophysics,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
January 2022, Computational and structural biotechnology journal,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
January 2020, International journal of molecular sciences,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
November 1994, Biochemistry,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
April 2000, Biological chemistry,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
September 2017, Nucleic acids research,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
June 2000, Journal of biomolecular structure & dynamics,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
June 2014, BMC bioinformatics,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
December 2015, Nature communications,
Iñaki Amatria-Barral, and Jorge González-Domínguez, and Juan Touriño
August 2008, Biochimie,
Copied contents to your clipboard!