loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: David Medina-Ortiz 1 ; 2 ; Gabriel Cabas-Mora 2 ; Iván Moya 2 ; Nicole Soto-García 2 and Roberto Uribe-Paredes 2

Affiliations: 1 Centre for Biotechnology and Bioengineering, CeBiB, Universidad de Chile, Beauchef 851, Santiago, Chile ; 2 Departamento de Ingeniería En Computación, Universidad de Magallanes, Avenida Bulnes 01855, Punta Arenas, Chile

Keyword(s): DNA-Binding Proteins, Single-Stranded and Double-Stranded DNA, Machine Learning, Protein Language Models.

Abstract: DNA-binding proteins play crucial roles in biological processes such as replication, transcription, pack-aging, and chromatin remodeling. Their study has gained importance across scientific fields, with computational biology complementing traditional methods. While machine learning has advanced bioinformatics, generalizable pipelines for identifying DNA-binding proteins and their specific interactions remain scarce. We present RUDEUS, a Python library with hierarchical classification models to identify DNA-binding proteins and distinguish between single- and double-stranded DNA interactions. RUDEUS integrates protein language models, supervised learning, and Bayesian optimization, achieving 95% precision in DNA-binding identification and 89% accuracy in distinguishing interaction types. The library also includes tools for annotating unknown sequences and validating DNA-protein interactions through molecular docking. RUDEUS delivers competitive performance and is easily integrated int o protein engineering workflows. It is available under the MIT License, with the source code and models available on the GitHub repository https://github.com/ProteinEngineering-PESB2/RUDEUS. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 74.48.170.251

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Medina-Ortiz, D. ; Cabas-Mora, G. ; Moya, I. ; Soto-García, N. and Uribe-Paredes, R. (2024). RUDEUS: A Machine Learning Classification System to Study DNA-Binding Proteins. In Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR; ISBN 978-989-758-716-0; ISSN 2184-3228, SciTePress, pages 302-310. DOI: 10.5220/0012946500003838

@conference{kdir24,
author={David Medina{-}Ortiz and Gabriel Cabas{-}Mora and Iván Moya and Nicole Soto{-}García and Roberto Uribe{-}Paredes},
title={RUDEUS: A Machine Learning Classification System to Study DNA-Binding Proteins},
booktitle={Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR},
year={2024},
pages={302-310},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012946500003838},
isbn={978-989-758-716-0},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
TI - RUDEUS: A Machine Learning Classification System to Study DNA-Binding Proteins
SN - 978-989-758-716-0
IS - 2184-3228
AU - Medina-Ortiz, D.
AU - Cabas-Mora, G.
AU - Moya, I.
AU - Soto-García, N.
AU - Uribe-Paredes, R.
PY - 2024
SP - 302
EP - 310
DO - 10.5220/0012946500003838
PB - SciTePress