
ProteinFlow: An advanced framework for feature engineering in protein data analysis

  • Yanlin Mi
  • Stefan Bogdan Marcu
  • Venkata V.B. Yallapragada
  • Sabin Tabirca
  • University College Cork
  • Munster Technological University
  • Transilvania University of Brasov

Research output: Contribution to journal › Article › peer-review

Abstract

In the burgeoning field of proteins, the effective analysis of intricate protein data remains a formidable challenge, necessitating advanced computational tools for data processing, feature extraction, and interpretation. This study introduces ProteinFlow, an innovative framework designed to revolutionize feature engineering in protein data analysis. ProteinFlow stands out by offering enhanced efficiency in data collection and preprocessing, along with advanced capabilities in feature extraction, directly addressing the complexities inherent in multidimensional protein data sets. Through a comparative analysis, ProteinFlow demonstrated a significant improvement over traditional methods, notably reducing data preprocessing time and expanding the scope of biologically significant features identified. The framework's parallel data processing strategy and advanced algorithms ensure not only rapid data handling but also the extraction of comprehensive, meaningful insights from protein sequences, structures, and interactions. Furthermore, ProteinFlow exhibits remarkable scalability, adeptly managing large-scale data sets without compromising performance, a crucial attribute in the era of big data.

Original language: English
Pages (from-to): 3563-3571
Number of pages: 9
Journal: Biotechnology and Bioengineering
Volume: 121
Issue number: 11
Publication status: Published - Nov 2024

Keywords

  • data preprocessing
  • feature engineering
  • multidimensional feature extraction
  • protein data analysis
  • proteins
