Abstract
In the burgeoning field of proteins, the effective analysis of intricate protein data remains a formidable challenge, necessitating advanced computational tools for data processing, feature extraction, and interpretation. This study introduces ProteinFlow, an innovative framework designed to revolutionize feature engineering in protein data analysis. ProteinFlow stands out by offering enhanced efficiency in data collection and preprocessing, along with advanced capabilities in feature extraction, directly addressing the complexities inherent in multidimensional protein data sets. Through a comparative analysis, ProteinFlow demonstrated a significant improvement over traditional methods, notably reducing data preprocessing time and expanding the scope of biologically significant features identified. The framework's parallel data processing strategy and advanced algorithms ensure not only rapid data handling but also the extraction of comprehensive, meaningful insights from protein sequences, structures, and interactions. Furthermore, ProteinFlow exhibits remarkable scalability, adeptly managing large-scale data sets without compromising performance, a crucial attribute in the era of big data.
| Original language | English |
|---|---|
| Pages (from-to) | 3563-3571 |
| Number of pages | 9 |
| Journal | Biotechnology and Bioengineering |
| Volume | 121 |
| Issue number | 11 |
| DOIs | |
| Publication status | Published - Nov 2024 |
Keywords
- data preprocessing
- feature engineering
- multidimensional feature extraction
- protein data analysis
- proteins
Fingerprint
Dive into the research topics of 'ProteinFlow: An advanced framework for feature engineering in protein data analysis'. Together they form a unique fingerprint.Press/Media
-
University College Cork Reports Findings in Data Analytics (ProteinFlow: An advanced framework for feature engineering in protein data analysis)
7/08/24
1 item of Media coverage
Press/Media
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver