An accurate and exact clustering algorithm for next generation sequencing metagenomic sequences

dc.authoridBhat, Ashaq Hussain/0000-0003-1706-8442
dc.authoridCengiz, Korhan/0000-0001-6594-8861
dc.authorwosidBhat, Ashaq Hussain/C-7541-2018
dc.authorwosidCengiz, Korhan/HTN-8060-2023
dc.contributor.authorBhat, Ashaq Hussain
dc.contributor.authorNguyen, Tu N.
dc.contributor.authorCengiz, Korhan
dc.contributor.authorPrabhu, Puniethaa
dc.date.accessioned2024-06-12T10:50:11Z
dc.date.available2024-06-12T10:50:11Z
dc.date.issued2021
dc.departmentTrakya Üniversitesien_US
dc.description.abstractClustering algorithms are the essential tools in the target metagenomics, used to perform the taxonomic profiling of microbial communities. In the present study, an algorithmic tool called hash-based exact alignment (HBEA) clustering algorithm is presented, which uses exact pairwise global alignment algorithm to improve the cluster quality and creates a hash table for extraction of cluster representatives. The algorithm is de novo based and uses the general de facto 97% sequence similarity score to cluster the sequences. Our experimental investigation on various types of datasets with distinct parameters and attributes showed that HBEA produces better operational taxonomic unit (OTU) clusters and computational complexity than other algorithms.en_US
dc.identifier.doi10.1002/mma.7748
dc.identifier.issn0170-4214
dc.identifier.issn1099-1476
dc.identifier.scopus2-s2.0-85114663221en_US
dc.identifier.scopusqualityQ1en_US
dc.identifier.urihttps://doi.org/10.1002/mma.7748
dc.identifier.urihttps://hdl.handle.net/20.500.14551/17892
dc.identifier.wosWOS:000695173600001en_US
dc.identifier.wosqualityQ1en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherWileyen_US
dc.relation.ispartofMathematical Methods In The Applied Sciencesen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subject16S Rrnaen_US
dc.subjectClusteringen_US
dc.subjectHash Tableen_US
dc.subjectMetagenomicsen_US
dc.subjectNext Generation Sequencingen_US
dc.subjectOtusen_US
dc.subjectCollision-Avoidanceen_US
dc.subjectIdentificationen_US
dc.subjectSearchen_US
dc.subjectAlignmenten_US
dc.subjectTaxonomyen_US
dc.subjectProgramen_US
dc.subjectStateen_US
dc.titleAn accurate and exact clustering algorithm for next generation sequencing metagenomic sequencesen_US
dc.typeArticleen_US

Dosyalar