Path-based Connectivity for Clustering Genome Sequences


Sengel O., Kursun O.

38th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC), Florida, United States Of America, 16 - 20 August 2016, pp.3092-3095

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1109/embc.2016.7591383
  • City: Florida
  • Country: United States Of America
  • Page Numbers: pp.3092-3095

Abstract

Clustering is an unsupervised data mining tool. In bioinformatics, clustering genome sequences is used to group related biological sequences when no additional supervision is available. Sequence clusters are often related to gene/protein families, which can shed some light on determining tertiary structures. Extracting such hidden and valuable structures from a data set of genome sequences can benefit from better clustering methods such as the recently popular Spectral Clustering. In this study, we apply spectral clustering and its improved variations to the sequence clustering task in our efforts to develop a novel approach for improving it.
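
To make the idea of spectral clustering on sequences concrete, here is a minimal sketch in Python. It is not the authors' method. The abstract does not specify a similarity measure or a number of clusters, so the 2-mer count profiles, the cosine similarity, the two-cluster split on the sign of the second eigenvector, and the toy sequences are all assumptions chosen for illustration. The function names `kmer_profile` and `spectral_bipartition` are hypothetical.

```python
from collections import Counter
from itertools import product

import numpy as np


def kmer_profile(seq, k=2, alphabet="ACGT"):
    """Count vector over all k-mers of the alphabet (assumed feature choice)."""
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return np.array([counts[km] for km in kmers], dtype=float)


def spectral_bipartition(sequences, k=2):
    """Split sequences into two groups using the normalized graph Laplacian.

    Assumes every sequence has at least one k-mer and a nonzero similarity
    to at least one other sequence; otherwise the normalization divides by zero.
    """
    # Unit-length k-mer profiles, so the dot product is the cosine similarity.
    X = np.array([kmer_profile(s, k) for s in sequences])
    X /= np.linalg.norm(X, axis=1, keepdims=True)

    # Affinity matrix with self-similarities removed.
    W = X @ X.T
    np.fill_diagonal(W, 0.0)

    # Symmetric normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}.
    d = W.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    L = np.eye(len(sequences)) - D_inv_sqrt @ W @ D_inv_sqrt

    # eigh returns eigenvalues in ascending order; the eigenvector of the
    # second-smallest eigenvalue indicates the two-way split by its sign.
    _, eigvecs = np.linalg.eigh(L)
    return (eigvecs[:, 1] > 0).astype(int)


# Toy data (made up): three AT-rich and three GC-rich sequences.
seqs = [
    "ATATATATGC", "ATATTATAGC", "TATATATACG",
    "GCGCGCGCAT", "GCGGCGCGAT", "CGCGCGCGTA",
]
labels = spectral_bipartition(seqs)
print(labels)
```

Which group receives label 0 and which receives label 1 is arbitrary, because an eigenvector is only defined up to sign. With more than two clusters, the usual approach is to run k-means on several of the leading eigenvectors instead of thresholding a single one.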