A Hough Transform Based Biclustering Algorithm for Gene Expression Data
View/ Open
File version
Accepted Manuscript (AM)
Author(s)
To, C
Nguyen, TT
Liew, AWC
Griffith University Author(s)
Year published
2014
Metadata
Show full item recordAbstract
In pattern classification, when the feature space is of high dimension or patterns are "similar" on a subset of features only, the traditional clustering methods do not have good performances. Biclustering is a class of methods that simultaneously group on two dimensions and has many applications to different fields, especially gene expression data analysis. Because of simultaneous classification on both rows and columns of a data matrix, the biclustering problem is inherently intractable and computationally complex. and oOne of the most complex models in biclustering problem is linear coherent model. So sSeveral biclustering ...
View more >In pattern classification, when the feature space is of high dimension or patterns are "similar" on a subset of features only, the traditional clustering methods do not have good performances. Biclustering is a class of methods that simultaneously group on two dimensions and has many applications to different fields, especially gene expression data analysis. Because of simultaneous classification on both rows and columns of a data matrix, the biclustering problem is inherently intractable and computationally complex. and oOne of the most complex models in biclustering problem is linear coherent model. So sSeveral biclustering algorithms based on this model have been proposed in recent years. However, none of them is able to perfectly recognize all linear patterns in a bicluster. In this work, we propose a novel algorithm based on Hough transform that can find all linear coherent patterns and apply it to gene expression data.
View less >
View more >In pattern classification, when the feature space is of high dimension or patterns are "similar" on a subset of features only, the traditional clustering methods do not have good performances. Biclustering is a class of methods that simultaneously group on two dimensions and has many applications to different fields, especially gene expression data analysis. Because of simultaneous classification on both rows and columns of a data matrix, the biclustering problem is inherently intractable and computationally complex. and oOne of the most complex models in biclustering problem is linear coherent model. So sSeveral biclustering algorithms based on this model have been proposed in recent years. However, none of them is able to perfectly recognize all linear patterns in a bicluster. In this work, we propose a novel algorithm based on Hough transform that can find all linear coherent patterns and apply it to gene expression data.
View less >
Conference Title
Communications in Computer and Information Science
Volume
481
Publisher URI
Copyright Statement
© 2014 Springer Berlin/Heidelberg. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. The original publication is available at www.springerlink.com
Subject
Pattern Recognition and Data Mining