ITTC Project


First Award: Identify Informative Genes for Cancer Classification

Project Award Date: 05-01-2004



Description

With the completion of the Human Genome Project and advances in microarray technology, it is now possible to explore the whole genome both systematically and comprehensively. Microarrays have been used extensively to screen gene expression and to uncover important clues about the role of genes and the underlying gene regulatory networks. The use of microarrays is rapidly generating large amounts of data (typically terabytes) that create both opportunities and challenging problems. Conventional methods are increasingly unable to deal with this volume of data. For example, when applied to cancer classification, microarray data overwhelm conventional machine learning algorithms because the number of samples is far smaller than the number of features (genes). A major challenge is the identification of informative genes for cancer classification from gene expression measurements. In fact, it has been demonstrated that only a small number of genes are relevant to a specific cancer classification problem. Identifying these relevant genes is important in numerous microarray-based applications such as drug discovery, early disease detection, and proper treatment guidance.
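The small-sample, many-genes setting described above can be illustrated with a minimal sketch. It is not the project's method: it uses synthetic data and a simple signal-to-noise filter score (difference of class means divided by the sum of class standard deviations) as a stand-in for a gene selection algorithm. The sample count, gene count, mean shift, and all function names are assumptions chosen for illustration.

```python
import random
from statistics import mean, stdev


def make_expression_data(n_samples=20, n_genes=200, n_informative=5,
                         shift=5.0, seed=0):
    """Synthetic data (assumption): far fewer samples than genes.

    Only the first `n_informative` genes differ between the two classes.
    """
    rng = random.Random(seed)
    labels = [i % 2 for i in range(n_samples)]
    data = [
        [rng.gauss(0.0, 1.0) + (shift if label == 1 and g < n_informative else 0.0)
         for g in range(n_genes)]
        for label in labels
    ]
    return data, labels


def signal_to_noise(data, labels, gene):
    """Filter score for one gene: |mean0 - mean1| / (sd0 + sd1)."""
    class0 = [row[gene] for row, label in zip(data, labels) if label == 0]
    class1 = [row[gene] for row, label in zip(data, labels) if label == 1]
    return abs(mean(class0) - mean(class1)) / (stdev(class0) + stdev(class1))


data, labels = make_expression_data()
scores = [signal_to_noise(data, labels, g) for g in range(len(data[0]))]

# Rank genes by score and keep the five highest-scoring ones.
top5 = sorted(range(len(scores)), key=scores.__getitem__, reverse=True)[:5]
print("top-ranked genes:", sorted(top5))
```

With a class difference this large, the ranking recovers the few informative genes even though there are ten times more genes than samples. Real expression data are much noisier, which is why the choice of selection algorithm matters.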

This project addresses the problem of identifying informative genes for cancer classification. The main objective of this work is to perform a preliminary investigation of a new margin- and genetic-algorithm-based feature selection algorithm. In addition, we will conduct a comprehensive comparative study of several fundamental gene selection algorithms in microarray-based cancer classification problems to assess their performance on different data sets on an equal footing.
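The project summary does not specify the new algorithm, so the following is only a generic sketch of how a genetic algorithm can search over gene subsets using a margin-style fitness. Everything here is an assumption made for illustration: the synthetic data, the nearest-centroid margin used as fitness, the size penalty, and the GA settings (tournament selection, uniform crossover, bit-flip mutation, two elites per generation).

```python
import random


def make_data(n_samples=20, n_genes=50, n_informative=3, shift=3.0, seed=1):
    """Synthetic two-class data (assumption); only the first genes carry signal."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n_samples)]
    X = [[rng.gauss(0.0, 1.0) + (shift if label and j < n_informative else 0.0)
          for j in range(n_genes)] for label in y]
    return X, y


def margin_fitness(mask, X, y, penalty=0.05):
    """Hypothetical fitness: mean nearest-centroid margin minus a size penalty."""
    genes = [j for j, bit in enumerate(mask) if bit]
    if not genes:
        return float("-inf")
    # Class centroids restricted to the selected genes (training data only).
    centroids = {}
    for c in (0, 1):
        rows = [x for x, label in zip(X, y) if label == c]
        centroids[c] = [sum(r[j] for r in rows) / len(rows) for j in genes]

    def dist(x, c):
        return sum((x[j] - m) ** 2 for j, m in zip(genes, centroids[c])) ** 0.5

    # Margin of a sample: distance to the other centroid minus distance to its own.
    margins = [dist(x, 1 - label) - dist(x, label) for x, label in zip(X, y)]
    return sum(margins) / len(margins) - penalty * len(genes)


def ga_select(X, y, pop_size=30, generations=40, p_mut=0.02, seed=0):
    rng = random.Random(seed)
    n_genes = len(X[0])
    # Sparse random start: each gene is included with probability 0.1.
    pop = [[int(rng.random() < 0.1) for _ in range(n_genes)]
           for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        scored = sorted(((margin_fitness(m, X, y), m) for m in pop),
                        key=lambda t: t[0], reverse=True)
        history.append(scored[0][0])
        nxt = [m for _, m in scored[:2]]  # elitism: keep the two best masks
        while len(nxt) < pop_size:
            # Tournament selection of two parents, three candidates each.
            a = max(rng.sample(scored, 3), key=lambda t: t[0])[1]
            b = max(rng.sample(scored, 3), key=lambda t: t[0])[1]
            # Uniform crossover followed by bit-flip mutation.
            child = [rng.choice(pair) ^ (rng.random() < p_mut)
                     for pair in zip(a, b)]
            nxt.append(child)
        pop = nxt
    best = max(pop, key=lambda m: margin_fitness(m, X, y))
    return best, history


X, y = make_data()
best_mask, history = ga_select(X, y)
selected = [j for j, bit in enumerate(best_mask) if bit]
print("selected genes:", selected)
print("best fitness, first and last generation:", history[0], history[-1])
```

Because the two best masks are carried over unchanged, the best fitness never decreases from one generation to the next. The fitness here is measured on the training samples only, so a fair comparison of selection algorithms would still need held-out data or cross-validation.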

The intellectual merit of this project includes major progress on the informative gene identification problem, which is a primary challenge in microarray data analysis; a better understanding of feature selection algorithms in small-sample problems; and potential solutions for choosing suitable gene selection algorithms for given problems. The new gene selection algorithm is expected to perform equally well on both training and test data in classification problems. When combined with support vector machines, the new algorithm will be able to classify data unseen during training, even when the training sample is small.

The broader impacts resulting from project activities include a robust method for extracting information from large data sets; the potential integration of the small number of identified genes into the cancer diagnosis process; applications to gene function discovery; and the integration of research activities into a new bioinformatics course, Machine Learning with Life Science Applications. The class will be offered to graduate and senior undergraduate students in EECS and to students in other departments, such as Biology, who are interested in bioinformatics.


Investigators

Faculty Investigator(s): Xue-wen Chen (PI)

Student Investigator(s): Mei Liu, Manjunath Narayana


Project Sponsors


Primary Sponsor(s): NSF and KTEC


Partner with ITTC

The Information and Telecommunication Technology Center at the University of Kansas has developed several assistance policies that enhance interactions between the Center and local, Kansas, or national companies. 

ITTC assistance includes initial free consulting (normally one to five hours). If additional support is needed, ITTC will offer one of the following approaches: 

Sponsored Research Agreement

Individuals and organizations can enter into agreements with KUCR/ITTC and provide funds for sponsored research to be performed at ITTC with the assistance of faculty, staff and students.

Licensing and Royalty/Equity Agreement

An ITTC goal is the development of investment-grade technologies for transfer to, and marketing by, local, Kansas, and national businesses. To enhance this process, the Center has developed flexible policies that allow for licensing, royalty, and equity arrangements to meet both the needs of ITTC and the company.

Commercialization Development

Companies with a technology need that can be satisfied with ITTC's resources can look to us for assistance. We can develop a relationship with interested partners that will provide for the development of a technology suited for commercialization.

ITTC Resource Access

ITTC resources, including computers and software systems, may be made available to Kansas companies in accordance with the Center's mission and applicable Regents and University policies.
