1.
Introduction
GenExplore is a
Microarray data analysis tool which can help you easily find out
relationships among different genes. Results can be validated by
experts or domain information, or suggest new experiments.
Related Link: http://www.cs.uncc.edu/~xwu/bio/GenExplore.html
2. Installation
2.0 Note
This instruction is
currently for redhat linux platform only, RH linux 8.0 or higher version
is recommanded. Moreover, you must use "administrator" or "root" as your login id,
because during installation u need rights to write to some
directory such as "/usr".
2.1 FLTK
-->Get FLTK
source code from ftp://ftp2.easysw.com/pub/fltk/1.1.3/fltk-1.1.3-source.zip
-->Note:
You can also download 1.1.4 or latest version and install it
-->use "unzip
fltk-1.1.3-source.zip" command under your own directory(e.g. "/usr/app/")
-->after unzip, type "cd fltk-1.1.3" to enter fltk directory.
-->type "./configure"
-->type "./make"
-->type "./make install"
2.2 Graphviz
-->get Graphviz
from http://www.graphviz.org/pub/graphviz/ARCHIVE/graphviz-1.10.tar.gz
-->Note:
You can also download latest version and install it
-->type "gzip -d
graphviz-1.10.tar.gz" command
-->type "tar -xf graphviz-1.10.tar" command
-->type "cd graphviz-1.10" to enter its directory
-->type "./configure"
-->type "./make"
-->type "./make install"
2.3 GenExplore
-->get GenExplore package
-->type"gunzip GenExplore0.98b.tar.gz"
-->type "tar -xvf
GenExplore0.98b.tar"
-->type "cd GenExplore0.98b"
-->type "./install.sh"
-->type "ln -b /usr/app/graphviz-1.10/dotneato/neato neato", here we
assume "/usr/app/graphviz-1.10/" is where your graphviz was installed,
you can replace that by using your own path. If you have already ran
"make install" command in Graphviz directory during 2.2, you do not need
to do this step.
2.4 apriori
-->an excutable
"apriori" program is included in GenExplore0.98b root directory. So no
further step needed.
-->you can also download the linux excutable file in http://fuzzy.cs.uni-magdeburg.de/~borgelt/src/apriori
-->you can also download the source file in http://fuzzy.cs.uni-magdeburg.de/~borgelt/src/apriori.zip
-->check this page for the latest news of apriori--http://fuzzy.cs.uni-magdeburg.de/~borgelt/doc/apriori/apriori.html
3. Execution
------------------------------
3.1 load program
-->under your GenExplore0.98b root directory, type "./GenExplore" to
start the program.
3.2 load data
file
-->After the GUI is loaded. Go to "File" menu to open a micro array
data file.
Note: data
file format
******Important!!!******
******Please make sure your have a data file with correct file
format:******
GenExplore now only
support "Format 1" array data file. (will support Format 2 in the next
version)
3.2.1
-->Format 1: support data files like "data_expts1-300_ratios.txt",
you could find this file in "http://www.rii.com/register/cell2000102Hughes/EULA.htm".
Note:The first row of this file is just
a comment, so before you load this file into GenExplore, please delete the first row
by using any text editor.
-->The first row of the data file should be:
"UNIQID NAME GWEIGHT sample1 sample2 ... sampleN"
-->from the second row, the data is like:
"YHR007c ERG11,CYP51 0.60 0.001 0.033 ..."
--> GenExplore
program will automatically get rid of NAME and GWEIGHT cols to proceed.
3.2.2
-->Format 2: support a simplified file format.
-->The first row
of the data file should be:
"UNIQID sample1 sample2 ... sampleN"
-->from the second row, the data is like:
"YHR007c 0.001 0.033 ..."
--> GenExplore
program will deal with this kind of data file in future.
3.3
Preprocess
3.3.1 Preprocess/
Apriori
-->after data file was loaded, go to "Preprocess" menu and choose
"Apriori", you must
perform this operation before you do any further
operation. During this step the raw data will be preprocessed (some temporary file
will be generated) for further operations. and frequent large item
set of the raw data will also be generated by "Apriori " program.
3.3.2 Preprocess
/ OpenGTRFile
--> Before you perform this operation, you need use some clustering
software, such as http://rana.lbl.gov/downloads/Cluster/Cluster_vers_2_20.exe, to generate hiearchical
cluster(*.GTR) file based on the macroarray data you opened. Related
software can be found in http://genome-www.stanford.edu/software/
-->A demo.gtr
file was provided with the GenExplore package, it was generated from data_expts1-300_ratios.txt, before
load this file. Please perform 3.3.1 first.
--> After you
perform this operation, a dialog box will pop up for you to select GTR
file to open.
--> After file is
selected, a new window with menu will pop up.
--> From the menu
of this window you can choose use "new" or "old" method to show
hierarchical graph of the data.
3.4 Analysis
3.4.1 Analysis /
Graphic Gauss Model
-->after you performed 3.3.1, you can see some clusters are listed
in the clusters list. However, you can not see the final result graph
unless you perform Graphic Gaussian Model analysis. After analysis,
select one cluster from the list then you can see the picture of that
cluster. Moreover, you can Add genes by using the "Add Gene"
button.
3.4.2 Analysis /
Loglinear Model
-->after you performed 3.4.1, you can pick up any cluster in the
clusters list. And the member genes of that cluster will shown in the
Cluster Member browser. Then you can perform Loglinear Model
analysis by press the "Loglinear MD" button or click menu item
"Analysis/Loglinear Model". The program will analysis current highlighted
cluster in the cluster list.
In the Cluster
Member browser, each member gene has a check box. You can also
select several genes by click their check box, and you must
press "New Cluster" to generate a new cluster which only include
the genes you selected, then you can perform loglinear model
analysis for this new cluster.
3.5 Help / About
-->The help document is under developing now. I think by the next
revision, I can finish it and add it to this program. It will be a HTML
style document.
4. bug report
------------------------------
If you find any bug
in this program, please send the detail info to me, really
appreciate your help. My email address is yye@uncc.edu.
I found that while
install this program on RH linux 7.x platform, sometimes it can not
compile successfully. However RH 8.0/9.0 looks fine for this
program, so if you meet any trouble during installation, try a
higher version of linux or try the latest GCC or G++, GCC 3.2 or higher
version is recommended.
GenExplore is
developed and maintainded by Yong Ye
& Dr. Xintao Wu &
Dr. K.R. Subramnian. Matt Miller
contributed on hierarchical cluster visualization part. |