Table 2: Details on the chosen set of clusters for each (benchmark + additional) data set.
(a) Name of data set; (b) similarity parameter and (c) inflation value parameter used to generate these set of clusters; (d) number of clusters in the chosen set and (e) the number of subsequences in the largest cluster.
Gene (a) |
P (b) |
I (c) |
# clusters (d) |
# el largest cluster (e)
|
cfos |
0 |
4 |
12 |
5
|
hoxb2 |
-10 |
4 |
4 |
6
|
pax6 |
0 |
4 |
20 |
6
|
scl |
-10 |
4 |
11 |
4
|
EGR3 |
0 |
4 |
11 |
8
|
GSH1 |
-10 |
4 |
12 |
4
|
HIV-EP1 |
0 |
4 |
13 |
6
|
HOXB5 |
0 |
4 |
1 |
4
|
MEIS2 |
0 |
4 |
24 |
6
|
PCHD8 |
0 |
4 |
14 |
4
|