| Description | Number of Classes
* |
Number of Attributes |
Training |
Test Set ** |
Files |
||||||||
| Categ. | Contin. | Ignore | Instances |
Instances |
nam |
dat |
tst |
inf |
inf2 |
zip |
|||
| agaricus_lepiota | Mushroom Data | 2 (52%) |
22 |
8124 (2480) |
y |
y |
y |
y |
|||||
| australian_credit | Australian Credit Data | 2 (56%) |
8 |
6 |
690 |
y |
y |
y |
|||||
| bcst96 | Webpage Classification | 2 (54%) |
13430 |
1186 |
509 |
y |
y |
y |
|||||
| breast-cancer | Breast Cancer Data | 2 (66%) |
9 |
1 |
699 (16) |
y |
y |
y |
|||||
| cancer | Cancer Data | 3 (40%) |
100 |
100 |
y |
y |
|||||||
| chess | Chess Endgame | 2 (95%) |
7 |
647 |
y |
y |
|||||||
| colonTumor | Colon Tumour | 2 (65%) |
2000 |
62 |
y |
y |
|||||||
| contact_lenses | Contact Lenses | 3 (88%) |
5 |
108 |
y |
y |
|||||||
| crx | Credit Card Applications | 2 (56%) |
9 |
6 |
690 (37) |
200 (12) |
y |
y |
y |
||||
| degrees | Degree Classification | 2 (77%) |
5 |
26 |
y |
y |
|||||||
| diabetes | Diabetes Data | 2 (65%) |
8 |
768 |
y |
y |
y |
||||||
| ecoli | E-coli | 8 (43%) |
7 |
1 |
336 |
y |
y |
y |
|||||
| games | Sporting Preferences | 2 (58%) |
4 |
12 |
y |
y |
|||||||
| genetics | Genetics | 3 (52%) |
60 |
3190 |
y |
y |
|||||||
| glass | Types of glass | 6 (36%) |
9 |
1 |
214 |
y |
y |
y |
y |
||||
| golf | Decision whether to play | 2 (64%) |
2 |
2 |
14 |
y |
y |
||||||
| hepatitis | Hepatitis Data | 2 (79%) |
13 |
6 |
155 (75) |
y |
y |
y |
|||||
| hypo | Hypothyroid Data | 5 (92%) |
22 |
7 |
2514 (2514) |
1258 (1258) |
y |
y |
y |
||||
| iris | Iris Data | 3 (33%) |
4 |
150 |
y |
y |
y |
||||||
| labor-ne | Labour Negotiations | 2 (65%) |
8 |
8 |
40 (39) |
17 (17) |
y |
y |
y |
||||
| lens24 | Contact Lenses (reduced version) | 3 (63%) |
4 |
24 |
y |
y |
|||||||
| leukaemia | Leukaemia | 2 (71%) |
7129 |
38 |
34 |
y |
y |
y |
|||||
| monk1 | Monk's Problem 1 | 2 (50%) |
6 |
124 |
432 |
y |
y |
y |
|||||
| monk2 | Monk's Problem 2 | 2 (62%) |
6 |
169 |
432 |
y |
y |
y |
|||||
| monk3 | Monk's Problem 3 | 2 (51%) |
6 |
122 |
432 |
y |
y |
y |
|||||
| pendigits | Handwriting Recognition | 10 (11%) |
16 |
1200 (1200) |
400 (400) |
y |
y |
y |
|||||
| pima-indians | Pima Indians Data | 2 (65%) |
8 |
768 |
y |
y |
y |
||||||
| play | Play Data | 4 (53%) |
3 |
2 |
30 |
y |
y |
||||||
| segmentation | Segmentation | 7 (all equal%) |
19 |
210 |
2100 |
y |
y |
y |
y |
||||
| sick-euthyroid | Sick Euthyroid | 2 (91%) |
18 |
7 |
3163 (3161) |
y |
y |
||||||
| soybean | Soybean Data | 19 (13%) |
35 |
683 (121) |
y |
y |
|||||||
| vote | Voting Records | 2 (61%) |
16 |
300 |
135 |
y |
y |
y |
|||||
| wake_vortex | Air Traffic Control Data | 2 (50%) |
3 |
1 |
53 |
1714 |
y |
y |
|||||
| wake_vortex_full | Air Traffic Control Data (full) | 2 (50%) |
19 |
32 |
6 |
1714 (3) |
y |
y |
|||||
| yeast | Yeast | 10 (31%) |
8 |
1484 |
y |
y |
y |
||||||
| zoo | Zoo Data | 7 (41%) |
16 |
1 |
101 |
y |
y |
y |
|||||
* with % size of majority class in training set given in
parentheses
** with number of instances having at least one missing value in parentheses
(if non-zero)