Trouble loading a big data set
Posted: Thu May 12, 2016 3:26 pm
Hi all, I have a dataset with 10000 features (terms) whose values are tf-idf weights.
When I try to load the data (which is in C4.5 format), I get the following error:
data = orange.ExampleTable(file)
SystemError: C45ExampleGenerator: line 1 of file '../features/pubmed.data' too long
So I reduced the feature space to 5000 by applying a term frequency threshold (>2).
But it doesn't work either.
Does Orange have a problem with 5000 features, and if so, what is its limit?
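In case it helps with diagnosis: the message complains about the length of line 1, so the actual line lengths in the file are probably worth checking. Below is a minimal sketch in plain Python (no Orange calls) that reports the longest line in a file. The helper name longest_line is my own, and the idea that the loader has a fixed line-length limit is only an assumption based on the error text.

```python
def longest_line(path):
    """Return (line_number, length_in_bytes) of the longest line in a file.

    Hypothetical diagnostic helper; it is not part of Orange.
    Line endings are not counted in the length.
    """
    best_no, best_len = 0, 0
    # Read in binary mode so the length is measured in bytes, not characters.
    with open(path, "rb") as f:
        for no, line in enumerate(f, start=1):
            n = len(line.rstrip(b"\r\n"))
            if n > best_len:
                best_no, best_len = no, n
    return best_no, best_len
```

Calling longest_line('../features/pubmed.data') before and after the feature reduction would show how much shorter the lines actually became.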
Best regards,