Error while fitting the model #52

sachinlodhi · 2024-04-03T18:33:37Z

Following is the dataframe:

and following is the additional code:

df.rename(columns={'result': 'Decision'}, inplace=True)

Output:

Index(['Date', 'Country', 'League', 'Season', 'HomeTeam', 'AwayTeam',
       'home_goal', 'away_goal', 'Decision'],
      dtype='object')

config = {"algorithm" : "C4.5"}
model = chef.fit(df, config,  target_label = "Decision")

I am getting the following error:

[INFO]:  4 CPU cores will be allocated in parallel running
C4.5  tree is going to be built...
---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
/tmp/ipykernel_22413/130574452.py in ?()
----> 1 model = chef.fit(df, config,  target_label = "Decision")

~/anaconda3/envs/rover/lib/python3.10/site-packages/chefboost/Chefboost.py in ?(df, config, target_label, validation_df)
    209                 if enableParallelism == True:
    210                         json_file = "outputs/rules/rules.json"
    211                         functions.createFile(json_file, "[\n")
    212 
--> 213 		trees = Training.buildDecisionTree(df, root = root, file = file, config = config
    214                                 , dataset_features = dataset_features
    215 				, parent_level = 0, leaf_id = 0, parents = 'root', validation_df = validation_df, main_process_id = process_id)
    216 

~/anaconda3/envs/rover/lib/python3.10/site-packages/chefboost/training/Training.py in ?(df, root, file, config, dataset_features, parent_level, leaf_id, parents, tree_id, validation_df, main_process_id)
    432                 pivot = pd.DataFrame(subdataset.Decision.value_counts()).reset_index()
    433                 pivot = pivot.rename(columns = {"Decision": "Instances","index": "Decision"})
    434                 pivot = pivot.sort_values(by = ["Instances"], ascending = False).reset_index()
    435 
--> 436                 else_decision = "return '%s'" % (pivot.iloc[0].Decision)
    437 
    438                 if enableParallelism != True:
    439                         functions.storeRule(file,(functions.formatRule(root), "else:"))

~/anaconda3/envs/rover/lib/python3.10/site-packages/pandas/core/generic.py in ?(self, name)
   6200             and name not in self._accessors
   6201             and self._info_axis._can_hold_identifiers_and_holds_name(name)
   6202         ):
   6203             return self[name]
-> 6204         return object.__getattribute__(self, name)

AttributeError: 'Series' object has no attribute 'Decision'

Even if I do not rename the column, it makes no difference. It always throws this error.
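
A note on the likely cause, offered as an assumption because the thread does not state the installed pandas version: starting with pandas 2.0, `value_counts()` names its result `count` and keeps the original column name on the index. With that behavior, the rename on line 433 of the traceback no longer produces a `Decision` column, so `pivot.iloc[0].Decision` fails with the error shown. The sketch below reproduces the pattern on a made-up toy frame and shows one version-independent way to build the same table. It is not necessarily the fix that chefboost itself adopted.

```python
import pandas as pd

# Toy stand-in for the subdataset in Training.py (values are made up).
subdataset = pd.DataFrame({"Decision": ["H", "A", "H", "D", "H"]})
counts = subdataset["Decision"].value_counts()

# Pattern from the traceback (Training.py, lines 432-433).
pivot = pd.DataFrame(counts).reset_index()
pivot = pivot.rename(columns={"Decision": "Instances", "index": "Decision"})

# False on pandas >= 2.0, True on older releases.
legacy_has_decision = "Decision" in pivot.columns
print("pandas", pd.__version__, "-> legacy pivot has 'Decision':", legacy_has_decision)

# Version-independent alternative: name the index and the counts explicitly.
robust = counts.rename_axis("Decision").reset_index(name="Instances")
robust = robust.sort_values(by="Instances", ascending=False).reset_index(drop=True)

# Same expression as line 436, using item access instead of attribute access.
else_decision = "return '%s'" % robust.iloc[0]["Decision"]
print(else_decision)
```

If the first print reports `False`, the installed chefboost release predates this pandas change, which would match the maintainer's later remark that the pip package is old.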

serengil · 2024-04-05T10:07:44Z

What about this?

model = chef.fit(df, config,  target_label = "result")

sachinlodhi · 2024-04-06T23:55:20Z

> What about this?
>
> model = chef.fit(df, config,  target_label = "result")

I still see the same error.

VPK02 · 2024-04-12T05:42:34Z

I also got the same error. Please help solve it.

sachinlodhi · 2024-04-16T20:41:24Z

I got it working, but not consistently: sometimes it works and sometimes it doesn't. These are the steps I took:

1. Clone the repository.
2. Create a virtual environment, or activate an existing one.
3. Install the requirements.
4. Run the script.
5. If you still get the error: because chefboost was installed from the clone, the package lives in the cloned directory. While backtracking the calls, I added print("some random text") in Chefboost.py at lines 274 and 155, and in Training.py at line 395.

I know this sounds ridiculous, but once I started printing status messages or random text, it started working, and I do not know why. It is still not consistent; sometimes it fails.

serengil · 2024-04-25T09:35:49Z

The package on pip is old; I need to publish the recent changes. Meanwhile, you can pull the source code and run it instead of the pip package.

git clone https://github.com/serengil/chefboost.git
cd chefboost
pip install -e .
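
After an editable install it can help to confirm which copy of the package Python actually imports, since an older pip-installed copy elsewhere in the environment could still take precedence. The sketch below uses only the standard library; `describe_install` is a hypothetical helper name, not part of chefboost.

```python
import importlib.util
from importlib import metadata


def describe_install(package):
    """Return where a package would be imported from, or None if it is not importable."""
    spec = importlib.util.find_spec(package)
    if spec is None:
        return None
    try:
        version = metadata.version(package)
    except metadata.PackageNotFoundError:
        # Importable, but no installed distribution metadata under this name.
        version = "unknown"
    return {"location": spec.origin, "version": version}


# For an editable install, the location should point into the cloned directory.
print(describe_install("chefboost"))
```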

sachinlodhi · 2024-04-26T03:26:37Z

> package on pip is old. i need to publish the recent changes. meanwhile, you can pull the source code and run it instead of pip package.
>
> git clone https://github.com/serengil/chefboost.git
> cd chefboost
> pip install -e .

Yes, I tried that. Sometimes it works, but sometimes everything freezes; I have to stop the cell and run it again before it gives any output. It freezes on large dataframes, for example around 20k rows with 10-15 features.

serengil · 2024-04-26T09:00:20Z

Would you please try disabling parallelism, like this:

config = {'algorithm': 'C4.5', 'enableParallelism': False}
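
A minimal sketch of the suggestion above in context, assuming the target column is named `Decision` as in the original post. `build_config` and `fit_sequential` are hypothetical helper names; chefboost is imported inside the function so the snippet still loads in an environment where the package is missing.

```python
def build_config(algorithm="C4.5", parallel=False):
    """Build a chefboost config dict; parallel=False disables multiprocessing."""
    return {"algorithm": algorithm, "enableParallelism": parallel}


def fit_sequential(df, target_label="Decision"):
    """Fit a tree in a single process (requires chefboost to be installed)."""
    from chefboost import Chefboost as chef  # lazy import, see note above

    return chef.fit(df, build_config(), target_label=target_label)


config = build_config()
print(config)
```

Calling `fit_sequential(df)` on the dataframe from the first post would then run the same `chef.fit` call as before, only without worker processes, which may also avoid the freezes reported on larger frames.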

sachinlodhi changed the title ~~Erro while fitting the model~~ Error while fitting the model Apr 3, 2024

serengil added the bug label Apr 5, 2024

serengil mentioned this issue Apr 25, 2024

'Series' object has no attribute 'Decision' #54

Closed
