Fit transform #145

tomlincr · 2023-08-10T17:20:29Z

Added .fit_transform method to all node embedding algorithms, primarily motivated by desire to use karateclub algorithms in a scikit-learn pipeline.

Adds:

y=None argument, for scikit-learn compatibility
Passthrough if y is not None to allow passing e.g. node attributes through for a downstream task in the pipeline

Tests:

Method tested for each algorithm
Generally testing that output matches that of .get_embedding()
Unless stochastic method, when testing that shapes match

This reverts commit c039fbb. Attributed node embeddings Need different fit_transform method that can account for features

stochastic therefore check shape

tomlincr · 2023-08-10T17:25:00Z

Apologies, long day and thought I'd opened this PR on my fork to test coverage, CI etc.

tomlincr · 2023-08-10T17:40:04Z

Interesting, all passes locally.
Seems to be some variation in the embeddings generated by multiple fits when run by actions.
Will test shape matches instead for these offenders

codecov-commenter · 2023-08-10T17:49:09Z

Codecov Report

Merging #145 (c7ceb75) into master (d750b33) will increase coverage by 0.12%.
The diff coverage is 100.00%.

❗ Your organization is not using the GitHub App Integration. As a result you may experience degraded service beginning May 15th. Please install the Github App Integration for your organization. Read more.

@@            Coverage Diff             @@
##           master     #145      +/-   ##
==========================================
+ Coverage   97.41%   97.53%   +0.12%     
==========================================
  Files          63       63              
  Lines        2707     2845     +138     
==========================================
+ Hits         2637     2775     +138     
  Misses         70       70

Files Changed	Coverage Δ
karateclub/estimator.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/ae.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/asne.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/bane.py	`100.00% <100.00%> (ø)`
...arateclub/node_embedding/attributed/feathernode.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/fscnmf.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/musae.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/sine.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/tadw.py	`100.00% <100.00%> (ø)`
karateclub/node_embedding/attributed/tene.py	`100.00% <100.00%> (ø)`
... and 18 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

NB NEU is a meta model!

Add set_params to Estimator base class

LucaCappelletti94 · 2024-02-22T17:29:59Z

I have tried to run the test suite of this pull request, but it is currently failing at the HOPE model test. I see that you are comparing the two embeddings - maybe there are numerical instabilities that lead to different results over different runs? I am not familiar with the internals of numpy & scipy that much.

tomlincr added 22 commits August 10, 2023 15:01

gitignore VScode

be2ee12

gitignore Mac clutter

9c91571

add fit_transform & fit_predict

aea8c2a

add fit_transform

19257f5

y=None consistency

9f338aa

docs

272e52a

add deep=True argument to get_params

af71588

add y passthrough

cfe8a30

test fit_transform

1ba2a46

add fit_transform to all neighbourhood methods

c078a43

fix self.embedding

4b779f8

assert equal shape for stochastic methods

7d49703

list -> array

782bdbd

structural fit_transform + tests

d8fd406

meta fit_transform

7272e25

meta tests

95d1423

attributed fit_transform

c039fbb

Revert "attributed fit_transform"

82de7de

This reverts commit c039fbb. Attributed node embeddings Need different fit_transform method that can account for features

attributed POC

99dcb9f

attributed fit_transform method

b7e8148

attributed fit_transform test

276e614

fix sine test

edaadd1

stochastic therefore check shape

try assert equal shape for fails

bae0046

tomlincr added 4 commits August 11, 2023 07:52

test y arg of fit_transform

877f66f

test y arg for fit_transform

8cdf885

test y arg for fit_transform neighbourhood

a3c89d8

test y arg for fit_transform attributed

8092ed0

tomlincr and others added 7 commits August 11, 2023 08:32

add model to fit_transform

3b7052e

NB NEU is a meta model!

test META model, not nested estimator

c7ceb75

add VSCode + Mac to gitignore

45e12b5

add set_params to estimator

b327689

test set_params

45e833e

Merge pull request #3 from tomlincr/set_params

90bc37b

Add set_params to Estimator base class

Merge branch 'dev' into fit_transform

716a796

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fit transform #145

Fit transform #145

tomlincr commented Aug 10, 2023 •

edited

tomlincr commented Aug 10, 2023

tomlincr commented Aug 10, 2023

codecov-commenter commented Aug 10, 2023 •

edited

LucaCappelletti94 commented Feb 22, 2024

Fit transform #145

Are you sure you want to change the base?

Fit transform #145

Conversation

tomlincr commented Aug 10, 2023 • edited

tomlincr commented Aug 10, 2023

tomlincr commented Aug 10, 2023

codecov-commenter commented Aug 10, 2023 • edited

Codecov Report

LucaCappelletti94 commented Feb 22, 2024

tomlincr commented Aug 10, 2023 •

edited

codecov-commenter commented Aug 10, 2023 •

edited