Concoct error #108

Closed
AroneyS opened this issue Aug 7, 2023 · 4 comments

Comments

@AroneyS
Collaborator

AroneyS commented Aug 7, 2023

Concoct failed for ~20 single-sample recovery runs with the following error. Is it something to do with contig names? I had to rerun concoct manually to see the error, since stderr is sent to /dev/null and this error, at least, wasn't in concoct's log.txt.

Traceback (most recent call last):
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/bin/concoct", line 90, in <module>
    results = main(args)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/bin/concoct", line 37, in main
    transform_filter, pca = perform_pca(
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/concoct/transform.py", line 5, in perform_pca
    pca_object = PCA(n_components=nc, random_state=seed).fit(d)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/base.py", line 1151, in wrapper
    return fit_method(estimator, *args, **kwargs)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/decomposition/_pca.py", line 434, in fit
    self._fit(X)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/decomposition/_pca.py", line 483, in _fit
    X = self._validate_data(
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/base.py", line 579, in _validate_data
    self._check_feature_names(X, reset=reset)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/base.py", line 440, in _check_feature_names
    feature_names_in = _get_feature_names(X)
  File "/home/aroneys/m/users/aroneys/.conda/envs/00d91568b98b8a6967e97dcf18eb70ed_/lib/python3.10/site-packages/sklearn/utils/validation.py", line 2021, in _get_feature_names
    raise TypeError(
TypeError: Feature names are only supported if all input features have string names, but your input has ['int', 'str'] as feature name / column name types. If you want feature names to be stored and validated, you must convert them all to strings, by using X.columns = X.columns.astype(str) for example. Otherwise you can remove feature / column names from your input data, or convert them all to a non-string data type.
@rhysnewell
Owner

Have you checked the contig names that get produced in the concoct coverage table? Are some of them appearing as just raw numbers without any extra fluff?

Btw, although it's annoying, concoct's stderr gets piped to /dev/null because it is really verbose and would start causing severe runtime issues if it were left to keep chirping.
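
As a possible middle ground between discarding stderr and storing all of it, a wrapper could keep only the last few lines of a noisy tool's stderr. The sketch below is illustrative only and is not how this project invokes concoct: `run_keep_stderr_tail` is a hypothetical helper, and the noisy child process is a stand-in for the real command.

```python
import collections
import subprocess
import sys


def run_keep_stderr_tail(cmd, max_lines=50):
    """Run cmd, discard stdout, and keep only the last max_lines of stderr.

    Hypothetical helper: memory use stays bounded no matter how verbose
    the child process is, but the final error lines survive.
    """
    tail = collections.deque(maxlen=max_lines)
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.PIPE,
        text=True,
    )
    # Reading line by line drains the pipe so the child never blocks on it.
    for line in proc.stderr:
        tail.append(line.rstrip("\n"))
    returncode = proc.wait()
    return returncode, list(tail)


# Stand-in for a verbose tool: 1000 lines of chatter, then a failure exit code.
noisy_cmd = [
    sys.executable,
    "-c",
    "import sys\n"
    "for i in range(1000):\n"
    "    print('chatter', i, file=sys.stderr)\n"
    "sys.exit(1)",
]

code, tail = run_keep_stderr_tail(noisy_cmd, max_lines=3)
if code != 0:
    print("command failed; last stderr lines:")
    print("\n".join(tail))
```

With a traceback like the one above, the final lines are usually the informative ones, so a short tail would likely have been enough to surface the `TypeError`.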

@AroneyS
Collaborator Author

AroneyS commented Aug 7, 2023

They are all things like: NODE_3536854_length_56_cov_1.000000.concoct_part_0

@AroneyS
Collaborator Author

AroneyS commented Aug 8, 2023

Issue with sklearn dependency version: BinPro/CONCOCT#323 (comment)
Fix for bioconda: bioconda/bioconda-recipes#42187
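
Note that the `TypeError` refers to feature (column) names, not to the contig names that index the rows, which is why the contig names looked fine. The following is a minimal pure-Python sketch of the kind of check the error message describes, plus the string conversion it suggests. It does not use scikit-learn or reproduce its implementation, and the column labels are hypothetical (integer indices mixed with one string label), chosen only to mirror the `['int', 'str']` in the message.

```python
def label_type_names(labels):
    """Return the sorted type names found among a table's column labels."""
    return sorted({type(label).__name__ for label in labels})


def stringify_labels(labels):
    """Convert every column label to str, as the error message suggests."""
    return [str(label) for label in labels]


# Hypothetical column labels: integer indices mixed with a string name.
columns = [0, 1, 2, "cov_mean_sample_0"]

print(label_type_names(columns))  # ['int', 'str']

fixed = stringify_labels(columns)
print(label_type_names(fixed))    # ['str']
```

On a pandas DataFrame, the equivalent one-liner is the one quoted in the error message itself: `X.columns = X.columns.astype(str)`.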

@rhysnewell
Owner

See #107 for implemented fix

@AroneyS AroneyS closed this as completed Aug 11, 2023