
Finetune Problem #235
Open · ChinnYu opened this issue Dec 5, 2023 · 12 comments

Comments

ChinnYu commented Dec 5, 2023

I wanted to extend my sincere appreciation for your project. As a devoted fan of your work, I have thoroughly enjoyed being part of this journey. However, I have recently encountered an issue with the latest version. When I click "Run Filter" during code finetuning, it displays a failure message. When I click "Run Finetune" instead, the process is reported as completed successfully and the code is even marked as accepted, yet the finetune run is interrupted immediately. This problem did not exist in the previous version.
[three screenshots attached: MicrosoftTeams-image (9), MicrosoftTeams-image (10), and one more]

I understand that software development can be complex, and unforeseen issues may arise. Therefore, I kindly request your assistance in resolving this matter. If there are any potential solutions or suggestions you could provide to prevent the interruption during the run finetune process, it would be greatly appreciated.

Once again, thank you for your dedication and hard work. I eagerly anticipate future updates and improvements to this remarkable project.

Thank you sincerely,

@hazratisulton

Hello, @ChinnYu!
Can you provide logs?
Are you using the release Docker image, or building from source?
What message do you see when you click "Run Filter"?


ChinnYu commented Dec 9, 2023

Hello, @hazratisulton. I built the image from the source code using the provided Dockerfile. The built image throws an error when I press "Run Filter".
[screenshot attached]

@JegernOUTT (Member)

@hazratisulton have you managed to reproduce it?

@hazratisulton

@hazratisulton have you managed to reproduce it?

No, I couldn't. I asked @mitya52 to take a look; maybe he can offer some ideas.


ChinnYu commented Dec 28, 2023

Hi @hazratisulton @JegernOUTT, I attempted to build the image from the latest source code (12/28) on the 'dev' branch, and the same issue persists. If you need a specific log for analysis, please let me know and I'll provide it.

[screenshot attached]


ChinnYu commented Jan 2, 2024

After numerous code modifications, I discovered that renaming 'aux' to '_aux' allows Python to locate the module. But this surfaces two further issues: 1. an 'Index out of bounds' error occurs when pressing 'Run Filter' with 'codellama' selected; 2. 'AssertionError: You have to have more files to process than processes' is raised at the start of finetune, even though my file count does exceed the process count. The first issue is resolved by pinning transformers==4.34.0; for the second, the file-allocation rules need to be modified.

[two screenshots attached]
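The second failure mode can be illustrated with a minimal sketch of a multiprocessing file-sharding step. This is not Refact's actual code — the function and variable names are hypothetical, and only the assertion message is taken from the error above. A strict allocator asserts when workers outnumber files; a more tolerant variant simply caps the worker count.

```python
from typing import List

def allocate_files(files: List[str], num_processes: int) -> List[List[str]]:
    # Strict variant: with fewer files than processes, some workers would
    # receive an empty shard, so the allocator refuses to proceed.
    assert len(files) > num_processes, \
        "You have to have more files to process than processes"
    # Round-robin sharding: worker i gets files i, i+n, i+2n, ...
    return [files[i::num_processes] for i in range(num_processes)]

def allocate_files_capped(files: List[str], num_processes: int) -> List[List[str]]:
    # Tolerant variant: never spawn more workers than there are files.
    n = max(1, min(num_processes, len(files)))
    return [files[i::n] for i in range(n)]
```

With three files and two processes the strict allocator yields `[["a", "c"], ["b"]]`; with one file and four requested processes it raises the AssertionError, while the capped variant falls back to a single shard.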

@olegklimov (Contributor)

Interesting! But it works in nightly without any changes 🤔 Let's ask what @JegernOUTT and @mitya52 think.


ChinnYu commented Jan 10, 2024

Hi, I'd like to ask another question. I'm attempting to integrate 'deepseek-ai/deepseek-coder-33b-base' into Refact. I added the 33B model to these two files: 'refact/known_models_db/refact_known_models/huggingface.py' and 'self_hosting_machinery/finetune/configuration/supported_models.py', following the same pattern as 'deepseek-coder/5.7b/mqa-base', and I also added it to 'known_models' in 'refact-lsp'. However, Visual Studio Code (VS Code) keeps reporting the following errors. I have checked the main page and confirmed that the model initializes successfully. Do the experts have any debugging suggestions for this issue?
[screenshot attached]

Additionally, there is a warning as shown in the second image. What should be configured in this case? Thank you.

[screenshot attached]

@olegklimov (Contributor)

@ChinnYu awesome that you are trying this! You might need a change in refact-lsp: just add the model there by analogy with the other models.

There was an idea to let people try new models via a "works like this other known model" setting. But it turned out to be impractical: the best settings are no settings, because a setting gets outdated, needs tech support to remove once it exists, requires server-side changes, and so on. Then again, maybe we should revisit the idea, since it would allow trying a model quickly without recompiling the lsp.

@JegernOUTT (Member)

 I discovered that changing 'aux' to '_aux' allows locating the Python module

That's interesting; we have never had such import names. Check this out:
https://github.com/smallcloudai/refact/blob/main/self_hosting_machinery/finetune/scripts/finetune_filter.py#L16
Are you sure you haven't changed them yourself accidentally?


ChinnYu commented Feb 6, 2024

Hi @JegernOUTT, I obtained the GitHub code by downloading the zip file and extracted it with 7-Zip, since the built-in Windows zip tool cannot decompress it. I've tried several times, but extraction keeps producing '_aux', as shown in the figure below.

[screenshot attached]
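The '_aux' renaming is consistent with Windows reserving legacy DOS device names (CON, PRN, AUX, NUL, COM1–COM9, LPT1–LPT9) for any path component, regardless of case or file extension, which is presumably why the extraction tool substitutes '_aux'. A small sketch (a hypothetical helper, not part of Refact or 7-Zip) that flags such path components:

```python
# Legacy DOS device names reserved by Windows in every directory.
RESERVED = {"CON", "PRN", "AUX", "NUL",
            *(f"COM{i}" for i in range(1, 10)),
            *(f"LPT{i}" for i in range(1, 10))}

def has_reserved_component(path: str) -> bool:
    # A component is reserved regardless of case or extension,
    # so "aux", "AUX", and "aux.py" are all problematic on Windows.
    for part in path.replace("\\", "/").split("/"):
        stem = part.split(".")[0].upper()
        if stem in RESERVED:
            return True
    return False
```

For example, `has_reserved_component("finetune/aux/utils.py")` is true, while the renamed `"finetune/_aux/utils.py"` passes the check, which matches the behavior observed above.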

@JegernOUTT (Member)

@ChinnYu
Sorry for the delay, I completely forgot about this.
Yes, I see the problem now: it comes from legacy Windows folder-naming limitations ('aux' is a reserved device name on Windows).
We'll rename those folders in the next release; sorry for the inconvenience.

4 participants