-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some questions about the Hugging Face Models. #3838
Comments
At this point, I think generation cannot be done. But MLM can be done. In the initialization of hugging-face model, along with model and tokenizer, I would suggest also to pass model = HuggingFaceModel(mode=model, task='mlm', tokenizer=tokenizer) |
Yeah ! as far as i could only mlm task can be done. But it works.
However, I would like to know if is also possible to do some fine-tuning in terms of certain types of sequences. So the filling task will use a specific fine tune model. Seems like this models requires a lot of RAM to retrain right ? I try this in colab and died :(
|
RAM usage is definitely challenging for these models. I'd say at least 16GB of RAM to train a model. I would love to see if we can work out how to do completion with ChemBERTa though since that would be very useful for conditional generation |
Yeah, some hardware requirement must be satisfied for finetuning. About 1/2 A100 can be used to finetune a Large language model with 7 billion parameters. In my research, at least 16GM for prediction of protein structures with more than total 1400 length for Alphafold-multimer. |
Hi !
I have a couple of questions related to the HUgging Face models.
Base on the task avaialbe in the deepchem wrapper. Is not possible to perform task like text-generation of this kind of examples ?
with the output
So if its not possible to use text generation task maybe is it possible to use masked language modeling with the paremeter mlm.
In this kind of task we will predict the mask in the sequence.
however I am getting an error :
Is it related to the empty task in the deepchem dataset ?
The text was updated successfully, but these errors were encountered: