
Requesting support for IBM's OpenSource Granite models #441

Open
q5sys opened this issue May 9, 2024 · 2 comments
Labels: currently fixing (Am fixing now!)

Comments


q5sys commented May 9, 2024

These open source models were just released yesterday at Red Hat Summit.
https://huggingface.co/ibm-granite
https://arxiv.org/abs/2405.04324

If this ends up being a bigger ask than I think it is, and there's something I can do to help make this happen, let me know.

@danielhanchen
Contributor

Oh interesting!

@danielhanchen added the "currently fixing (Am fixing now!)" label May 9, 2024

junzzhu commented May 26, 2024

Fine-tuning works for both ibm-granite/granite-3b-code-instruct and ibm-granite/granite-8b-code-base, as far as I checked with the Llama3 Colab notebook, with training loss decreasing as expected. However, inference output is still useless for both.

```
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
Continue the fibonnaci sequence.

### Input:
1, 1, 2, 3, 5, 8

### Response:
1#<fim_prefix>A
# str
 growth
 for
 for
 for
 for
  `
  `
 `
 ` ` ` ` ` ` ` ` ` ` 9\ `<fim_prefix><fim_prefix><fim_prefix><fim_prefix>
```
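For reference, the prompt above follows the Alpaca-style template from Unsloth's Llama3 Colab notebook. Below is a minimal sketch of how that prompt string is assembled; `ALPACA_TEMPLATE` and `format_prompt` are illustrative names, not part of any library, and this assumes the notebook's template text as shown in the comment above.

```python
# Hypothetical sketch of the Alpaca-style prompt construction used in the
# Unsloth Llama3 Colab notebook; names here are illustrative assumptions.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n{response}"
)

def format_prompt(instruction: str, input_text: str = "", response: str = "") -> str:
    """Fill the template; leave `response` empty at inference time so the
    model is expected to complete it after the final header."""
    return ALPACA_TEMPLATE.format(
        instruction=instruction, input=input_text, response=response
    )

prompt = format_prompt("Continue the fibonnaci sequence.", "1, 1, 2, 3, 5, 8")
print(prompt)
```

If a fine-tuned model trained on this exact template still emits tokens like `<fim_prefix>` at inference, one thing worth checking is whether the inference-time prompt (including the trailing `### Response:` header) matches the training-time template character for character.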
