SparkNLP 997 Introducing QWEN2Transformer #14188

Draft
wants to merge 3 commits into
base: master

@prabod (Contributor) commented Feb 29, 2024

Description

This PR introduces the Qwen family of LLMs.

Qwen: comprehensive language model series

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model
pretrained on a large amount of data. Compared with the previously released Qwen, the
improvements include:

  • 6 model sizes: 0.5B, 1.8B, 4B, 7B, 14B, and 72B
  • Significant performance improvement in chat models
  • Multilingual support in both base and chat models
  • Stable support for 32K context length across all model sizes

Types of changes

  • New feature (non-breaking change which adds functionality)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@prabod added the new-feature, new model, and DON'T MERGE labels on Feb 29, 2024
@prabod self-assigned this on Feb 29, 2024