
[SLM] Support BERT architecture. Implement a text embedding module #2249

Merged 5 commits into mlc-ai:main on May 10, 2024

Conversation

@rickzx (Contributor) commented Apr 29, 2024

This PR supports text embedding in MLC-LLM with a BERT encoder-only model.

Example usage: https://github.com/rickzx/mlc-llm/blob/18aa7ee378b826a61ce4baa98e4bab1bf3d64038/python/mlc_llm/embeddings/embeddings.ipynb
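For readers unfamiliar with how an encoder-only model like BERT produces a single text embedding: the encoder emits one hidden vector per token, and a pooling step (masked mean pooling is a common choice) collapses them into one fixed-size vector. The sketch below shows only that pooling step in NumPy; it is illustrative and does not reflect MLC-LLM's actual API — the function name and shapes are assumptions for the example.

```python
import numpy as np

def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Masked mean pooling over a BERT-style encoder's token outputs.

    token_embeddings: (batch, seq_len, hidden) per-token hidden states
    attention_mask:   (batch, seq_len), 1 for real tokens, 0 for padding

    Returns: (batch, hidden) sentence embeddings.
    """
    # Broadcast the mask over the hidden dimension so padding
    # positions contribute nothing to the sum.
    mask = attention_mask[:, :, None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=1)
    # Divide by the number of real tokens; clip guards against
    # an all-padding row producing a division by zero.
    counts = mask.sum(axis=1).clip(min=1e-9)
    return summed / counts

# Toy example: batch of 1, seq_len 3 (last position is padding), hidden size 2.
emb = np.array([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = np.array([[1, 1, 0]])
print(mean_pool(emb, mask))  # [[2. 3.]] -- padding token is ignored
```

In practice the pooled vector is often L2-normalized afterwards so that cosine similarity between embeddings reduces to a dot product.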

@tqchen (Contributor) commented Apr 30, 2024

This is a good first step toward embedding support through a Python-level API. It would be great to also think about what it takes to bring this into the ThreadEngine. In that case we would need to support multiple models, but we would also have the opportunity to expose a universal embedding endpoint.

@tqchen (Contributor) commented May 7, 2024

Please fix the Jenkins CI here.

@rickzx (Contributor, Author) commented May 7, 2024

> Please fix the Jenkins CI here.

Should be addressed by #2292. I'm triggering a rebuild now

@rickzx (Contributor, Author) commented May 8, 2024

To fix the CUDA error: apache/tvm#16982

@rickzx rickzx merged commit 459ffe3 into mlc-ai:main May 10, 2024
1 of 2 checks passed
3 participants