-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Upstream encoder/decoder support based on multiple blocktables #161
base: main
Are you sure you want to change the base?
Commits on Feb 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d7f3964 - Browse repository at this point
Copy the full SHA d7f3964View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5574081 - Browse repository at this point
Copy the full SHA 5574081View commit details -
Configuration menu - View commit details
-
Copy full SHA for 344020c - Browse repository at this point
Copy the full SHA 344020cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 95529e3 - Browse repository at this point
Copy the full SHA 95529e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 93dc5a2 - Browse repository at this point
Copy the full SHA 93dc5a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for fd5dcc5 - Browse repository at this point
Copy the full SHA fd5dcc5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c530e2c - Browse repository at this point
Copy the full SHA c530e2cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f32cdd - Browse repository at this point
Copy the full SHA 6f32cddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4caf704 - Browse repository at this point
Copy the full SHA 4caf704View commit details -
Configuration menu - View commit details
-
Copy full SHA for 57f0449 - Browse repository at this point
Copy the full SHA 57f0449View commit details
Commits on Feb 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f7c1234 - Browse repository at this point
Copy the full SHA f7c1234View commit details
Commits on Feb 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ef978fe - Browse repository at this point
Copy the full SHA ef978feView commit details
Commits on Feb 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 70f3e8e - Browse repository at this point
Copy the full SHA 70f3e8eView commit details -
Optimize Triton MoE Kernel (vllm-project#2979)
Co-authored-by: Cade Daniel <edacih@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for cfc15a1 - Browse repository at this point
Copy the full SHA cfc15a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for d6e4a13 - Browse repository at this point
Copy the full SHA d6e4a13View commit details
Commits on Feb 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d9f726c - Browse repository at this point
Copy the full SHA d9f726cView commit details -
Configuration menu - View commit details
-
Copy full SHA for c1c0d00 - Browse repository at this point
Copy the full SHA c1c0d00View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4dd6416 - Browse repository at this point
Copy the full SHA 4dd6416View commit details -
Support Orion model (vllm-project#2539)
Co-authored-by: zhangdacheng <zhangdacheng@ainirobot.com> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Configuration menu - View commit details
-
Copy full SHA for 48a8f4a - Browse repository at this point
Copy the full SHA 48a8f4aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2410e32 - Browse repository at this point
Copy the full SHA 2410e32View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4bd18ec - Browse repository at this point
Copy the full SHA 4bd18ecView commit details -
Configuration menu - View commit details
-
Copy full SHA for e0ade06 - Browse repository at this point
Copy the full SHA e0ade06View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b430d7 - Browse repository at this point
Copy the full SHA 8b430d7View commit details -
Enable GQA support in the prefix prefill kernels (vllm-project#3007)
Signed-off-by: Tao He <sighingnow@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 71bcaf9 - Browse repository at this point
Copy the full SHA 71bcaf9View commit details
Commits on Feb 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a868310 - Browse repository at this point
Copy the full SHA a868310View commit details -
Configuration menu - View commit details
-
Copy full SHA for e46fa5d - Browse repository at this point
Copy the full SHA e46fa5dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b7178c - Browse repository at this point
Copy the full SHA 3b7178cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 929b4f2 - Browse repository at this point
Copy the full SHA 929b4f2View commit details
Commits on Feb 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dd82ba3 - Browse repository at this point
Copy the full SHA dd82ba3View commit details -
Configuration menu - View commit details
-
Copy full SHA for f2fd579 - Browse repository at this point
Copy the full SHA f2fd579View commit details -
Configuration menu - View commit details
-
Copy full SHA for 01a5d18 - Browse repository at this point
Copy the full SHA 01a5d18View commit details -
Configuration menu - View commit details
-
Copy full SHA for a6d471c - Browse repository at this point
Copy the full SHA a6d471cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9289e57 - Browse repository at this point
Copy the full SHA 9289e57View commit details -
Configuration menu - View commit details
-
Copy full SHA for bfdcfa6 - Browse repository at this point
Copy the full SHA bfdcfa6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2fb6905 - Browse repository at this point
Copy the full SHA 2fb6905View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c08ff2 - Browse repository at this point
Copy the full SHA 2c08ff2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 29a8d6a - Browse repository at this point
Copy the full SHA 29a8d6aView commit details -
Add guided decoding for OpenAI API server (vllm-project#2819)
Co-authored-by: br3no <breno@veltefaria.de> Co-authored-by: simon-mo <simon.mo@hey.com>
Configuration menu - View commit details
-
Copy full SHA for 703e42e - Browse repository at this point
Copy the full SHA 703e42eView commit details
Commits on Mar 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 54d3544 - Browse repository at this point
Copy the full SHA 54d3544View commit details -
Configuration menu - View commit details
-
Copy full SHA for 27ca23d - Browse repository at this point
Copy the full SHA 27ca23dView commit details -
docs: Add tutorial on deploying vLLM model with KServe (vllm-project#…
…2586) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 49d849b - Browse repository at this point
Copy the full SHA 49d849bView commit details -
fix relative import path of protocol.py (vllm-project#3134)
Co-authored-by: huohuarong <huohuarong@zuoshouyisheng.com>
Configuration menu - View commit details
-
Copy full SHA for 90fbf12 - Browse repository at this point
Copy the full SHA 90fbf12View commit details -
Configuration menu - View commit details
-
Copy full SHA for be58c3b - Browse repository at this point
Copy the full SHA be58c3bView commit details -
Integrate Marlin Kernels for Int4 GPTQ inference (vllm-project#2497)
Co-authored-by: Robert Shaw <114415538+rib-2@users.noreply.github.com> Co-authored-by: alexm <alexm@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for c0c2335 - Browse repository at this point
Copy the full SHA c0c2335View commit details -
Configuration menu - View commit details
-
Copy full SHA for 82091b8 - Browse repository at this point
Copy the full SHA 82091b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 70837fd - Browse repository at this point
Copy the full SHA 70837fdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 42a6e2b - Browse repository at this point
Copy the full SHA 42a6e2bView commit details -
allow user chose log level by --log-level instead of fixed 'info'. (v…
…llm-project#3109) Co-authored-by: zixiao <shunli.dsl@alibaba-inc.com> Co-authored-by: Simon Mo <simon.mo@hey.com>
Configuration menu - View commit details
-
Copy full SHA for 29e70e3 - Browse repository at this point
Copy the full SHA 29e70e3View commit details
Commits on Mar 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e3fd30d - Browse repository at this point
Copy the full SHA e3fd30dView commit details -
Merge pull request #1 from afeldman-nm/enc_dec_t5
T5 enc/dec example file; linting/formatting
Configuration menu - View commit details
-
Copy full SHA for db726e6 - Browse repository at this point
Copy the full SHA db726e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 43e920e - Browse repository at this point
Copy the full SHA 43e920eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 431f014 - Browse repository at this point
Copy the full SHA 431f014View commit details -
Configuration menu - View commit details
-
Copy full SHA for 37fcf99 - Browse repository at this point
Copy the full SHA 37fcf99View commit details -
Configuration menu - View commit details
-
Copy full SHA for baee28c - Browse repository at this point
Copy the full SHA baee28cView commit details -
Merge pull request #2 from afeldman-nm/enc_dec_t5
Small PR for debug print statements
Configuration menu - View commit details
-
Copy full SHA for 4bf056b - Browse repository at this point
Copy the full SHA 4bf056bView commit details -
Add Automatic Prefix Caching (vllm-project#2762)
Co-authored-by: ElizaWszola <eliza@neuralmagic.com> Co-authored-by: Michael Goin <michael@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for ce4f5a2 - Browse repository at this point
Copy the full SHA ce4f5a2View commit details
Commits on Mar 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d65fac2 - Browse repository at this point
Copy the full SHA d65fac2View commit details -
[FIX] Fix styles in automatic prefix caching & add a automatic prefix…
… caching benchmark (vllm-project#3158)
Configuration menu - View commit details
-
Copy full SHA for 996d095 - Browse repository at this point
Copy the full SHA 996d095View commit details
Commits on Mar 4, 2024
-
Make it easy to profile workers with nsight (vllm-project#3162)
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 17c3103 - Browse repository at this point
Copy the full SHA 17c3103View commit details -
Configuration menu - View commit details
-
Copy full SHA for d0fae88 - Browse repository at this point
Copy the full SHA d0fae88View commit details -
Configuration menu - View commit details
-
Copy full SHA for 901cf4c - Browse repository at this point
Copy the full SHA 901cf4cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 27a7b07 - Browse repository at this point
Copy the full SHA 27a7b07View commit details -
enable --gpu-memory-utilization in benchmark_throughput.py (vllm-proj…
…ect#3175) Co-authored-by: zixiao <shunli.dsl@alibaba-inc.com>
Configuration menu - View commit details
-
Copy full SHA for 9cbc7e5 - Browse repository at this point
Copy the full SHA 9cbc7e5View commit details -
[Minor fix] The domain dns.google may cause a socket.gaierror excepti…
…on (vllm-project#3176) Co-authored-by: guofangze <guofangze@kuaishou.com>
Configuration menu - View commit details
-
Copy full SHA for 76e8a70 - Browse repository at this point
Copy the full SHA 76e8a70View commit details -
Push logprob generation to LLMEngine (vllm-project#3065)
Co-authored-by: Avnish Narayan <avnish@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 22de452 - Browse repository at this point
Copy the full SHA 22de452View commit details -
Add health check, make async Engine more robust (vllm-project#3015)
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for ff578ca - Browse repository at this point
Copy the full SHA ff578caView commit details -
Fix the openai benchmarking requests to work with latest OpenAI apis (v…
…llm-project#2992) Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9a4548b - Browse repository at this point
Copy the full SHA 9a4548bView commit details
Commits on Mar 5, 2024
-
[ROCm] enable cupy in order to enable cudagraph mode for AMD GPUs (vl…
…lm-project#3123) Co-authored-by: lcskrishna <lollachaitanya@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 05af6da - Browse repository at this point
Copy the full SHA 05af6daView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a5060f - Browse repository at this point
Copy the full SHA 8a5060fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 29d6f44 - Browse repository at this point
Copy the full SHA 29d6f44View commit details -
Configuration menu - View commit details
-
Copy full SHA for a4950ba - Browse repository at this point
Copy the full SHA a4950baView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c03760 - Browse repository at this point
Copy the full SHA 9c03760View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8999ec3 - Browse repository at this point
Copy the full SHA 8999ec3View commit details
Commits on Mar 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2efce05 - Browse repository at this point
Copy the full SHA 2efce05View commit details -
Merge pull request #3 from afeldman-nm/enc_dec_t5
fix _make_tensor_with_pad args change which broke decoder scenarios
Configuration menu - View commit details
-
Copy full SHA for 9f20ccf - Browse repository at this point
Copy the full SHA 9f20ccfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 24aecf4 - Browse repository at this point
Copy the full SHA 24aecf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for a33ce60 - Browse repository at this point
Copy the full SHA a33ce60View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4cb3b92 - Browse repository at this point
Copy the full SHA 4cb3b92View commit details
Commits on Mar 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d3c04b6 - Browse repository at this point
Copy the full SHA d3c04b6View commit details -
Update requirements-dev.txt to include package for benchmarking scrip…
…ts. (vllm-project#3181) Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for cbf4c05 - Browse repository at this point
Copy the full SHA cbf4c05View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2daf23a - Browse repository at this point
Copy the full SHA 2daf23aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 385da2d - Browse repository at this point
Copy the full SHA 385da2dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6d6dccd - Browse repository at this point
Copy the full SHA 6d6dccdView commit details -
Possible fix for conflict between Automated Prefix Caching (vllm-proj…
…ect#2762) and multi-LoRA support (vllm-project#1804) (vllm-project#3263)
Configuration menu - View commit details
-
Copy full SHA for 8cbba46 - Browse repository at this point
Copy the full SHA 8cbba46View commit details
Commits on Mar 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b35cc93 - Browse repository at this point
Copy the full SHA b35cc93View commit details -
Configuration menu - View commit details
-
Copy full SHA for d2339d6 - Browse repository at this point
Copy the full SHA d2339d6View commit details -
Configuration menu - View commit details
-
Copy full SHA for c59e120 - Browse repository at this point
Copy the full SHA c59e120View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ece1ae - Browse repository at this point
Copy the full SHA 1ece1aeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 99c3cfb - Browse repository at this point
Copy the full SHA 99c3cfbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1cb0cc2 - Browse repository at this point
Copy the full SHA 1cb0cc2View commit details -
Configuration menu - View commit details
-
Copy full SHA for c2c5e09 - Browse repository at this point
Copy the full SHA c2c5e09View commit details
Commits on Mar 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f48c679 - Browse repository at this point
Copy the full SHA f48c679View commit details -
[Speculative decoding 3/9] Worker which speculates, scores, and appli…
…es rejection sampling (vllm-project#3103)
Configuration menu - View commit details
-
Copy full SHA for 8437bae - Browse repository at this point
Copy the full SHA 8437baeView commit details
Commits on Mar 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0bba88d - Browse repository at this point
Copy the full SHA 0bba88dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e4a28e5 - Browse repository at this point
Copy the full SHA e4a28e5View commit details
Commits on Mar 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9e8744a - Browse repository at this point
Copy the full SHA 9e8744aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b59f00 - Browse repository at this point
Copy the full SHA 4b59f00View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2f8844b - Browse repository at this point
Copy the full SHA 2f8844bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 657061f - Browse repository at this point
Copy the full SHA 657061fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4c92270 - Browse repository at this point
Copy the full SHA 4c92270View commit details -
Configuration menu - View commit details
-
Copy full SHA for c9415c1 - Browse repository at this point
Copy the full SHA c9415c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 654865e - Browse repository at this point
Copy the full SHA 654865eView commit details
Commits on Mar 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 7035178 - Browse repository at this point
Copy the full SHA 7035178View commit details -
Configuration menu - View commit details
-
Copy full SHA for dbec357 - Browse repository at this point
Copy the full SHA dbec357View commit details -
docs: Add BentoML deployment doc (vllm-project#3336)
Signed-off-by: Sherlock113 <sherlockxu07@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for b0925b3 - Browse repository at this point
Copy the full SHA b0925b3View commit details -
llm_engine.py conflict resolution; removed prefix caching code; Seque…
…nce constructor call takes is_encoder_decoder, eos_token_id, lora_request calls; set is_encoder_decoder field in constructor
Configuration menu - View commit details
-
Copy full SHA for 4b2a121 - Browse repository at this point
Copy the full SHA 4b2a121View commit details -
actually updated Sequence constructor to take i_encoder_decoder, eos_…
…token_id, lora_request arguments
Configuration menu - View commit details
-
Copy full SHA for a93c17d - Browse repository at this point
Copy the full SHA a93c17dView commit details -
xformers.py accept incoming changes; replace paged_attention function…
… with import of PagedAttentionImpl
Configuration menu - View commit details
-
Copy full SHA for a62c3af - Browse repository at this point
Copy the full SHA a62c3afView commit details -
Configuration menu - View commit details
-
Copy full SHA for c31921f - Browse repository at this point
Copy the full SHA c31921fView commit details -
attempt at fixing model_runner conflicts related to encoder/decoder &…
… prefix caching; low confidence of success
Configuration menu - View commit details
-
Copy full SHA for 0c78be9 - Browse repository at this point
Copy the full SHA 0c78be9View commit details -
encoder/decoder + prefix caching not supported; moved check from llm.…
…py to model_runner.py
Configuration menu - View commit details
-
Copy full SHA for e25e6b8 - Browse repository at this point
Copy the full SHA e25e6b8View commit details -
refactoring, including: moved enc_dec_attention.py into vllm/model_ex…
…ecutor/layers/attention
Configuration menu - View commit details
-
Copy full SHA for 7f70d76 - Browse repository at this point
Copy the full SHA 7f70d76View commit details -
Configuration menu - View commit details
-
Copy full SHA for 36c8291 - Browse repository at this point
Copy the full SHA 36c8291View commit details -
fixed encoder/decoder reshape and cache bug, but paged attention call…
…s are still incorrect
Configuration menu - View commit details
-
Copy full SHA for 08f268a - Browse repository at this point
Copy the full SHA 08f268aView commit details -
augmented paged attention with context_lens, max_context_len, block_t…
…ables arguments to override input_metadata values; tests still pass but enc/dec still fails
Configuration menu - View commit details
-
Copy full SHA for b9b0600 - Browse repository at this point
Copy the full SHA b9b0600View commit details -
Configuration menu - View commit details
-
Copy full SHA for 63e9dca - Browse repository at this point
Copy the full SHA 63e9dcaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d7e5a8 - Browse repository at this point
Copy the full SHA 4d7e5a8View commit details
Commits on Mar 13, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 49a3c86 - Browse repository at this point
Copy the full SHA 49a3c86View commit details -
Configuration menu - View commit details
-
Copy full SHA for 602358f - Browse repository at this point
Copy the full SHA 602358fView commit details -
[Fix] Fix quantization="gptq" when using Marlin (vllm-project#3319)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Configuration menu - View commit details
-
Copy full SHA for b167109 - Browse repository at this point
Copy the full SHA b167109View commit details -
Configuration menu - View commit details
-
Copy full SHA for e221910 - Browse repository at this point
Copy the full SHA e221910View commit details -
Configuration menu - View commit details
-
Copy full SHA for ba8dc95 - Browse repository at this point
Copy the full SHA ba8dc95View commit details -
Configuration menu - View commit details
-
Copy full SHA for 739c350 - Browse repository at this point
Copy the full SHA 739c350View commit details -
Add missing kernel for CodeLlama-34B on A/H100 (no tensor parallelism…
…) when using Multi-LoRA. (vllm-project#3350)
Configuration menu - View commit details
-
Copy full SHA for ae0ccb4 - Browse repository at this point
Copy the full SHA ae0ccb4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e9bd08 - Browse repository at this point
Copy the full SHA 7e9bd08View commit details -
Configuration menu - View commit details
-
Copy full SHA for c33afd8 - Browse repository at this point
Copy the full SHA c33afd8View commit details -
Configuration menu - View commit details
-
Copy full SHA for eeab52a - Browse repository at this point
Copy the full SHA eeab52aView commit details
Commits on Mar 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 81653d9 - Browse repository at this point
Copy the full SHA 81653d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for a37415c - Browse repository at this point
Copy the full SHA a37415cView commit details -
[Kernel] change benchmark script so that result can be directly used;…
… tune moe kernel in A100/H100 with tp=2,4,8 (vllm-project#3389)
Configuration menu - View commit details
-
Copy full SHA for 8fe8386 - Browse repository at this point
Copy the full SHA 8fe8386View commit details -
Configuration menu - View commit details
-
Copy full SHA for 06ec486 - Browse repository at this point
Copy the full SHA 06ec486View commit details -
Add args for mTLS support (vllm-project#3410)
Co-authored-by: Daniel Clark <daniel.clark@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for c17ca8e - Browse repository at this point
Copy the full SHA c17ca8eView commit details -
Configuration menu - View commit details
-
Copy full SHA for dfc7740 - Browse repository at this point
Copy the full SHA dfc7740View commit details -
Fix assertion failure in Qwen 1.5 with prefix caching enabled (vllm-p…
…roject#3373) Co-authored-by: Cade Daniel <edacih@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 54be8a0 - Browse repository at this point
Copy the full SHA 54be8a0View commit details -
Configuration menu - View commit details
-
Copy full SHA for b983ba3 - Browse repository at this point
Copy the full SHA b983ba3View commit details
Commits on Mar 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 78b6c48 - Browse repository at this point
Copy the full SHA 78b6c48View commit details -
[Misc] add HOST_IP env var (vllm-project#3419)
Co-authored-by: Simon Mo <simon.mo@hey.com>
Configuration menu - View commit details
-
Copy full SHA for b522c44 - Browse repository at this point
Copy the full SHA b522c44View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21539e6 - Browse repository at this point
Copy the full SHA 21539e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 253a980 - Browse repository at this point
Copy the full SHA 253a980View commit details -
Configuration menu - View commit details
-
Copy full SHA for 429284d - Browse repository at this point
Copy the full SHA 429284dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a7c8716 - Browse repository at this point
Copy the full SHA a7c8716View commit details -
[Fix] Add args for mTLS support (vllm-project#3430)
Co-authored-by: declark1 <daniel.clark@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 03d37f2 - Browse repository at this point
Copy the full SHA 03d37f2View commit details -
Fixes the misuse/mixuse of time.time()/time.monotonic() (vllm-project…
…#3220) Signed-off-by: Tao He <sighingnow@gmail.com> Co-authored-by: simon-mo <simon.mo@hey.com>
Configuration menu - View commit details
-
Copy full SHA for 14b8ae0 - Browse repository at this point
Copy the full SHA 14b8ae0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 604f235 - Browse repository at this point
Copy the full SHA 604f235View commit details -
Configuration menu - View commit details
-
Copy full SHA for a7af453 - Browse repository at this point
Copy the full SHA a7af453View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8fa7357 - Browse repository at this point
Copy the full SHA 8fa7357View commit details -
Configuration menu - View commit details
-
Copy full SHA for fb96c1e - Browse repository at this point
Copy the full SHA fb96c1eView commit details
Commits on Mar 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 10585e0 - Browse repository at this point
Copy the full SHA 10585e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for bb7a219 - Browse repository at this point
Copy the full SHA bb7a219View commit details -
[Misc] PR templates (vllm-project#3413)
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 413366e - Browse repository at this point
Copy the full SHA 413366eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0b60121 - Browse repository at this point
Copy the full SHA 0b60121View commit details -
Configuration menu - View commit details
-
Copy full SHA for d44257e - Browse repository at this point
Copy the full SHA d44257eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 19c5c4b - Browse repository at this point
Copy the full SHA 19c5c4bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3123f15 - Browse repository at this point
Copy the full SHA 3123f15View commit details -
Configuration menu - View commit details
-
Copy full SHA for 14e3f9a - Browse repository at this point
Copy the full SHA 14e3f9aView commit details -
Configuration menu - View commit details
-
Copy full SHA for cf6ff18 - Browse repository at this point
Copy the full SHA cf6ff18View commit details -
Configuration menu - View commit details
-
Copy full SHA for ad50bf4 - Browse repository at this point
Copy the full SHA ad50bf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8e67598 - Browse repository at this point
Copy the full SHA 8e67598View commit details -
Configuration menu - View commit details
-
Copy full SHA for 120157f - Browse repository at this point
Copy the full SHA 120157fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b78837 - Browse repository at this point
Copy the full SHA 6b78837View commit details
Commits on Mar 17, 2024
-
[Misc] Use dataclass for InputMetadata (vllm-project#3452)
Co-authored-by: youkaichao <youkaichao@126.com>
Configuration menu - View commit details
-
Copy full SHA for abfc4f3 - Browse repository at this point
Copy the full SHA abfc4f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 93348d9 - Browse repository at this point
Copy the full SHA 93348d9View commit details
Commits on Mar 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9101d83 - Browse repository at this point
Copy the full SHA 9101d83View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8c654c0 - Browse repository at this point
Copy the full SHA 8c654c0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 482b0ad - Browse repository at this point
Copy the full SHA 482b0adView commit details -
Configuration menu - View commit details
-
Copy full SHA for 097aa0e - Browse repository at this point
Copy the full SHA 097aa0eView commit details -
Configuration menu - View commit details
-
Copy full SHA for c0c17d4 - Browse repository at this point
Copy the full SHA c0c17d4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9fdf3de - Browse repository at this point
Copy the full SHA 9fdf3deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 49eedea - Browse repository at this point
Copy the full SHA 49eedeaView commit details -
Configuration menu - View commit details
-
Copy full SHA for b30880a - Browse repository at this point
Copy the full SHA b30880aView commit details
Commits on Mar 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b37cdce - Browse repository at this point
Copy the full SHA b37cdceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a9c583 - Browse repository at this point
Copy the full SHA 6a9c583View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef65dcf - Browse repository at this point
Copy the full SHA ef65dcfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7341c77 - Browse repository at this point
Copy the full SHA 7341c77View commit details -
Configuration menu - View commit details
-
Copy full SHA for c614cfe - Browse repository at this point
Copy the full SHA c614cfeView commit details -
Configuration menu - View commit details
-
Copy full SHA for c2f97b6 - Browse repository at this point
Copy the full SHA c2f97b6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2a60c9b - Browse repository at this point
Copy the full SHA 2a60c9bView commit details -
Configuration menu - View commit details
-
Copy full SHA for cc63d03 - Browse repository at this point
Copy the full SHA cc63d03View commit details -
Configuration menu - View commit details
-
Copy full SHA for 63e8b28 - Browse repository at this point
Copy the full SHA 63e8b28View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0536ff5 - Browse repository at this point
Copy the full SHA 0536ff5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 20478c4 - Browse repository at this point
Copy the full SHA 20478c4View commit details
Commits on Mar 20, 2024
-
[PREFIX CACHING FOLLOW UP] A bunch of fixes to block allocator perfor…
…mance when automatic prefix caching is disabled (vllm-project#3357) Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 9474e89 - Browse repository at this point
Copy the full SHA 9474e89View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4ad521d - Browse repository at this point
Copy the full SHA 4ad521dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ee1449 - Browse repository at this point
Copy the full SHA 5ee1449View commit details -
Configuration menu - View commit details
-
Copy full SHA for 84eaa68 - Browse repository at this point
Copy the full SHA 84eaa68View commit details -
Configuration menu - View commit details
-
Copy full SHA for ba8ae1d - Browse repository at this point
Copy the full SHA ba8ae1dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 80e2548 - Browse repository at this point
Copy the full SHA 80e2548View commit details -
[1/n] Triton sampling kernel (vllm-project#3186)
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 426ec4e - Browse repository at this point
Copy the full SHA 426ec4eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e435de - Browse repository at this point
Copy the full SHA 6e435deView commit details -
Configuration menu - View commit details
-
Copy full SHA for f1c0fc3 - Browse repository at this point
Copy the full SHA f1c0fc3View commit details
Commits on Mar 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 523e30e - Browse repository at this point
Copy the full SHA 523e30eView commit details -
[PREFIX CACHING FOLLOW UP] OrderedDict-based evictor (vllm-project#3431)
Co-authored-by: rsnm2 <rshaw@neuralmagic.com> Co-authored-by: Luka <luka@paperspace>
Configuration menu - View commit details
-
Copy full SHA for 6ebd02b - Browse repository at this point
Copy the full SHA 6ebd02bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3bbff9e - Browse repository at this point
Copy the full SHA 3bbff9eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4c07dd2 - Browse repository at this point
Copy the full SHA 4c07dd2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8657323 - Browse repository at this point
Copy the full SHA 8657323View commit details -
[Misc] Bump up transformers to v4.39.0 & Remove StarCoder2Config (vll…
…m-project#3551) Co-authored-by: Roy <jasonailu87@gmail.com> Co-authored-by: Roger Meier <r.meier@siemens.com>
Configuration menu - View commit details
-
Copy full SHA for c188ecb - Browse repository at this point
Copy the full SHA c188ecbView commit details -
Configuration menu - View commit details
-
Copy full SHA for b7050ca - Browse repository at this point
Copy the full SHA b7050caView commit details
Commits on Mar 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ea5f14e - Browse repository at this point
Copy the full SHA ea5f14eView commit details -
Configuration menu - View commit details
-
Copy full SHA for e90fc21 - Browse repository at this point
Copy the full SHA e90fc21View commit details -
Configuration menu - View commit details
-
Copy full SHA for f721096 - Browse repository at this point
Copy the full SHA f721096View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d4972c - Browse repository at this point
Copy the full SHA 7d4972cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 23a5da5 - Browse repository at this point
Copy the full SHA 23a5da5View commit details -
Configuration menu - View commit details
-
Copy full SHA for e32fb9c - Browse repository at this point
Copy the full SHA e32fb9cView commit details -
Configuration menu - View commit details
-
Copy full SHA for ae1c368 - Browse repository at this point
Copy the full SHA ae1c368View commit details -
Configuration menu - View commit details
-
Copy full SHA for 691c2c1 - Browse repository at this point
Copy the full SHA 691c2c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e240eb4 - Browse repository at this point
Copy the full SHA e240eb4View commit details -
t5 Sampler does not pass vocab size to constructor; input_metadata.pr…
…ompt_lens is treated as a list in T5
Configuration menu - View commit details
-
Copy full SHA for cbfba8e - Browse repository at this point
Copy the full SHA cbfba8eView commit details -
add_request now correctly swaps decoder_prompt, prompt in encoder/dec…
…oder mode; removed encoder/decoder argument of Sequence
Configuration menu - View commit details
-
Copy full SHA for 501551c - Browse repository at this point
Copy the full SHA 501551cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 08435e4 - Browse repository at this point
Copy the full SHA 08435e4View commit details
Commits on Mar 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6e459a2 - Browse repository at this point
Copy the full SHA 6e459a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8e1ca33 - Browse repository at this point
Copy the full SHA 8e1ca33View commit details -
Configuration menu - View commit details
-
Copy full SHA for e097732 - Browse repository at this point
Copy the full SHA e097732View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2a44585 - Browse repository at this point
Copy the full SHA 2a44585View commit details
Commits on Mar 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 91a4608 - Browse repository at this point
Copy the full SHA 91a4608View commit details
Commits on Mar 27, 2024
-
inefficient but effective & Attention-wrapper-compatible implementati…
…on of relative position encoding based on packed-variable-length-sequences
Configuration menu - View commit details
-
Copy full SHA for d0c5e36 - Browse repository at this point
Copy the full SHA d0c5e36View commit details
Commits on Mar 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3737d5b - Browse repository at this point
Copy the full SHA 3737d5bView commit details
Commits on Apr 1, 2024
-
first pass at enc/dec support that runs e2e but doesn't produce corre…
…ct T5 inference result. Nothing is broken by this commit, unless there is a subsequent commit with changes in order to pass regression tests.
Configuration menu - View commit details
-
Copy full SHA for 38946ed - Browse repository at this point
Copy the full SHA 38946edView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3c39f55 - Browse repository at this point
Copy the full SHA 3c39f55View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4ec2fde - Browse repository at this point
Copy the full SHA 4ec2fdeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 38f55ed - Browse repository at this point
Copy the full SHA 38f55edView commit details -
intermediate activations for prompt_run look right! Decoded token loo…
…ks wrong though. Added not_causal option for attn_bias to kernel interface contracts; also switched to batch size 1 to avoid incorrectness likely caused by packed-variable-sequence-length mask having zeroes rather than -inf's
Configuration menu - View commit details
-
Copy full SHA for 1aedc80 - Browse repository at this point
Copy the full SHA 1aedc80View commit details
Commits on Apr 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c1258b4 - Browse repository at this point
Copy the full SHA c1258b4View commit details
Commits on Apr 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0af1022 - Browse repository at this point
Copy the full SHA 0af1022View commit details
Commits on Apr 4, 2024
-
vLLM T5 matches nativegit status! fixes: decode-phase cross-input-met…
…adata has correct blocktable, slot_mapping=None, and correct (max) context length(s) (derived from prompt); decode-phase decoder self-attention relative position encoding mask has 1 x K geometry where 1 is the number of new tokens generated in a step and K is context length padded to the nearest multiple of block size, and also mask is reshuffled with contiguous (); ensured general correctness of cross-attention input_metadata; modified T5 example script to prevent HF/vLLM T5 instances from being length limited; net effect: batch-size 1 seems to work but batch-size >1 not supported
Configuration menu - View commit details
-
Copy full SHA for 9e8d234 - Browse repository at this point
Copy the full SHA 9e8d234View commit details -
Configuration menu - View commit details
-
Copy full SHA for f5242a0 - Browse repository at this point
Copy the full SHA f5242a0View commit details -
Configuration menu - View commit details
-
Copy full SHA for de0fd31 - Browse repository at this point
Copy the full SHA de0fd31View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5a67647 - Browse repository at this point
Copy the full SHA 5a67647View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed05d47 - Browse repository at this point
Copy the full SHA ed05d47View commit details
Commits on Apr 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d5a8b92 - Browse repository at this point
Copy the full SHA d5a8b92View commit details
Commits on Apr 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f555f5d - Browse repository at this point
Copy the full SHA f555f5dView commit details
Commits on Apr 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2c12b44 - Browse repository at this point
Copy the full SHA 2c12b44View commit details -
Configuration menu - View commit details
-
Copy full SHA for dba02b2 - Browse repository at this point
Copy the full SHA dba02b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for db201b6 - Browse repository at this point
Copy the full SHA db201b6View commit details