Highlights
- Pro
Block or Report
Block or report austin362667
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
-
cccriscv/mini-riscv-os
cccriscv/mini-riscv-os PublicBuild a minimal multi-tasking OS kernel for RISC-V from scratch
-
Roofline model
Roofline model 1[Roofline: An Insightful Visual Performance Model
2for Floating-Point Programs and Multicore Architectures](https://people.eecs.berkeley.edu/~kubitron/cs252/handouts/papers/RooflineVyNoYellow.pdf)
34## Arithmetic(Operational) `Intensity` = `Work`/`Memory Traffic`
5 -
Transoformer QA
Transoformer QA 1[Large-Scale Pretraining with Transformers](https://d2l.ai/chapter_attention-mechanisms-and-transformers/large-pretraining-transformers.html)
23更 high level 來看, BERT 是用了 Transformer 的 encoder; GPT 則是用了 Transformer 的 decoder.
451. 注意力機制中 Q, K, V 意義上是什麼, 是如何產生的?
-
script-one/script1
script-one/script1 PublicScript1 is a programming language that run everywhere and call any library.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.