Performance-Portable Particle-in-Cell Simulations for the Exascale Era ✨
-
Updated
Jun 11, 2024 - C++
Performance-Portable Particle-in-Cell Simulations for the Exascale Era ✨
Productive, portable, and performant GPU programming in Python.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Advanced High Performance Computing in C with OpenMP, CUDA, MPI and NCCL. The folder project includes my final project for the special course. I implemented a Jacobi-solver for the Poisson partial differential problem both using OpenMP in the CPU, using CUDA on the GPU and using CUDA, MPI and NCCL on multiple GPUs.
Suite of python packages for multiparticle simulations of particle accelerators.
✨ Zero-code distributed tracing and profiling, observability via eBPF 🚀
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, Axolotl, etc.
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
A small OpenCL benchmark program to measure peak GPU/CPU performance.
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
(GPU accelerated) Multi-arch (linux/amd64, linux/arm64/v8) JupyterLab Julia docker images. Please submit Pull Requests to the GitLab repository. Mirror of
Developer kits best known configurations setup scripts for various kinds of Intel platforms and GPUs
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."