Skip to content
View c0deJedi's full-sized avatar

Block or report c0deJedi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
c0deJedi/README.md

Sean Lobjoit

Cloud and AI practitioner. Founder of Perpetual Squared.

I help organisations cut through hype and build AI and cloud systems that actually work. I'm also the type of guy to spend a weekend figuring out how to run a 14B LLM on a laptop with soldered RAM and 8GB VRAM, just because it's an interesting opportunity to solve for.

Enterprise

  • Cloud architecture: AWS, GCP, Azure
  • AI/LLM systems and agent frameworks
  • Platform engineering and DevSecOps
  • MLOps and AI infrastructure
  • Enterprise architecture and cost optimisation at scale

Projects

I work in a variety of domains professionally but in my spare time been looking into solving some hardware limitations I've run into:

  • nbd-vram - CUDA daemon that backs an NBD block device with GPU VRAM. Turns your graphics card into a 1.3 GB/s swap device. No kernel module, no P2P API - just dlopen(libcuda.so) and a Unix socket.
  • llm-fit - LD_PRELOAD hook that redirects cudaMalloc to cudaMallocManaged. Hypothesis: models slightly over VRAM capacity load fully on GPU via CUDA Unified Memory. Finding: works in a narrow ~200-300MB sweet spot. Beyond that, PCIe page migration is slower than Ollama's native CPU split.

Writing

No-fluff takes on cloud strategy, AI architecture, and the decisions that separate good systems from great ones.

seanlobjoit.com · LinkedIn

Pinned Loading

  1. nbd-vram nbd-vram Public

    Use your NVIDIA GPU's VRAM as swap space on Linux. Built for laptops with soldered memory and no upgrade path. If you have an RTX card sitting there with 8GB of VRAM and you're getting swapped to S…

    C 181 3

  2. llm-fit llm-fit Public

    Research into CUDA Unified Memory as a VRAM extension for LLM inference

    C 1