dmaivel

posts

outperform cublas with opengl #programming, #research
May 5, 2024 • 4541 words • 22 mins

Compute libraries, like CUDA and OpenCL, are responsible for handling the compute pipeline over the GPU, offering acceleration for intensive mathematical routines like matrix multiplication. Compute has even been introduced in graphics libraries as an independent pipeline, including OpenGL and Vulkan, in the form of compute shaders. But, do we really need …
virtualization in usermode #assembly, #x86-internals
Jun 17, 2023 • 1727 words • 9 mins

Is it possible to execute a kernel completely in usermode? Well, the short answer seems to be no, as a few issues become immediately apparent. How will we execute privileged instructions? How will memory be addressed?

With a few hacks and tricks, its becomes apparent that there is some possibility. Note that this article assumes the host machine is running …

SharedGL #opengl, #ivshmem
OpenGL 4.6 for Windows/Linux guests in QEMU/KVM via shared memory or sockets
libdecomp #decompiler
library for decompiling multi-architecture bytecode into optimized source code
covirt #virtualizer, #obfuscator
x86-64 code virtualizer w/ various obfuscation passes, including MBA
ntoseye #debugger, #windows-internals
windows kernel debugger for Linux hosts running Windows under QEMU/KVM
vscc #compiler-backend, #jit
x86-64 JIT compiler backend with no third party dependencies
glBLAS #opengl, #linear-algebra
BLAS functions written in OpenGL fragment shaders, challenging cuBLAS
cugrad #cuda, #autodiff
automatic differentiation library written in C++ and CUDA from scratch