Hello. A few of us are looking to fund a short term contract for someone knowledgeable about Clojure CUDA programming to build an inferencing engine that runs LLM models similar to LLaMA on the GPU. The reference Python code is <300 lines, so for someone already good at CUDA, a Clojure port would probably be doable in a few days. Please PM me if interested.