GPU-accelerated Llama3.java inference in pure Java using TornadoVM (github.com)
47 points by pjmlp 4 days ago