Posts with categories “arXiv”

Llamas on the Web: Memory-Efficient, Performance-Portable, and Multi-Precision LLM Inference with WebGPU

LlamaWeb brings memory-efficient, performance-portable, multi-precision LLM inference to the browser with a WebGPU backend for llama.cpp, reducing memory use and improving decode throughput across diverse devices.

May 20, 2026 ✦ By Reese Levine, Rithik Sharma, Nikhil Jain, Abhijit Ramesh, Zheyuan Chen, Neha Abbas, James Contini and Tyler Sorensen

categories:

arXiv

sqlelf: a SQL-centric Approach to ELF Analysis

sqlelf models ELF objects as relational databases, enabling expressive SQL queries, aggregation, and cross-object analysis for more accessible and efficient ELF exploration.

May 06, 2024 ✦ By Farid Zakaria, Zheyuan Chen, Andrew Quinn and Thomas R. W. Scogland

categories:

arXiv