Edge AI – Deployment of LLMs on Edge Devices via Hybrid Compression Techniques
Summary
Worked on a research project proposing a novel hybrid compression technique that combines structured pruning with data-aware low-rank decomposition to deploy Large Language Models (LLMs) efficiently on resource-constrained IoT devices.
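
To illustrate the general idea, the sketch below applies the two steps to a single weight matrix with NumPy. It is not the project's actual method: the function names `prune_rows` and `data_aware_low_rank`, the row-norm pruning criterion, and the Cholesky-whitened SVD are all assumptions chosen as one common way to realize structured pruning and activation-aware factorization.

```python
import numpy as np

def prune_rows(W, keep_ratio):
    """Structured pruning (assumed criterion): drop whole output rows
    with the smallest L2 norm and keep the rest in original order."""
    n_keep = max(1, int(round(W.shape[0] * keep_ratio)))
    norms = np.linalg.norm(W, axis=1)
    keep = np.sort(np.argsort(norms)[-n_keep:])
    return W[keep], keep

def data_aware_low_rank(W, X, rank, eps=1e-6):
    """Factor W ~= A @ B so that the error on calibration activations,
    ||W X - A B X||_F, is small. X has shape (in_features, n_samples)."""
    # Regularized activation covariance and its Cholesky factor: cov = S S^T
    cov = X @ X.T / X.shape[1] + eps * np.eye(X.shape[0])
    S = np.linalg.cholesky(cov)
    # Truncated SVD of the whitened weights W S
    U, s, Vt = np.linalg.svd(W @ S, full_matrices=False)
    A = U[:, :rank] * s[:rank]                 # (out, rank)
    B = np.linalg.solve(S.T, Vt[:rank].T).T    # Vt_r @ S^{-1}, (rank, in)
    return A, B

# Hypothetical layer and calibration data, for illustration only
rng = np.random.default_rng(0)
W = rng.standard_normal((16, 12))
X = np.linspace(0.05, 3.0, 12)[:, None] * rng.standard_normal((12, 200))

W_pruned, kept_rows = prune_rows(W, keep_ratio=0.5)
A, B = data_aware_low_rank(W_pruned, X, rank=3)
```

In this toy setting the pruned 8 x 12 matrix is stored as an 8 x 3 and a 3 x 12 factor, and the layer output is computed as `A @ (B @ x)`. Weighting the decomposition by the activation covariance concentrates the approximation error in input directions that the calibration data rarely excites, which is the motivation usually given for making the factorization data-aware.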