Smarter, Leaner, Faster: Practical Efficiency For Democratizing Large Language Models