A complete, production-ready implementation of a hybrid optimization framework for ultra-low-bit LLM training with GPU/CPU optimization support. This system combines: ...