NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs

Open in new window