SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling

Open in new window