Edge Caching Optimization with PPO and Transfer Learning for Dynamic Environments