Towards Agentic Self-Learning LLMs in Search Environment