Agents de connaissance via RL