Browsing: Policy Gradient Algorithm