Browsing: Multimodal Learning for Cross-modal Understanding