Learning Generalizable And Transferable Representations Across Domains And Modalities