Seguir
Zhe Gan
Zhe Gan
Research Scientist, Apple
Dirección de correo verificada de apple.com - Página principal
Título
Citado por
Citado por
Año
Uniter: Universal image-text representation learning
YC Chen, L Li, L Yu, A El Kholy, F Ahmed, Z Gan, Y Cheng, J Liu
European Conference on Computer Vision, 104-120, 2020
1812*2020
Attngan: Fine-grained text to image generation with attentional generative adversarial networks
T Xu, P Zhang, Q Huang, H Zhang, Z Gan, X Huang, X He
Proceedings of the IEEE conference on computer vision and pattern …, 2018
15702018
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Y Pu, Z Gan, R Henao, X Yuan, C Li, A Stevens, L Carin
NIPS, 2016
8312016
Patient knowledge distillation for bert model compression
S Sun, Y Cheng, Z Gan, J Liu
arXiv preprint arXiv:1908.09355, 2019
6522019
Semantic compositional networks for visual captioning
Z Gan, C Gan, X He, Y Pu, K Tran, J Gao, L Carin, L Deng
Proceedings of the IEEE conference on computer vision and pattern …, 2017
4852017
Less is more: Clipbert for video-and-language learning via sparse sampling
J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
4242021
Adversarial feature matching for text generation
Y Zhang, Z Gan, K Fan, Z Chen, R Henao, D Shen, L Carin
International conference on machine learning, 4006-4015, 2017
4002017
Freelb: Enhanced adversarial training for natural language understanding
C Zhu, Y Cheng, Z Gan, S Sun, T Goldstein, J Liu
International Conference on Learning Representations, 2020
3722020
Large-scale adversarial training for vision-and-language representation learning
Z Gan, YC Chen, L Li, C Zhu, Y Cheng, J Liu
Advances in Neural Information Processing Systems 33, 6616-6628, 2020
3702020
Hero: Hierarchical encoder for video+ language omni-representation pre-training
L Li, YC Chen, Y Cheng, Z Gan, L Yu, J Liu
arXiv preprint arXiv:2005.00200, 2020
3642020
Stylenet: Generating attractive visual captions with styles
C Gan, Z Gan, X He, J Gao, L Deng
Proceedings of the IEEE conference on computer vision and pattern …, 2017
3272017
Relation-aware graph attention network for visual question answering
L Li, Z Gan, Y Cheng, J Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
3152019
Generating informative and diverse conversational responses via adversarial information maximization
Y Zhang, M Galley, J Gao, Z Gan, X Li, C Brockett, B Dolan
Advances in Neural Information Processing Systems 31, 2018
2682018
Discourse-aware neural extractive text summarization
J Xu, Z Gan, Y Cheng, J Liu
arXiv preprint arXiv:1910.14142, 2019
2412019
An empirical study of training end-to-end vision-and-language transformers
ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
2032022
Generating Text via Adversarial Training
Y Zhang, Z Gan, L Carin
Workshop on Adversarial Training, NIPS 2016, 2016
1952016
Club: A contrastive log-ratio upper bound of mutual information
P Cheng, W Hao, S Dai, J Liu, Z Gan, L Carin
International conference on machine learning, 1779-1788, 2020
1802020
Hierarchical graph network for multi-hop question answering
Y Fang, S Sun, Z Gan, R Pillai, S Wang, J Liu
arXiv preprint arXiv:1911.03631, 2019
1792019
Git: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
arXiv preprint arXiv:2205.14100, 2022
1612022
Tactical rewind: Self-correction via backtracking in vision-and-language navigation
L Ke, X Li, Y Bisk, A Holtzman, Z Gan, J Liu, J Gao, Y Choi, S Srinivasa
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
1542019
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20