Seguir
Kevin Lin
Kevin Lin
Dirección de correo verificada de microsoft.com - Página principal
Título
Citado por
Citado por
Año
Deep learning of binary hash codes for fast image retrieval
K Lin, HF Yang, JH Hsiao, CS Chen
IEEE Conference on Computer Vision and Pattern Recognition Workshops, 27-35, 2015
7742015
End-to-end human pose and mesh reconstruction with transformers
K Lin, L Wang, Z Liu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1954-1963, 2021
7382021
GIT: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
Transactions on Machine Learning Research (TMLR), 2022
5282022
The dawn of lmms: Preliminary explorations with gpt-4v (ision)
Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang
arXiv preprint arXiv:2309.17421, 2023
5042023
Adversarial ranking for language generation
K Lin, D Li, X He, Z Zhang, MT Sun
Advances in Neural Information Processing Systems (NeurIPS), 3158-3168, 2017
4392017
Learning compact binary descriptors with unsupervised deep neural networks
K Lin, J Lu, CS Chen, J Zhou
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1183-1192, 2016
4252016
Supervised learning of semantics-preserving hash via deep convolutional neural networks
HF Yang, K Lin, CS Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (2), 437-451, 2018
4062018
Mm-vet: Evaluating large multimodal models for integrated capabilities
W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang
ICML 2024, 2024
3962024
Mesh graphormer
K Lin, L Wang, Z Liu
IEEE/CVF International Conference on Computer Vision (ICCV), 12939-12948, 2021
3532021
Mitigating hallucination in large multi-modal models via robust instruction tuning
F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang
ICLR 2024, 2024
340*2024
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
3192023
SwinBERT: End-to-end transformers with sparse attention for video captioning
K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17949 …, 2022
2722022
VIOLET: End-to-end video-language transformers with masked visual-token modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv preprint arXiv:2111.12681, 2021
2142021
Reco: Region-controlled text-to-image generation
Z Yang, J Wang, Z Gan, L Li, K Lin, C Wu, N Duan, Z Liu, C Liu, M Zeng, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14246 …, 2023
1262023
Vivo: Visual vocabulary pre-training for novel object captioning
X Hu, X Yin, K Lin, L Zhang, J Gao, L Wang, Z Liu
Proceedings of the AAAI Conference on Artificial Intelligence, 1575-1583, 2021
119*2021
Abandoned object detection via temporal consistency modeling and back-tracing verification for visual surveillance
K Lin, SC Chen, CS Chen, DTD Lin, YP Hung
IEEE Transactions on Information Forensic and Security 10 (7), 1359-1370, 2015
1192015
Cross-domain complementary learning using pose for multi-person part segmentation
K Lin, L Wang, K Luo, Y Chen, Z Liu, MT Sun
IEEE Transactions on Circuits and Systems for Video Technology 31 (3), 1066 …, 2020
992020
Disco: Disentangled control for realistic human dance generation
T Wang, L Li, K Lin, Y Zhai, CC Lin, Z Yang, H Zhang, Z Liu, L Wang
CVPR 2024, 2024
91*2024
Rapid clothing retrieval via deep learning of binary codes and hierarchical search
K Lin, HF Yang, KH Liu, JH Hsiao, CS Chen
ACM International Conference on Multimedia Retrieval (ICMR), 499–502, 2015
912015
Lavender: Unifying video-language understanding as masked language modeling
L Li, Z Gan, K Lin, CC Lin, Z Liu, C Liu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 23119 …, 2023
862023
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20