- 🌱 I’m currently an AI Resident at FPT Software AI Center (AIC) and a former AI Engineer at Data & AI Lab (DAL), VNG Corporation.
Research topics:
- Large Multimodal Models Reasoning
- Multimodal (Vision-Language) Compositionality
- Efficient (Large) Multimodal Models: Parameter-Efficient Fine-Tuning (PEFT), Small models, Knowledge Distillation.
My current research experience spans Intelligent Industrial Systems, Multimodal Learning, and Image/Video Understanding, including:
- [2023-Present] Efficient Cross-Modal Learning & Understanding: Video-Language Matching, Parameter-Efficient Fine-Tuning (PEFT), Multimodal Compositionality, Structured Representation (Scene Graph Generation).
- [2021-2023] Intelligent Industrial/Traffic Systems Applications: Tracked-Vehicle to Video Retrieval, Person/Vehicle Re-Identification, Person/Vehicle Tracking, Face Recognition/Verification.