Skip to main content
Flow Simulation at the Exascale
Flow Simulation at the Exascale
Main navigation
Home
People
All Profiles
Organizers
Speakers
Events
All Events
Events Calendar
News
multimodal alignment
Towards Scalable and Efficient Semantic Video Search
Mattia Soldan, Ph.D., Electrical and Computer Engineering
Jul 13, 18:00
-
19:00
B4 L5 R5209
video-language grounding
semantic video retrieval
multimodal alignment
This dissertation advances fine-grained, content-aware video retrieval by developing novel models and frameworks for Video-Language Grounding, enabling accurate alignment between natural language queries and specific temporal segments in unstructured video content.