Highlights
Timeline Construction
We discover that temporal information forms a low-dimensional manifold in VLM embedding spaces. Our Bézier curve approach explicitly models chronological progression, enabling efficient temporal inference through geometric timeline representations.
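As a rough illustration of the timeline idea, the sketch below evaluates a Bézier curve in an embedding space and reads a year off the curve parameter of the point nearest to a query embedding. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `bezier_points` and `predict_year`, the dense-sampling nearest-point search, and the linear mapping from curve parameter to year are all hypothetical choices made for illustration, and the control points are assumed to have been fitted already.

```python
import numpy as np
from math import comb


def bezier_points(control_points, ts):
    """Evaluate a Bezier curve at parameters ts using the Bernstein basis.

    control_points: array of shape (n + 1, d), one row per control point.
    ts: array of shape (m,), curve parameters in [0, 1].
    Returns an array of shape (m, d).
    """
    control_points = np.asarray(control_points, dtype=float)
    ts = np.asarray(ts, dtype=float)
    n = len(control_points) - 1
    # Bernstein basis: B_{i,n}(t) = C(n, i) * t^i * (1 - t)^(n - i)
    basis = np.stack(
        [comb(n, i) * ts**i * (1.0 - ts) ** (n - i) for i in range(n + 1)],
        axis=1,
    )
    return basis @ control_points


def predict_year(embedding, control_points,
                 year_min=1715, year_max=2024, samples=1001):
    """Map an embedding to a year via its nearest point on the curve.

    Assumption (not from the source): the parameter t in [0, 1] maps
    linearly onto [year_min, year_max].
    """
    ts = np.linspace(0.0, 1.0, samples)
    curve = bezier_points(control_points, ts)
    # Nearest sampled curve point in Euclidean distance.
    dists = np.linalg.norm(curve - np.asarray(embedding, dtype=float), axis=1)
    t_star = ts[np.argmin(dists)]
    return year_min + t_star * (year_max - year_min)
```

With fitted control points, `predict_year(image_embedding, control_points)` would return a year estimate for one image. The dense sampling keeps the sketch simple; a real system might refine the nearest parameter with a local optimizer instead.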
TIME10k Dataset
A comprehensive benchmark with 10,091 temporally annotated images spanning 309 years (1715-2024) across 6 object categories: Aircraft, Cars, Mobile Phones, Music Instruments, Ships, and Weapons & Ammunition.
Time Probing Benchmark
A systematic evaluation of 37 state-of-the-art VLMs shows that their embeddings encode substantial temporal information. EVA-CLIP and OpenCLIP perform best, with a Mean Absolute Error of 6.2-6.3 years and a Time Awareness Index of 0.85-0.86.
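For readers unfamiliar with the metric, Mean Absolute Error in years is the average absolute gap between predicted and annotated years. The helper below is a minimal sketch; the function name `mean_absolute_error_years` is hypothetical, and the numbers in the usage note are made up for illustration rather than taken from the benchmark. The Time Awareness Index is not defined in this section, so it is not reproduced here.

```python
import numpy as np


def mean_absolute_error_years(predicted_years, true_years):
    """Return the mean absolute difference, in years, between two sequences."""
    predicted = np.asarray(predicted_years, dtype=float)
    true = np.asarray(true_years, dtype=float)
    if predicted.shape != true.shape:
        raise ValueError("predicted_years and true_years must have the same shape")
    return float(np.mean(np.abs(predicted - true)))
```

For example, predictions of 1950, 2000, and 1900 against annotations of 1955, 1990, and 1900 have absolute errors of 5, 10, and 0 years, giving an MAE of 5.0 years.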
Temporal Manifold Discovery
This is the first demonstration that temporal information can be represented as a roughly 13-dimensional non-linear manifold within high-dimensional VLM embedding spaces, which enables both analysis and practical applications.
Key Contributions
Dataset Statistics
Music Instruments: 436 images (1715-2009) | Ships: 841 images (1744-1999) | Weapons & Ammo: 15 images (1939-2003)